/
RC Community Datasets

RC Community Datasets

Overview

There is a growing number of large public datasets that researchers rely on to conduct their work. Some of these datasets are utilized by different research groups or even different research fields, and as such, they are downloaded and hosted on the supercomputer in multiple file system locations.

To reduce the global load on the supercomputer’s shared filesystem and to foster data collaboration, we are pleased to consider hosting datasets in a shared location, specifically under /data/datasets.

Below is a table detailing the current datasets hosted for public use on the system. If you are interested in contributing to this community collection, please contact us with your request by reviewing our RTO Request Help page.

Current Community Datasets

Name

Path

Short Description

Name

Path

Short Description

HuggingFace

/data/datasets/community/huggingface

Popular huggingface models and datasets.

ImageNet

/data/datasets/community/deeplearning/imagenet

Image database organized by the WordNet hierarchy http://www.image-net.org/

LaSOT

/data/datasets/community/compvis/LaSOT_categories

Large-scale Single Object Tracking (LaSOT) provides a collection of thousands of sequences with millions of frames across 70 categories. Each sequence is manually annotated LaSOT - Large-scale Single Object Tracking

Benchmark results are given here: GitHub - HengLan/LaSOT_Evaluation_Toolkit: Evaluation of trackers on a large-scale benchmark LaSOT.

-

/data/datasets/community/compvis/OTB100

Data from the benchmark evaluation of online visual tracking algorithms (see: http://cvlab.hanyang.ac.kr/tracker_benchmark)

Raw zips and resulting images are hosted.

-

/data/datasets/community/compvis/TrackingNet

Work-in-progress

BLAST

/data/datasets/community/blast

The complete BLAST databases, currently at version 5, updated on 2025-01-31. It will be regularly updated. Please check the database names here:
https://ftp.ncbi.nlm.nih.gov/blast/db/v5/

Additional Help

Related content

Application Help
Application Help
Read with this
Community Datasets
Community Datasets
More like this
Connecting to the Supercomputers with SSH
Connecting to the Supercomputers with SSH
Read with this
Workshop Materials
Workshop Materials
More like this
Home Directory Cleanup Guide
Home Directory Cleanup Guide
Read with this
2023 Research Computing Expo
2023 Research Computing Expo
More like this