TikTok-10M Dataset

We just released our first dataset of 10M highly curated TikTok videos. Would love feedback!

If you are training or finetuning video models and need curated datasets, feel free to reachout!

2 Likes

i would love a link

1 Like
1 Like

The-data-company/TikTok-10M on hf

1 Like

I tried loading it using datasets.load_dataset and it worked after grabbing the CSV link directly from the GitHub repo. Make sure to use streaming=True if memory is an issue.

1 Like