3 17 22

AlphaSue

AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

DABStep: Data Agent Benchmark for Multi-step Reasoning

upvoted an article 2 months ago

The 4 Things Qwen-3’s Chat Template Teaches Us

upvoted a paper 3 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

View all activity

Organizations

None yet

liked 3 models 7 months ago

liked a Space 8 months ago

125

TxT360: Trillion Extracted Text

📖

Explore and utilize a large, deduplicated text dataset for LLM training

liked a model 9 months ago

jinaai/ReaderLM-v2

Text Generation • 2B • Updated Mar 4 • 9.09k • • 723

liked a Space 9 months ago

3.45k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 11 months ago

microsoft/RedStone

Updated Dec 5, 2024 • 15 • 35

liked a model 11 months ago

open-web-math/filtering-models

Updated Nov 2, 2023 • 9

liked a dataset 11 months ago

m-a-p/FineFineWeb

Viewer • Updated Dec 19, 2024 • 4.89B • 426k • 84

liked 2 models about 1 year ago

nvidia/quality-classifier-deberta

Updated Sep 22 • 2.84k • 73

oliverguhr/fullstop-punctuation-multilang-large

Token Classification • 0.6B • Updated Nov 16, 2023 • 1.27M • • 171

liked a dataset over 1 year ago

teknium/OpenHermes-2.5

Viewer • Updated Apr 15, 2024 • 1M • 3.71k • 762

liked a model over 1 year ago

Snowflake/snowflake-arctic-embed-m

liked a Space over 1 year ago

1.16k

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality text data for LLMs using FineWeb

liked 4 datasets over 1 year ago

liwu/MNBVC

Updated 4 days ago • 90.9k • 563

togethercomputer/RedPajama-Data-1T

Viewer • Updated Jun 17, 2024 • 1.73M • 1.25k • 1.11k

allenai/dolma

Updated Apr 17, 2024 • 1.16k • 955

HuggingFaceFW/fineweb

Viewer • Updated Jul 11 • 52.5B • 309k • 2.43k

liked a Space about 2 years ago

1.16k

ControlNet V1.1

📉

Transform images using various artistic effects

liked a model over 2 years ago

TheBloke/Llama-2-7B-Chat-GGML

Text Generation • Updated Sep 27, 2023 • 1.25k • 871