Finetune data - a Deventhedude Collection

Two Minds Better Than One: Collaborative Reward Modeling for LLM Alignment

Paper • 2505.10597 • Published May 15

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

FML-bench: A Benchmark for Automatic ML Research Agents Highlighting the Importance of Exploration Breadth

Paper • 2510.10472 • Published Oct 12 • 8

Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep Research

Paper • 2510.06056 • Published Oct 7 • 5

RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback

Paper • 2510.06186 • Published Oct 7

AlphaResearch: Accelerating New Algorithm Discovery with Language Models

Paper • 2511.08522 • Published Nov 11 • 15

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 27.9k • 1.51k

open-thoughts/OpenThoughts3-1.2M

Viewer • Updated Jun 9 • 1.2M • 12.7k • 190

nex-agi/agent-sft

Preview • Updated 5 days ago • 1.08k • 88

Open-Bee/Honey-Data-15M

Viewer • Updated Nov 5 • 14.8M • 79.6k • 102

Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents

Paper • 2310.19923 • Published Oct 30, 2023 • 14

nvidia/Nemotron-PII

Viewer • Updated Oct 28 • 200k • 1.89k • 44

HuggingFaceFW/fineweb

Viewer • Updated Jul 11 • 52.5B • 199k • 2.51k

rl-research/dr-tulu-sft-data

Viewer • Updated 19 days ago • 13.1k • 931 • 25

HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27 • 4.48B • 77k • 706

miromind-ai/MiroVerse-v0.1

Viewer • Updated 25 days ago • 228k • 1.03k • 94

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8 • 3.91M • 6.06k • 613

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 74k • 988

HuggingFaceH4/MATH-500

Viewer • Updated Nov 15, 2024 • 500 • 81k • 230

nick007x/github-code-2025

Viewer • Updated Oct 15 • 147M • 8.65k • 109

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 193

nvidia/ToolScale

Viewer • Updated 17 days ago • 4.06k • 2.67k • 130

natolambert/GeneralThought-430K-filtered

Viewer • Updated Mar 26 • 338k • 1.16k • 28

RJT1990/GeneralThoughtArchive

Viewer • Updated Sep 5 • 431k • 3.77k • 69

open-thoughts/OpenThoughts-114k

Viewer • Updated Aug 31 • 228k • 106k • 777

open-r1/OpenR1-Math-Raw

Viewer • Updated Feb 24 • 516k • 553 • 76

PrimeIntellect/SYNTHETIC-1

Viewer • Updated Feb 21 • 1.99M • 748 • 60

PrimeIntellect/synthetic-code-understanding

Viewer • Updated Feb 15 • 60.6k • 78 • 17

PrimeIntellect/INTELLECT-3-SFT

Viewer • Updated 16 days ago • 6.98M • 1.06k

openbmb/InfLLM-V2-data-5B

Viewer • Updated Oct 25 • 7.19M • 443 • 30

kenhktsui/open-react-retrieval-multi-neg-result-new-kw

Viewer • Updated Aug 7, 2023 • 25.2k • 45 • 3

alwaysfurther/tiny-agent-with-tools

Viewer • Updated 11 days ago • 27 • 31

tiny-agents/tiny-agents

Viewer • Updated Sep 2 • 9 • 480 • 32

PleIAs/SYNTH

Viewer • Updated Nov 11 • 68M • 62.8k • 196

TuringEnterprises/Turing-Open-Reasoning

Viewer • Updated 8 days ago • 50 • 11.2k • 117

TeichAI/claude-4.5-opus-high-reasoning-250x

Viewer • Updated 16 days ago • 250 • 807 • 33

PrimeIntellect/INTELLECT-3-RL

Viewer • Updated Nov 8 • 70.7k • 7.72k • 2

PrimeIntellect/Reverse-Text-RL

Viewer • Updated Aug 12 • 1k • 2.59k • 2

PrimeIntellect/Reverse-Text-SFT

Viewer • Updated Aug 12 • 1k • 497 • 1

PrimeIntellect/SYNTHETIC-2-Base-Code

Viewer • Updated Jun 23 • 57.3k • 103

PrimeIntellect/SYNTHETIC-2-Base-Math

Viewer • Updated Jun 23 • 105k • 17 • 1

PrimeIntellect/SYNTHETIC-2-Base

Viewer • Updated Jun 23 • 465k • 54 • 9

PrimeIntellect/SYNTHETIC-2-Base-General-Reasoning

Viewer • Updated Jun 23 • 165k • 26 • 1

PrimeIntellect/SYNTHETIC-2-SFT-verified

Viewer • Updated Jul 10 • 105k • 276 • 6

PrimeIntellect/SYNTHETIC-2-Base-Answer-Critique

Viewer • Updated Jun 23 • 50k • 17 • 1

PrimeIntellect/SYNTHETIC-2-Base-Instruction-Following

Viewer • Updated Jun 23 • 87.5k • 25

PrimeIntellect/SYNTHETIC-2

Viewer • Updated Jul 10 • 51.6k • 325 • 8

PrimeIntellect/AIME-24

Viewer • Updated Jun 24 • 30 • 107

PrimeIntellect/AIME-25

Viewer • Updated Jun 24 • 30 • 66

PrimeIntellect/MATH-500

Viewer • Updated Jun 24 • 500 • 2.34k

PrimeIntellect/LiveCodeBench-v5

Viewer • Updated Jun 25 • 279 • 164

arcee-ai/bfcl_v4_web_search

Viewer • Updated Sep 13 • 100 • 54

arcee-ai/EvolKit-75K

Viewer • Updated Dec 5, 2024 • 74.2k • 70 • 36

arcee-ai/general-dpo-datasets

Viewer • Updated Jul 4, 2024 • 91.6k • 172

arcee-ai/synthetic-data-gen

Viewer • Updated Sep 21, 2023 • 999k • 45 • 2

arcee-ai/DAM

Viewer • Updated Nov 25, 2024 • 10.4k • 72

arcee-ai/EvolKit-20k-vi

Viewer • Updated Nov 7, 2024 • 15.4k • 70 • 7

arcee-ai/reasoning-sharegpt

Viewer • Updated Jul 5, 2024 • 29.9k • 52 • 23

arcee-ai/agent-data

Viewer • Updated Jul 22, 2024 • 486k • 161 • 63

arcee-ai/infini-instruct-top-500k

Viewer • Updated Jun 30, 2024 • 500k • 49 • 6

arcee-ai/cleaned-mlabonne-distilabel-truthy-dpo-v0.1-filtered

Viewer • Updated Jun 18, 2024 • 663 • 27

Nanbeige/ToolMind

Updated 20 days ago • 1.67k • 16

Salesforce/APIGen-MT-5k

Viewer • Updated Oct 10 • 5k • 825 • 87

Team-ACE/ToolACE

Viewer • Updated Sep 4, 2024 • 11.3k • 880 • 148

glaiveai/glaive-function-calling-v2

Viewer • Updated Sep 27, 2023 • 113k • 2.28k • 477

nvidia/When2Call

Viewer • Updated Apr 29 • 28k • 395 • 41

Salesforce/xlam-function-calling-60k

Viewer • Updated Jan 24 • 60k • 4.19k • 553

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11 • 3.5B • 251k • 845

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published 12 days ago • 46

MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning

Paper • 2510.08567 • Published Oct 9

Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs

Paper • 2511.19773 • Published 19 days ago • 9

ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use

Paper • 2510.27363 • Published Oct 31 • 22

Ariadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries

Paper • 2511.00710 • Published Nov 1 • 4

VLA-R1: Enhancing Reasoning in Vision-Language-Action Models

Paper • 2510.01623 • Published Oct 2 • 10

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7 • 42

DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

Paper • 2510.12801 • Published Oct 14 • 13

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24 • 99

Open Multimodal Retrieval-Augmented Factual Image Generation

Paper • 2510.22521 • Published Oct 26 • 30

smolagents/android-control

Viewer • Updated May 9 • 15.3k • 2.13k • 11

smolagents/guiact-web-single

Viewer • Updated Aug 5 • 13.3k • 40 • 1

smolagents/tool-scraping

Viewer • Updated Sep 16 • 1.89k • 38 • 5

smolagents/hermes-function-calling-v1-formatted-code-agent

Viewer • Updated Jun 30 • 9k • 31 • 1

smolagents/aguvis-stage-1

Viewer • Updated Aug 5 • 459k • 4.96k • 15

smolagents/aguvis-stage-2

Viewer • Updated Sep 5 • 784k • 4.92k • 23

beyoru/ToolCalll_fusion

Viewer • Updated Sep 15 • 10.5k • 22 • 1

beyoru/ToolCall_synthetic_qwen3

Viewer • Updated Jul 20 • 60k • 58 • 9

qualifire/mcp-tool-use-quality-benchmark

Viewer • Updated Sep 25 • 5k • 47 • 3

mlx-community/hermes-reasoning-tool-use

Viewer • Updated Jul 29 • 51k • 106 • 4

TeichAI/gemini-3-pro-preview-high-reasoning-1000x

Viewer • Updated 4 days ago • 1.02k • 606 • 17

openbmb/Ultra-FineWeb

Viewer • Updated 4 days ago • 1.29B • 42.4k • 247

allenai/Dolci-Instruct-SFT-Tool-Use

Viewer • Updated 24 days ago • 228k • 906 • 5

nvidia/Nemotron-Content-Safety-Reasoning-Dataset

Preview • Updated 18 days ago • 137 • 2

ai-safety-institute/AgentHarm

Viewer • Updated Dec 19, 2024 • 468 • 4.08k • 44

Voxel51/ScreenSpot-v2

Viewer • Updated Jun 25 • 1.27k • 3.15k • 1

rootsautomation/ScreenSpot

Viewer • Updated Apr 10, 2024 • 1.27k • 2.1k • 43

microsoft/WebTailBench

Preview • Updated 15 days ago • 278 • 14

DeepShop/DeepShop

Viewer • Updated May 13 • 150 • 64 • 3

osunlp/Online-Mind2Web

Viewer • Updated 2 days ago • 300 • 371 • 17

zai-org/T1

Preview • Updated Mar 2 • 49 • 8

zai-org/LongBench-v2

Viewer • Updated Dec 20, 2024 • 503 • 13.9k • 26