Mehul Damani PRO

mehuldamani

https://damanimehul.github.io

AI & ML interests

Reinforcement Learning, Large Language Models

Recent Activity

published a model about 1 hour ago

mehuldamani/qwen3_8b_hotpot_rlvr_single

published a model about 1 hour ago

mehuldamani/qwen3_8b_hotpot_rlcr_single

updated a dataset about 2 hours ago

mehuldamani/judge-v1-step600

Organizations

None yet

Collections 1

Papers 4

Models 85

Datasets 16

mehuldamani/judge-v1-step600

Updated about 2 hours ago

mehuldamani/judge-v1-base

Updated about 2 hours ago

mehuldamani/judge-step600

Updated about 2 hours ago

mehuldamani/judge-base

Updated about 2 hours ago

mehuldamani/gpt-5-trial

Updated about 2 hours ago

mehuldamani/chat-test-classified

Viewer • Updated Oct 3 • 22.3k • 8

mehuldamani/synthtool-v1-modified

Viewer • Updated Aug 20 • 10k • 6

mehuldamani/gpt5-simpleqa-20

Viewer • Updated Aug 19 • 20 • 6 • 1

mehuldamani/grok-4-trial

Viewer • Updated Aug 13 • 20 • 5

mehuldamani/r1-trial

Viewer • Updated Aug 13 • 20 • 6

Collections 1

mehuldamani/big-math-digits-v2-correctness

mehuldamani/hotpot-v2-correctness-7b

mehuldamani/orm-big-math-digits-v2-correctness

mehuldamani/big-math-digits-v2-brier

Models 85

mehuldamani/qwen3_8b_hotpot_rlvr_single

mehuldamani/qwen3_8b_hotpot_rlcr_single

mehuldamani/math-rlvr-v1-qwen3

mehuldamani/llama-3.1-8b-instruct-user-sim-sft-v3-global-step-600

mehuldamani/hotpot-qwen38b_nov1-rlcr-single-a100

mehuldamani/hotpot-qwen25b_nov4-rlvr-multiple-h100

mehuldamani/hotpot-sept27-rlvr-multiple-h100

mehuldamani/llama-3.1-8b-instruct-user-sim

mehuldamani/writing-qwen3-rl-v2

mehuldamani/writing-qwen3-rlav-v1
