2 5 14

Zhenrui Yue

yueeeeeeee2837

https://yueeeeeeee.github.io/

AI & ML interests

NLP, RecSys & Data Mining

Recent Activity

upvoted a paper about 1 month ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

liked a model 3 months ago

openai/gpt-oss-20b

authored a paper 5 months ago

Hybrid Latent Reasoning via Reinforcement Learning

View all activity

Organizations

upvoted a paper about 1 month ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1 • 57

liked a model 3 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26 • 4.58M • • 3.87k

authored a paper 5 months ago

Hybrid Latent Reasoning via Reinforcement Learning

Paper • 2505.18454 • Published May 24 • 6

upvoted a paper 5 months ago

Hybrid Latent Reasoning via Reinforcement Learning

Paper • 2505.18454 • Published May 24 • 6

commented a paper 5 months ago

Hybrid Latent Reasoning via Reinforcement Learning

Paper • 2505.18454 • Published May 24 • 6 •

upvoted a paper 6 months ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 36

liked a model 6 months ago

Qwen/Qwen2.5-Omni-3B

Any-to-Any • 6B • Updated Apr 30 • 266k • 301

liked a model 7 months ago

unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth

Image-Text-to-Text • 109B • Updated Apr 12 • 15 • 17

liked a model 8 months ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 203k • 1.81k

authored a paper 8 months ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 36

liked 2 models 9 months ago

deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27 • 176k • • 3.99k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 422k • • 12.8k

liked 2 datasets 9 months ago

bespokelabs/Bespoke-Stratos-17k

Viewer • Updated Jan 31 • 16.7k • 6.55k • 330

open-thoughts/OpenThoughts-114k

Viewer • Updated Aug 31 • 228k • 58.4k • 771

upvoted an article 9 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 884

liked a dataset 10 months ago

BrightData/IMDb-Media

Viewer • Updated Jun 20, 2024 • 249k • 80 • 7

liked a Space 10 months ago

587

Scaling test-time compute

📈

Implement test-time compute scaling for math problems

liked a model about 1 year ago

1bitLLM/bitnet_b1_58-xl

Text Generation • 1B • Updated Mar 29, 2024 • 232 • 37

liked a dataset about 1 year ago

McAuley-Lab/Amazon-Reviews-2023

Updated Dec 8, 2024 • 70.2k • 224

authored a paper about 1 year ago

Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments

Paper • 2406.09815 • Published Jun 14, 2024

Zhenrui Yue

AI & ML interests

Recent Activity

Organizations

yueeeeeeee2837's activity

Open-R1: a fully open reproduction of DeepSeek-R1

Scaling test-time compute