mertege/checkpoint-2050-merged_linear_Qwen2.5-7B-Instruct Text Generation • 8B • Updated Aug 18 • 8
mertege/checkpoint-2050-merged_linear_Qwen2.5-7B-Instruct Text Generation • 8B • Updated Aug 18 • 8
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models Paper • 2402.13064 • Published Feb 20, 2024 • 50
LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons Paper • 2402.14086 • Published Feb 21, 2024 • 12
ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability Paper • 2508.07050 • Published Aug 9 • 116
Running 3.47k 3.47k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 424
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • 33B • Updated Feb 24 • 1.75M • • 1.47k