Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published 7 days ago • 91
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7 • 178
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26 • 173
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 218
Language Models are Realistic Tabular Data Generators Paper • 2210.06280 • Published Oct 12, 2022 • 1
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 259