view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR By lightonai and 2 others • 17 days ago • 58
view article Article ModernVBERT: Towards Smaller Visual Document Retrievers By paultltc and 4 others • Oct 3 • 44
view article Article Exploring Environments Hub: Your Language Model needs better (open) environments to learn By anakin87 • Sep 4 • 28
view article Article Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning By codelion • Aug 9 • 12
SauerkrautLM-Multilingual-(Reason)-ColBERT Collection SauerkrautLM ColBERT is a suite of Late-Interaction retrieval models built with PyLate’s ColBERT architecture and tuned for seven European languages. • 7 items • Updated Aug 3 • 18
NER ITA Collection This collection presents my best models tailored for Named Entity Recognition (NER) tasks, exclusively designed for the Italian language. • 3 items • Updated Jul 20 • 2
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26 • 169
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 217
view article Article **Intelligence Potentiation: An Evolutionary Perspective on AI Agent Designs** By KnutJaegersberg • Dec 19, 2024 • 4
view article Article SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive By DavidGF • Nov 9, 2024 • 9