Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance Paper β’ 2504.09753 β’ Published Apr 13 β’ 6
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais and 2 others β’ Nov 13, 2024 β’ 104
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic Paper β’ 2509.01363 β’ Published Sep 1 β’ 58
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. β’ 2 items β’ Updated Aug 7 β’ 374
Whisper Release Collection Whisper includes both English-only and multilingual checkpoints for ASR and ST, ranging from 38M params for the tiny models to 1.5B params for large. β’ 12 items β’ Updated Sep 13, 2023 β’ 141
Cohere Labs Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. β’ 5 items β’ Updated Jul 31 β’ 70