SindBERT, the Sailor: Charting the Seas of Turkish NLP Paper • 2510.21364 • Published 27 days ago • 1
lang-uk/ukr-clip-vit-h-14-frozen-xlm-roberta-large-laion5B-s13B-b90k Zero-Shot Image Classification • Updated Oct 17 • 2
lang-uk/ukr-clip-vit-h-14-frozen-xlm-roberta-large-laion5B-s13B-b90k Zero-Shot Image Classification • Updated Oct 17 • 2
The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models Paper • 2510.13996 • Published Oct 15 • 7
lang-uk/ukr-clip-vit-h-14-frozen-xlm-roberta-large-laion5B-s13B-b90k Zero-Shot Image Classification • Updated Oct 17 • 2
Introducing OmniGEC: A Silver Multilingual Dataset for Grammatical Error Correction Paper • 2509.14504 • Published Sep 18
OmniGEC Collection This is a collection of multilingual silver-standard datasets and models for the task of Grammatical Error Correction (GEC). • 9 items • Updated Sep 19 • 8
Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian Paper • 2509.05668 • Published Sep 6 • 5