Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 19 days ago • 10
More than Carbon: Cradle-to-Grave environmental impacts of GenAI training on the Nvidia A100 GPU Paper • 2509.00093 • Published Aug 27 • 4
Doctor Llama Collection Two compact models created from the fine-tuning of the TeenyTinyLama model, using Brazilian Portuguese data containing only medical questions. • 7 items • Updated Aug 24, 2024 • 1
Transformers compatible Mamba Collection This release includes the `mamba` repositories compatible with the `transformers` library • 5 items • Updated Mar 6, 2024 • 39
Trained Models 🏋️ Collection They may be small, but they're training like giants! • 9 items • Updated Aug 16 • 20
Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 41 items • Updated Oct 4 • 37
Roberta Legal Portuguese ⚖️ Collection Related resources to the paper RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese • 8 items • Updated Apr 24, 2024 • 4