Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated Jul 21 • 347
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Jul 21 • 125
Sanskrit LLMs Collection Projects I did related to make LLM better in Sanskrit • 10 items • Updated Sep 16 • 2
Indian AI Models Collection Here is list of AI Models developed, trained or Fine Tuned by India Developers or Companies. This is to appreciate the efforts of them. • 40 items • Updated Sep 28 • 5
Llama 4 Collection Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 10 days ago • 50
ELECTRA release Collection This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated Jul 10 • 10
BERT release Collection Regroups the original BERT models released by the Google team. Except for the models marked otherwise, the checkpoints support English. • 8 items • Updated Jul 10 • 35
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Jul 10 • 344
DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 10 days ago • 259
Llama 3.1 Collection Collection Meta's Llama 3.1 models including 8B, 70B, 405B. Includes 4-bit bnb and original versions. • 13 items • Updated 10 days ago • 8