Kalash Shah's picture

Kalash Shah

kalashshah19

·

KalashShah19

AI & ML interests

Mostly LLMs and also Image Generation

Recent Activity

new activity 3 days ago

microsoft/Phi-4-mini-flash-reasoning:Why there isn't even a single Quantized Version for this model ?

new activity 3 days ago

IndianAIDevs/README:Welcomes and Greetings

new activity 3 days ago

unsloth/README:Can you GGUF for bharatgenai/FinanceParam ?

View all activity

Organizations

upvoted 2 changelogs about 2 months ago

Changelog

Repositories total file size is now displayed

Sep 18

• 171

Changelog

Switch settings context between user and organizations

Sep 16

• 37

upvoted 5 collections about 2 months ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Jul 21 • 347

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Jul 21 • 125

Qwen3-Coder

5 items • Updated Jul 31 • 129

Sanskrit LLMs

Projects I did related to make LLM better in Sanskrit • 10 items • Updated Sep 16 • 2

Flux Lora

Some Beautiful Flux Loras created by me ❤️ • 5 items • Updated Nov 11, 2024 • 3

upvoted a changelog about 2 months ago

Changelog

Emoji Autocomplete in Discussions and Posts

Sep 11

• 67

upvoted a collection 2 months ago

Indian AI Models

Here is list of AI Models developed, trained or Fine Tuned by India Developers or Companies. This is to appreciate the efforts of them. • 40 items • Updated Sep 28 • 5

upvoted 11 collections 3 months ago

Llama 4

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 10 days ago • 50

ELECTRA release

This collection regroups the ELECTRA models released by the Google team. • 6 items • Updated Jul 10 • 10

BERT release

Regroups the original BERT models released by the Google team. Except for the models marked otherwise, the checkpoints support English. • 8 items • Updated Jul 10 • 35

LLM Chat Spaces

9 items • Updated Sep 11 • 2

Development

3 items • Updated Sep 16 • 1

Search Agents

3 items • Updated Sep 11 • 1

Small LLMs

3 items • Updated Sep 16 • 1

Gemma release

Groups the Gemma models released by the Google team. • 40 items • Updated Jul 10 • 344

Nemotron-UltraLong

3 items • Updated about 14 hours ago • 17

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 10 days ago • 259

Llama 3.1 Collection

Meta's Llama 3.1 models including 8B, 70B, 405B. Includes 4-bit bnb and original versions. • 13 items • Updated 10 days ago • 8