326 394 626

Yatharth Sharma

YaTharThShaRma999

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

YatharthS/LinaCodec

updated a model 5 days ago

YaTharThShaRma999/ncodec

updated a model 15 days ago

YaTharThShaRma999/miratts_finetune

View all activity

Organizations

upvoted an article 18 days ago

Article

LLM based Audio models

19 days ago

•

upvoted an article about 1 month ago

Article

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

Nov 21, 2025

•

upvoted 2 articles about 2 months ago

Article

📐 Muon Optimizer: The Power of Collective Momentum

Nov 14, 2025

•

Article

⛳ Optimizer: What Does It Do and Why We Need It

Nov 12, 2025

•

upvoted a paper 2 months ago

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24, 2025 • 60

upvoted an article 2 months ago

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30, 2025

•

upvoted 9 papers 3 months ago

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7, 2025 • 54

Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation

Paper • 2510.01284 • Published Sep 30, 2025 • 34

UniVid: The Open-Source Unified Video Model

Paper • 2509.24200 • Published Sep 29, 2025 • 4

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech

Paper • 2509.25131 • Published Sep 29, 2025 • 15

HunyuanImage 3.0 Technical Report

Paper • 2509.23951 • Published Sep 28, 2025 • 21

SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer

Paper • 2509.24695 • Published Sep 29, 2025 • 44

upvoted 3 collections 3 months ago

SVDQuant

Collection

Models and datasets for "SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models" • 20 items • Updated May 29, 2025 • 64

Nunchaku

Collection

10 items • Updated Jun 29, 2025 • 35

LPD

Collection

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation • 6 items • Updated Jul 2, 2025 • 2

upvoted 2 papers 4 months ago

<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs

Paper • 2509.08358 • Published Sep 10, 2025 • 13

Q-Sched: Pushing the Boundaries of Few-Step Diffusion Models with Quantization-Aware Scheduling

Paper • 2509.01624 • Published Sep 1, 2025 • 7

Yatharth Sharma

AI & ML interests

Recent Activity

Organizations

YaTharThShaRma999's activity

LLM based Audio models

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

📐 Muon Optimizer: The Power of Collective Momentum

⛳ Optimizer: What Does It Do and Why We Need It

Why Did MiniMax M2 End Up as a Full Attention Model?