Introducing demos for new SOTA models from AI2: SAGE-MM (Smart Any-Horizon Agents for Long-Video Reasoning) and Molmo-2, an open vision-language model supporting multi-image QA and pointing, plus video QA, pointing, and tracking. The related demo collections are linked below. 🎃🔥
✨ SAGE-MM [Video-Reasoning]: prithivMLmods/SAGE-MM-Video-Reasoning
✨ Molmo2 [Demo]: prithivMLmods/Molmo2-HF-Demo
🎃 GitHub[SAGE-MM]: https://github.com/PRITHIVSAKTHIUR/SAGE-MM-Video-Reasoning
🎃 GitHub[Molmo2]: https://github.com/PRITHIVSAKTHIUR/Molmo2-HF-Demo
🎃 Multimodal Implementations: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
To learn more, visit the app pages or the respective model pages!
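If you prefer to try a checkpoint in code rather than through the hosted demos, here is a minimal sketch using the generic transformers image-text-to-text pipeline. The model id below is a placeholder (swap in the actual SAGE-MM or Molmo-2 checkpoint from the collections above), and whether a given checkpoint runs through this pipeline is an assumption, not something confirmed by the demo repos.

```python
# Minimal sketch: query a vision-language checkpoint with the generic
# transformers "image-text-to-text" pipeline.
# NOTE: "<vlm-checkpoint-id>" is a placeholder, not a real repo id; replace it
# with the SAGE-MM / Molmo-2 checkpoint you want to try from the collections.
from transformers import pipeline

pipe = pipeline(
    "image-text-to-text",
    model="<vlm-checkpoint-id>",  # placeholder
    device_map="auto",
    torch_dtype="auto",
)

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/bee.jpg"},
            {"type": "text", "text": "Describe this image."},
        ],
    },
]

# return_full_text=False keeps only the newly generated answer.
out = pipe(text=messages, max_new_tokens=64, return_full_text=False)
print(out[0]["generated_text"])
```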