Prithiv Sakthi's picture

Building on HF

Prithiv Sakthi PRO

prithivMLmods

·

https://linktr.ee/prithivsakthi

AI & ML interests

computer vision, nlp, multimodality - HuggingFace Fellow🤗

Recent Activity

replied to their post 24 minutes ago

#Newer / Current Version 🚨Huggingface APK Update v0.0.4🚨 1. Fixed Pinch to Zoom Update . 2. Swipe Gestures. 3. Fixed Auto Rotate. 4. Updated app Indentifiers. Download the app now !! 🚨Huggingface v0.0.4 Download, ⬇️Link : https://drive.google.com/file/d/1xEiH7LMdP14fBG-xDuSqKje5TRLV1PuS/view?usp=sharing Like 👍Share 🚀 Follow 🌠

updated a model about 3 hours ago

prithivMLmods/Delorme_1-OCR-7B-Post1.0

updated a model about 5 hours ago

prithivMLmods/Qwen-Image-Edit-Rapid-AIO-V21

View all activity

Organizations

upvoted a paper about 6 hours ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published 6 days ago • 83

upvoted a collection about 20 hours ago

Qwen Image Edit (exps)

adapter LoRA developed for Qwen’s Qwen-Image-Edit-2511 image-to-image model • 8 items • Updated about 20 hours ago • 2

upvoted 2 collections 3 days ago

Jan 5 Releases

35 items • Updated 6 days ago • 5

YOLO26 Models

YOLO26 models: detection, segmentation, classification, pose, and OBB variants with demo Space. • 31 items • Updated 4 days ago • 15

upvoted 3 papers 3 days ago

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Paper • 2601.10527 • Published 4 days ago • 19

FlowAct-R1: Towards Interactive Humanoid Video Generation

Paper • 2601.10103 • Published 4 days ago • 18

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 5 days ago • 168

upvoted 2 papers 8 days ago

RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes

Paper • 2601.05249 • Published 11 days ago • 45

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 11 days ago • 194

upvoted a paper 10 days ago

UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement

Paper • 2512.21185 • Published 26 days ago • 29

upvoted 3 collections 13 days ago

👁️ LFM2.5-VL

4 items • Updated 6 days ago • 19

Physical Long-Horizon Reasoning

SFT, RL • 5 items • Updated 15 days ago • 1

Nemotron Speech

Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 16 items • Updated 3 days ago • 18

upvoted a collection 14 days ago

Qwen Image Edit (Object-Manipulator)

Add or remove the specified objects, flexible for both single-image and multi-image modes. • 2 items • Updated 15 days ago • 9

upvoted an article 15 days ago

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

17 days ago

•

12

upvoted 2 papers 15 days ago

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Paper • 2512.24165 • Published 20 days ago • 49

On the Role of Discreteness in Diffusion LLMs

Paper • 2512.22630 • Published 23 days ago • 17

upvoted a paper 17 days ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published 19 days ago • 136

upvoted a paper 18 days ago

DreamOmni3: Scribble-based Editing and Generation

Paper • 2512.22525 • Published 23 days ago • 14

upvoted a paper 19 days ago

Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Paper • 2512.22615 • Published 23 days ago • 44