sigma's picture

9 105

sigma

sigma7863

·

AI & ML interests

None yet

Recent Activity

liked a dataset 5 days ago

openai/MMMLU

liked a Space 7 days ago

HuggingFaceTB/smol-training-playbook

liked a Space 8 days ago

tori29umai/Qwen-Image-2509-MultipleAngles

View all activity

Organizations

None yet

upvoted a collection 9 days ago

Qwen3-VL

37 items • Updated 10 days ago • 387

upvoted a paper 18 days ago

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

Paper • 2509.26628 • Published Sep 30 • 14

upvoted a collection about 1 month ago

The Markovian Thinker

Reformulating the RL of reasoning LLMs through Markovian Thinking paradigm. • 7 items • Updated Oct 9 • 10

upvoted a paper about 1 month ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7 • 101

upvoted 3 collections about 1 month ago

EditReward

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing • 11 items • Updated 30 days ago • 4

Granite 4.0 Language Models

11 items • Updated 11 days ago • 177

⚛️ Liquid Nanos

Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 21 items • Updated 13 days ago • 92

upvoted a collection 2 months ago

InternVL3.5

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 102

upvoted an article 3 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5

• 505