19 26 9

Xirui Li PRO

AIcell

https://xirui-li.github.io/

AI & ML interests

Foundation LLM and VLM

Recent Activity

updated a model about 2 hours ago

AIcell/qwen2_5vl-3b-rl

published a model about 3 hours ago

AIcell/qwen2_5vl-3b-rl

updated a model 1 day ago

AIcell/qwen2_5vl-3b-sft

View all activity

Organizations

upvoted a paper 11 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 14 days ago • 246

upvoted a paper 14 days ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published 27 days ago • 93

upvoted 2 papers 17 days ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published Dec 10, 2025 • 71

Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

Paper • 2512.19995 • Published 22 days ago • 15

upvoted a paper 19 days ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published Dec 9, 2025 • 117

upvoted 3 papers 20 days ago

WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling

Paper • 2512.14614 • Published 28 days ago • 67

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 22 days ago • 63

SpatialTree: How Spatial Abilities Branch Out in MLLMs

Paper • 2512.20617 • Published 21 days ago • 42

upvoted a paper 22 days ago

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

Paper • 2512.18880 • Published 23 days ago • 24

upvoted a paper 29 days ago

V-REX: Benchmarking Exploratory Visual Reasoning via Chain-of-Questions

Paper • 2512.11995 • Published Dec 12, 2025 • 9

upvoted a paper about 2 months ago

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published Nov 11, 2025 • 33

upvoted 2 papers 2 months ago

Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs

Paper • 2511.07419 • Published Nov 10, 2025 • 26

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30, 2025 • 83

upvoted 2 papers 3 months ago

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

Paper • 2510.00406 • Published Oct 1, 2025 • 65

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Paper • 2509.25541 • Published Sep 29, 2025 • 140

upvoted a paper 5 months ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 160

upvoted 2 papers 9 months ago

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

Paper • 2504.09925 • Published Apr 14, 2025 • 38

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published Apr 10, 2025 • 43

upvoted a collection 10 months ago

InternVL2.5

Collection

Better than InternVL 2.0 • 19 items • Updated Sep 28, 2025 • 92

upvoted a paper 10 months ago

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published Mar 7, 2025 • 57

Xirui Li PRO

AI & ML interests

Recent Activity

Organizations

AIcell's activity