14 113 10

Chengsong Huang

ChengsongHuang

https://chengsong-huang.github.io/

hcscctv

AI & ML interests

None yet

Recent Activity

upvoted a paper about 20 hours ago

Scaling Agent Learning via Experience Synthesis

upvoted a paper 1 day ago

NVIDIA Nemotron Nano V2 VL

upvoted a paper 5 days ago

Defeating the Training-Inference Mismatch via FP16

View all activity

Organizations

upvoted a paper about 20 hours ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published 3 days ago • 54

upvoted a paper 1 day ago

NVIDIA Nemotron Nano V2 VL

Paper • 2511.03929 • Published 2 days ago • 10

upvoted 2 papers 5 days ago

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published 9 days ago • 27

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published 8 days ago • 59

upvoted 2 papers 8 days ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published 9 days ago • 40

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published 9 days ago • 113

upvoted a paper 9 days ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published 11 days ago • 12

upvoted 2 papers 10 days ago

VisCoder2: Building Multi-Language Visualization Coding Agents

Paper • 2510.23642 • Published 15 days ago • 20

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published 11 days ago • 90

upvoted a paper 12 days ago

Reasoning with Sampling: Your Base Model is Smarter Than You Think

Paper • 2510.14901 • Published 23 days ago • 45

upvoted 2 papers 15 days ago

Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1

Paper • 2510.19600 • Published 17 days ago • 67

AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

Paper • 2510.19779 • Published 17 days ago • 58

upvoted a paper 19 days ago

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper • 2510.15444 • Published 22 days ago • 145

upvoted a paper 24 days ago

Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity

Paper • 2510.01171 • Published Oct 1 • 18

upvoted a paper 25 days ago

Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published 26 days ago • 31

upvoted 2 papers 26 days ago

Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

Paper • 2510.06499 • Published Oct 7 • 31

ReviewerToo: Should AI Join The Program Committee? A Look At The Future of Peer Review

Paper • 2510.08867 • Published 29 days ago • 4

upvoted a paper 29 days ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published 30 days ago • 260

upvoted 2 papers about 1 month ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2 • 92

Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends

Paper • 2509.24203 • Published Sep 29 • 7

Chengsong Huang

AI & ML interests

Recent Activity

Organizations

ChengsongHuang's activity