TsinghuaC3I

university

http://c3i.ee.tsinghua.edu.cn/en/

TsinghuaC3I

AI & ML interests

Large Language Models

Recent Activity

Xiaotiank updated a model 15 days ago

TsinghuaC3I/Qwen2.5-7B-VL-ReAd-R

iseesaw authored a paper about 1 month ago

FlowRL: Matching Reward Distributions for LLM Reasoning

xuekai authored a paper about 2 months ago

CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model

View all activity

Papers

A Survey of Reinforcement Learning for Large Reasoning Models

View all Papers

TsinghuaC3I 's Papers 1

Submitted by

iseesaw

A Survey of Reinforcement Learning for Large Reasoning Models

TsinghuaC3I