Yunxuan Li's picture

Yunxuan Li

eric008

·

AI & ML interests

None yet

Organizations

None yet

authored a paper 6 months ago

Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning

Paper • 2505.20561 • Published May 26 • 7

authored 2 papers over 1 year ago

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Paper • 2407.15762 • Published Jul 22, 2024 • 10

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Paper • 2406.06592 • Published Jun 5, 2024 • 29

authored a paper almost 2 years ago

Gemini: A Family of Highly Capable Multimodal Models

Paper • 2312.11805 • Published Dec 19, 2023 • 47

authored a paper about 2 years ago

Enable Language Models to Implicitly Learn Self-Improvement From Data

Paper • 2310.00898 • Published Oct 2, 2023 • 23