-
OpenAI o1 System Card
Paper • 2412.16720 • Published • 36 -
LearnLM: Improving Gemini for Learning
Paper • 2412.16429 • Published • 22 -
NILE: Internal Consistency Alignment in Large Language Models
Paper • 2412.16686 • Published • 8 -
Offline Reinforcement Learning for LLM Multi-Step Reasoning
Paper • 2412.16145 • Published • 38
Sheikh Jubair
sheikhjubair
AI & ML interests
None yet
Recent Activity
updated
a collection
9 days ago
reasoning-agentic
updated
a dataset
2 months ago
humain-ai/LC-Eval
updated
a collection
3 months ago
reasoning-agentic