Théo Pomies
theopomies
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 15 hours ago
PipelineRL: Faster On-policy Reinforcement Learning for Long Sequence
Generation
upvoted
a
paper
about 15 hours ago
Executable Code Actions Elicit Better LLM Agents
upvoted
a
paper
about 15 hours ago
Defeating the Training-Inference Mismatch via FP16