-
LM2: Large Memory Models
Paper • 2502.06049 • Published • 30 -
Titans: Learning to Memorize at Test Time
Paper • 2501.00663 • Published • 26 -
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Paper • 2501.17161 • Published • 123 -
You Do Not Fully Utilize Transformer's Representation Capacity
Paper • 2502.09245 • Published • 37
Myeongkyun Cho
hestu
·
AI & ML interests
None yet
Recent Activity
liked
a model
4 days ago
manifestai/Brumby-14B-Base
liked
a model
about 2 months ago
Qwen/Qwen3-Next-80B-A3B-Instruct-FP8
upvoted
a
paper
about 2 months ago
A Survey of Reinforcement Learning for Large Reasoning Models