Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation Paper • 2510.22115 • Published 13 days ago • 77
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published Sep 30 • 525
Glyph: Scaling Context Windows via Visual-Text Compression Paper • 2510.17800 • Published 18 days ago • 64
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published 25 days ago • 160
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer Paper • 2509.24695 • Published Sep 29 • 44
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark Paper • 2509.24897 • Published Sep 29 • 46
Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning Paper • 2509.13755 • Published Sep 17 • 19
The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs Paper • 2509.09677 • Published Sep 11 • 34
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper • 2508.16153 • Published Aug 22 • 154
MultiRef: Controllable Image Generation with Multiple Visual References Paper • 2508.06905 • Published Aug 9 • 21
Are We on the Right Way for Assessing Document Retrieval-Augmented Generation? Paper • 2508.03644 • Published Aug 5 • 25
Janus Collection • Janus is a novel autoregressive framework that unifies multimodal understanding and generation. • 8 items • Updated Feb 18 • 17
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published Feb 20 • 100