Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper • 2408.11039 • Published Aug 20, 2024 • 63
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference Paper • 2410.00215 • Published Sep 30, 2024
CWM: An Open-Weights LLM for Research on Code Generation with World Models Paper • 2510.02387 • Published Sep 30 • 7
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 108
OneFlow: Concurrent Mixed-Modal and Interleaved Generation with Edit Flows Paper • 2510.03506 • Published about 1 month ago • 13
Jointly Reinforcing Diversity and Quality in Language Model Generations Paper • 2509.02534 • Published Sep 2 • 24
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering Paper • 2305.17080 • Published May 26, 2023
Text Quality-Based Pruning for Efficient Training of Language Models Paper • 2405.01582 • Published Apr 26, 2024
Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation Paper • 2406.10970 • Published Jun 16, 2024 • 1
HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing Paper • 2407.07566 • Published Jul 10, 2024