-
Making Multimodal Generation Easier: When Diffusion Models Meet LLMs
Paper • 2310.08949 • Published • 1 -
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
Paper • 2503.09573 • Published • 73 -
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
Paper • 2308.04729 • Published • 32 -
PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation
Paper • 2411.08307 • Published • 7
Quan Duong
ruathudo
AI & ML interests
None yet
Organizations
None yet
Music generation
-
Making Multimodal Generation Easier: When Diffusion Models Meet LLMs
Paper • 2310.08949 • Published • 1 -
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
Paper • 2503.09573 • Published • 73 -
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
Paper • 2308.04729 • Published • 32 -
PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation
Paper • 2411.08307 • Published • 7
models
0
None public yet
datasets
0
None public yet