XVLA Collection X-VLA is a soft-prompted Transformer for cross-embodiment robot learning • 6 items • Updated 23 days ago • 11
Treble10 Collection Treble Technologies and Hugging Face have entered in to a long term collaboration. In celebration, we are releasing the Treble10 dataset. • 3 items • Updated Oct 28 • 4
Persian Models Collection This is the largest collection of Persian models available on Huggingface • 773 items • Updated Nov 23 • 16
Persian Datasets Collection This the largest collection of Persian datasets available on Huggingface • 124 items • Updated Sep 14 • 15
NaturalVoices - Voice Conversion Datasets Collection This is a collaborative work of JHU Smile Lab and CMU MSP Lab. Please cite https://arxiv.org/abs/2511.00256 • 5 items • Updated Nov 10 • 4
Evolving Diagnostic Agents in a Virtual Clinical Environment Paper • 2510.24654 • Published Oct 28 • 11
POWSM: A Phonetic Open Whisper-Style Speech Foundation Model Paper • 2510.24992 • Published Oct 28 • 2
OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes Paper • 2510.26800 • Published Oct 30 • 21
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30 • 119
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30 • 116
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning Paper • 2510.27492 • Published Oct 30 • 82
π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models Paper • 2510.25889 • Published Oct 29 • 64