Reza Sayar PRO

Reza2kn

Recent Activity

liked a model 4 days ago

YatharthS/MiraTTS

liked a model 7 days ago

google/medasr

liked a dataset 9 days ago

motus-robotics/DatasetDemo

upvoted a collection 9 days ago

sam-audio

11 items • Updated 10 days ago • 97

upvoted a collection 12 days ago

XVLA

X-VLA is a soft-prompted Transformer for cross-embodiment robot learning • 6 items • Updated 23 days ago • 11

upvoted an article 15 days ago

Make and publish your Reachy Mini App

Article • 24 days ago • 25

upvoted an article 29 days ago

Curating datasets directly on the Hub

Article • 30 days ago • 22

upvoted a collection about 1 month ago

Treble10

Treble Technologies and Hugging Face have entered into a long-term collaboration. In celebration, we are releasing the Treble10 dataset. • 3 items • Updated Oct 28 • 4

upvoted a paper about 1 month ago

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published Nov 20 • 109

upvoted 3 collections about 2 months ago

Persian Models

This is the largest collection of Persian models available on Hugging Face • 773 items • Updated Nov 23 • 16

Persian Datasets

This is the largest collection of Persian datasets available on Hugging Face • 124 items • Updated Sep 14 • 15

NaturalVoices - Voice Conversion Datasets

This is a collaborative work of JHU Smile Lab and CMU MSP Lab. Please cite https://arxiv.org/abs/2511.00256 • 5 items • Updated Nov 10 • 4

upvoted 5 papers about 2 months ago

Evolving Diagnostic Agents in a Virtual Clinical Environment

Paper • 2510.24654 • Published Oct 28 • 11

POWSM: A Phonetic Open Whisper-Style Speech Foundation Model

Paper • 2510.24992 • Published Oct 28 • 2

OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes

Paper • 2510.26800 • Published Oct 30 • 21

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30 • 119

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30 • 108

upvoted a collection about 2 months ago

Emu3.5

Native Multimodal Models are World Learners 🌍 • 4 items • Updated 2 days ago • 72

upvoted 2 papers about 2 months ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30 • 116

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30 • 82

upvoted 2 collections about 2 months ago

ACG (GR00T-N1-2B Post-trained Models)

3 items • Updated about 19 hours ago • 1

Dexbotic

21 items • Updated Oct 21 • 3

upvoted a paper about 2 months ago

π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Paper • 2510.25889 • Published Oct 29 • 64