Zhaoye Fei

ngc7293

https://ngc7292.github.io/

AI & ML interests

NLP & Ro.

Recent Activity

liked a Space 3 days ago

OpenMOSS-Team/MOSS-transcribe-diarize

upvoted 2 papers 2 days ago

DiRL: An Efficient Post-Training Framework for Diffusion Language Models

Paper • 2512.22234 • Published 10 days ago • 17

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published 3 days ago • 59

liked a Space 3 days ago

MOSS Transcribe Diarize

upvoted 2 papers about 1 month ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 210

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

Paper • 2511.15605 • Published Nov 19, 2025 • 22

liked a model about 2 months ago

OpenMOSS-Team/MOSS-TTSD-v0.7

Text-to-Speech • 2B • Updated Nov 11, 2025 • 1.05k • 15

upvoted a paper about 2 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 108

upvoted a paper 2 months ago

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

Paper • 2510.23763 • Published Oct 27, 2025 • 53

liked 2 datasets 3 months ago

Sylvest/libero_plus_rlds

Updated Oct 17, 2025 • 472 • 5

Sylvest/LIBERO-plus

Updated Oct 17, 2025 • 545 • 15

upvoted 3 papers 3 months ago

PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

Paper • 2510.13809 • Published Oct 15, 2025 • 37

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published Oct 15, 2025 • 45

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Paper • 2510.00499 • Published Oct 1, 2025 • 19

liked a model 3 months ago

OpenMOSS-Team/MOSS-Speech

9B • Updated Sep 30, 2025 • 197 • 16

liked a Space 3 months ago

MOSS-Speech Demo

True Speech-to-Speech Language Model

upvoted a paper 3 months ago

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18, 2025 • 111

liked a Space 4 months ago

README

OpenMOSS Team of SII

updated a Space 4 months ago

README

OpenMOSS Team of SII

published a Space 4 months ago

README

OpenMOSS Team of SII

upvoted a paper 5 months ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published Aug 11, 2025 • 110