Senqiao Yang's picture

5 59 7

Senqiao Yang

Senqiao

·

https://senqiaoyang.com/

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

upvoted a paper 12 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

upvoted a paper 12 days ago

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

View all activity

Organizations

upvoted a paper 5 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 5 days ago • 61

upvoted 4 papers 12 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published 13 days ago • 101

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published 12 days ago • 70

Openpi Comet: Competition Solution For 2025 BEHAVIOR Challenge

Paper • 2512.10071 • Published 17 days ago • 17

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published 14 days ago • 40

upvoted a paper 18 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published 19 days ago • 125

upvoted a paper 26 days ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25 • 166

liked 2 models about 2 months ago

Alibaba-NLP/GVE-3B

Sentence Similarity • 4B • Updated Nov 3 • 337 • 15

Alibaba-NLP/GVE-7B

Sentence Similarity • 8B • Updated Nov 3 • 139 • 13

liked a dataset about 2 months ago

Alibaba-NLP/UVRB

Updated Nov 6 • 1.12k • 4

upvoted a collection about 2 months ago

GVE

Towards General Video Embeddings: Models and Benchmarks • 4 items • Updated Nov 3 • 19

upvoted a paper about 2 months ago

Towards Universal Video Retrieval: Generalizing Video Embedding via Synthesized Multimodal Pyramid Curriculum

Paper • 2510.27571 • Published Oct 31 • 17

upvoted 8 papers 2 months ago

LongCat-Video Technical Report

Paper • 2510.22200 • Published Oct 25 • 29

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27 • 177

Knocking-Heads Attention

Paper • 2510.23052 • Published Oct 27 • 29

E^2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

Paper • 2510.22733 • Published Oct 26 • 31

Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation

Paper • 2510.23581 • Published Oct 27 • 41

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27 • 58

VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting

Paper • 2510.21817 • Published Oct 21 • 41

Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences

Paper • 2510.23451 • Published Oct 27 • 26