QUAR-VLA: Vision-Language-Action Model for Quadruped Robots Paper • 2312.14457 • Published Dec 22, 2023 • 1
PiTe: Pixel-Temporal Alignment for Large Video-Language Model Paper • 2409.07239 • Published Sep 11, 2024 • 15
OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation Paper • 2505.03912 • Published May 6, 2025 • 9
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning Paper • 2505.12448 • Published May 18, 2025 • 10
Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding Paper • 2503.02310 • Published Mar 4, 2025 • 1
CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding Paper • 2506.13725 • Published Jun 16, 2025 • 1
QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning Paper • 2412.15576 • Published Dec 20, 2024
ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver Paper • 2508.10333 • Published Aug 14, 2025 • 1
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model Paper • 2509.09372 • Published Sep 11, 2025 • 236
Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey Paper • 2510.10903 • Published Oct 2025
Spatial Forcing: Implicit Spatial Representation Alignment for Vision-Language-Action Model Paper • 2510.12276 • Published Oct 2025 • 142
VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation Paper • 2510.14902 • Published Oct 2025 • 13
VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators Paper • 2510.00406 • Published Oct 1, 2025 • 64
Post: FalconMamba 7B, a new model from TII (Technology Innovation Institute), is out!
- Blogpost: https://huggingface.co/blog/falconmamba
- Collection: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
- Playground: tiiuae/falcon-mamba-playground