12 14 44

XuHao Hu

Foreshhh

AI & ML interests

NLP MM

Recent Activity

upvoted a paper 14 days ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

upvoted a paper 23 days ago

BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities

authored a paper 27 days ago

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks

View all activity

Organizations

upvoted a paper 14 days ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published 15 days ago • 19

upvoted a paper 23 days ago

BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities

Paper • 2510.08759 • Published 27 days ago • 46

authored 5 papers 27 days ago

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks

Paper • 2506.16402 • Published Jun 19 • 1

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Paper • 2507.18576 • Published Jul 24 • 6

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Paper • 2507.16534 • Published Jul 22 • 7

Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step

Paper • 2509.23924 • Published Sep 28 • 7

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published 28 days ago • 22

upvoted a paper 27 days ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published 28 days ago • 22

commented a paper 27 days ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published 28 days ago • 22 •

upvoted a paper about 1 month ago

Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step

Paper • 2509.23924 • Published Sep 28 • 7

upvoted a collection about 1 month ago

Qwen3

Collection

84 items • Updated Aug 6 • 1.39k

upvoted a paper about 1 month ago

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23 • 28

New activity in OpenSafetyLab/MD-Judge-v0_2-internlm2_7b 3 months ago

Unable to download the model

#2 opened about 1 year ago by

sriharshasurineni

New activity in Foreshhh/vlsbench 3 months ago

[bot] Conversion to Parquet

#1 opened 12 months ago by

parquet-converter

Question about image files - no images found when loading dataset

#2 opened 3 months ago by

leo1200213

updated a dataset 3 months ago

Foreshhh/vlsbench

Viewer • Updated Aug 12 • 2.24k • 286 • 7

New activity in OpenSafetyLab/t2i_safety_dataset 3 months ago

Improve dataset card for T2ISafety benchmark

#1 opened 3 months ago by

nielsr

updated a collection 4 months ago

VLSBench

Collection

3 items • Updated Jul 21

XuHao Hu

AI & ML interests

Recent Activity

Organizations

Foreshhh's activity

Unable to download the model

[bot] Conversion to Parquet

Question about image files - no images found when loading dataset

Improve dataset card for T2ISafety benchmark