
Zhenhailong Wang PRO

mikewang

https://mikewangwzhl.github.io/

AI & ML interests

NLP, Computer Vision

Recent Activity


upvoted a paper about 20 hours ago

EBT-Policy: Energy Unlocks Emergent Physical Reasoning Capabilities

Paper • 2510.27545 • Published 5 days ago • 35

upvoted a paper 6 days ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published 7 days ago • 196

upvoted a paper 22 days ago

Multimodal Policy Internalization for Conversational Agents

Paper • 2510.09474 • Published 26 days ago • 4

commented a paper 22 days ago

Multimodal Policy Internalization for Conversational Agents

Paper • 2510.09474 • Published 26 days ago • 4

upvoted a paper about 1 month ago

Where LLM Agents Fail and How They can Learn From Failures

Paper • 2509.25370 • Published Sep 29 • 11

upvoted a paper 2 months ago

FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games

Paper • 2509.01052 • Published Sep 1 • 20

upvoted a paper 3 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 259

upvoted a paper 4 months ago

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published Jul 8 • 47

commented a paper 4 months ago

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published Jul 8 • 47

upvoted a paper 4 months ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published Jul 2 • 68

New activity in mikewang/PVD-160K 5 months ago

Add image-to-text task category

#2 opened 5 months ago by nielsr

New activity in mikewang/PVD-160k-Mistral-7b 5 months ago

Add library name and pipeline tag

#1 opened 5 months ago by nielsr

published a model 6 months ago

mikewang/DyMU

Updated Apr 11

upvoted 3 papers 6 months ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 98

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 80

DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs

Paper • 2504.17040 • Published Apr 23 • 13

upvoted a paper 7 months ago

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published Apr 16 • 48

updated a model 7 months ago

mikewang/DyMU

Updated Apr 11

upvoted a paper 8 months ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published Mar 3 • 29

authored a paper 8 months ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published Mar 3 • 29
