MJ-Bench-Team

community

https://mj-bench.github.io

MJ-Bench

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

zhuokai authored a paper about 2 months ago

Scaling Agent Learning via Experience Synthesis

zhuokai authored a paper 2 months ago

From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding

zhuokai authored a paper 2 months ago

Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment

View all activity

zhuokai

authored a paper about 2 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5 • 81

zhuokai

authored 10 papers 2 months ago

From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding

Paper • 2412.06474 • Published Dec 9, 2024

CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning

Paper • 2503.19900 • Published Mar 25

Boosting LLM Reasoning via Spontaneous Self-Correction

Paper • 2506.06923 • Published Jun 7

RecoWorld: Building Simulated Environments for Agentic Recommender Systems

Paper • 2509.10397 • Published Sep 12 • 7

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

Paper • 2508.15717 • Published Aug 21 • 1

Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning

Paper • 2510.05251 • Published Oct 6 • 7

Thought Communication in Multiagent Collaboration

Paper • 2510.20733 • Published Oct 23 • 14

Zhaorun

updated a dataset 2 months ago

MJ-Bench/MJ-Bench

Viewer • Updated Oct 23 • 7.56k • 140

Zhaorun

published a dataset 2 months ago

MJ-Bench/MJ-Bench

Viewer • Updated Oct 23 • 7.56k • 140

yichaodu

authored a paper 5 months ago

Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models

Paper • 2508.02120 • Published Aug 4 • 19

zhuokai

authored a paper 10 months ago

HumanMM: Global Human Motion Recovery from Multi-shot Videos

Paper • 2503.07597 • Published Mar 10 • 2

Zhaorun

authored 2 papers about 1 year ago

GRAPE: Generalizing Robot Policy via Preference Alignment

Paper • 2411.19309 • Published Nov 28, 2024 • 47

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14, 2024 • 51

zhuokai

authored a paper about 1 year ago

Quantifying Generalization Complexity for Large Language Models

Paper • 2410.01769 • Published Oct 2, 2024 • 13

Zhaorun

authored a paper over 1 year ago

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Paper • 2407.12784 • Published Jul 17, 2024 • 51

yuqingzhang

authored a paper over 1 year ago

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5, 2024 • 55

AI & ML interests

Recent Activity

Team members 7

MJ-Bench's activity