UIUC ScaleML Lab

university

https://github.com/ScaleML/ScaleML-lab

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Ray2333 authored a paper 9 days ago

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Hanyang81 authored a paper 28 days ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

research4pan authored a paper 29 days ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

View all activity

Ray2333

authored a paper 9 days ago

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Paper • 2510.27623 • Published 12 days ago • 12

Hanyang81

authored a paper 28 days ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published 29 days ago • 26

research4pan

authored a paper 29 days ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

Paper • 2510.11769 • Published about 1 month ago • 25

Ray2333

authored a paper 29 days ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published 29 days ago • 26

FlippyDora

authored a paper 5 months ago

Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models

Paper • 2506.18945 • Published Jun 23 • 40

Ray2333

authored a paper 5 months ago

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Paper • 2506.03143 • Published Jun 3 • 53

FlippyDora

authored a paper 5 months ago

MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning

Paper • 2505.24846 • Published May 30 • 15

Ray2333

authored a paper 5 months ago

MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning

Paper • 2505.24846 • Published May 30 • 15

HanningZhang

authored a paper 6 months ago

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Paper • 2505.02391 • Published May 5 • 25

FlippyDora

authored a paper 6 months ago

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Paper • 2505.02391 • Published May 5 • 25

Chenlu123

authored a paper 9 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 83

HanningZhang

authored a paper 9 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 83

JackBAI

authored a paper 9 months ago

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Paper • 2405.10292 • Published May 16, 2024 • 2

Ray2333

authored 2 papers 9 months ago

Rethinking Diverse Human Preference Learning through Principal Component Analysis

Paper • 2502.13131 • Published Feb 18 • 37

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Paper • 2502.09560 • Published Feb 13 • 35

Hanyang81

authored a paper 9 months ago

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Paper • 2502.09560 • Published Feb 13 • 35

Ray2333

authored a paper about 1 year ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

research4pan

authored a paper about 1 year ago

Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published Oct 9, 2024 • 70

RickyDeSkywalker

authored 2 papers over 1 year ago

DragVideo: Interactive Drag-style Video Editing

Paper • 2312.02216 • Published Dec 3, 2023 • 13

Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models

Paper • 2310.13671 • Published Oct 20, 2023 • 19

AI & ML interests

Recent Activity

Team members 13

UIUC-ScaleML's activity