3 18 25

Erfan Shayegani 😈

Erfan-Shayegani

https://erfanshayegani.github.io/

AI & ML interests

AI Safety - Responsible AI - Multi-Modal Alignment

Recent Activity

liked a model 10 days ago

microsoft/Fara-7B

upvoted a paper 29 days ago

The Collaboration Gap

upvoted a paper about 2 months ago

Misaligned Roles, Misplaced Images: Structural Input Perturbations Expose Multimodal Alignment Blind Spots

View all activity

Organizations

liked a model 10 days ago

microsoft/Fara-7B

Image-Text-to-Text • 8B • Updated 4 days ago • 27.3k • 416

upvoted a paper 29 days ago

The Collaboration Gap

Paper • 2511.02687 • Published about 1 month ago • 21

upvoted a paper about 2 months ago

Misaligned Roles, Misplaced Images: Structural Input Perturbations Expose Multimodal Alignment Blind Spots

Paper • 2504.03735 • Published Apr 1 • 1

authored a paper 2 months ago

Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness

Paper • 2510.01670 • Published Oct 2 • 6

commented a paper 2 months ago

Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness

Paper • 2510.01670 • Published Oct 2 • 6 •

upvoted a paper 2 months ago

Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness

Paper • 2510.01670 • Published Oct 2 • 6

upvoted an article 6 months ago

Article

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

Jun 6

•

liked a Space 7 months ago

OS ATLAS

📉

A Foundation Action Model For Generalist GUI Agents

liked 2 models 7 months ago

ByteDance-Seed/UI-TARS-7B-DPO

Image-Text-to-Text • 8B • Updated Jan 25 • 1.22k • 221

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 154k • 442

upvoted a paper 7 months ago

Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis

Paper • 2502.20383 • Published Feb 27 • 3

authored 2 papers 8 months ago

Misaligned Roles, Misplaced Images: Structural Input Perturbations Expose Multimodal Alignment Blind Spots

Paper • 2504.03735 • Published Apr 1 • 1

Unfair Alignment: Examining Safety Alignment Across Vision Encoder Layers in Vision-Language Models

Paper • 2411.04291 • Published Nov 6, 2024

upvoted a paper 9 months ago

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published Mar 7 • 57

liked a model 9 months ago

CohereLabs/aya-vision-8b

Image-Text-to-Text • 9B • Updated Oct 30 • 46.8k • 314

upvoted an article 11 months ago

Article

Abliterating Refusal and Code LLMs

Jul 26, 2024

•

liked 3 models 11 months ago

upvoted a paper about 1 year ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 131

Erfan Shayegani 😈

AI & ML interests

Recent Activity

Organizations

Erfan-Shayegani's activity

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

OS ATLAS

Abliterating Refusal and Code LLMs