Maksym Andriushchenko

MaksymAndriushchenko

https://www.andriushchenko.me/

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Paper • 2510.09462 • Published 29 days ago • 5

upvoted a paper about 1 month ago

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM

Paper • 2509.18058 • Published Sep 22 • 12

authored a paper about 2 months ago

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM

Paper • 2509.18058 • Published Sep 22 • 12

upvoted a paper about 2 months ago

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

Paper • 2509.09677 • Published Sep 11 • 34

liked a dataset 2 months ago

microsoft/llmail-inject-challenge

Viewer • Updated May 16 • 462k • 805 • 23

liked a model 2 months ago

swiss-ai/Apertus-8B-Instruct-2509

Text Generation • 8B • Updated Oct 1 • 389k • 388

liked a dataset 4 months ago

HuggingFaceTB/smoltalk2

Viewer • Updated 8 days ago • 8.61M • 9.08k • 116

upvoted a paper 5 months ago

OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents

Paper • 2506.14866 • Published Jun 17 • 5

commented a paper 5 months ago

OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents

Paper • 2506.14866 • Published Jun 17 • 5

liked a model 5 months ago

Qwen/Qwen3-8B

Text Generation • 8B • Updated Jul 26 • 3.24M • 720

liked a dataset 5 months ago

YuehHanChen/DecomposedHarm

Viewer • Updated Jun 17 • 4.64k • 25 • 4

upvoted a paper 5 months ago

Capability-Based Scaling Laws for LLM Red-Teaming

Paper • 2505.20162 • Published May 26 • 4

upvoted a paper 7 months ago

Antidistillation Sampling

Paper • 2504.13146 • Published Apr 17 • 59

liked a model 7 months ago

tomg-group-umd/huginn-0125

Text Generation • 4B • Updated Jul 29 • 2.2k • 288

liked a model about 1 year ago

GraySwanAI/Mistral-7B-Instruct-RR

Text Generation • 7B • Updated Jul 9, 2024 • 9.9k • 5

liked 2 datasets about 1 year ago

ai-safety-institute/AgentHarm

Viewer • Updated Dec 19, 2024 • 468 • 1.6k • 39

gaia-benchmark/GAIA

Viewer • Updated 11 days ago • 932 • 8.46k • 478

updated a dataset about 1 year ago

JailbreakBench/JBB-Behaviors

Viewer • Updated Sep 26, 2024 • 500 • 13.4k • 65

liked a dataset about 1 year ago

locuslab/TOFU

Viewer • Updated Mar 27 • 18.1k • 99.2k • 42

liked a Space about 1 year ago

Open LLM Leaderboard 🏆

Space • Track, rank and evaluate open LLMs and chatbots • 13.7k
