MihailSlutsky's picture

14 14

MihailSlutsky

MihailSlutsky

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management

upvoted a paper 16 days ago

Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity

upvoted a paper 16 days ago

Detect Anything via Next Point Prediction

View all activity

Organizations

None yet

upvoted 6 papers 16 days ago

Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management

Paper • 2510.06727 • Published 28 days ago • 3

Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity

Paper • 2510.01171 • Published Oct 1 • 18

Detect Anything via Next Point Prediction

Paper • 2510.12798 • Published 21 days ago • 44

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

Paper • 2510.11062 • Published 23 days ago • 26

AnyUp: Universal Feature Upsampling

Paper • 2510.12764 • Published 21 days ago • 10

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published 20 days ago • 101

upvoted 7 papers about 2 months ago

THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning

Paper • 2509.13761 • Published Sep 17 • 16

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11 • 45

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Paper • 2509.09674 • Published Sep 11 • 78

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11 • 235

QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading

Paper • 2509.09995 • Published Sep 12 • 14

LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios

Paper • 2509.09926 • Published Sep 12 • 13

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

Paper • 2509.09677 • Published Sep 11 • 34

liked 6 datasets about 2 months ago

Kwai-Keye/Thyme-SFT

Viewer • Updated Aug 18 • 346k • 1.83k • 10

advaitgupta/perception-test

Viewer • Updated Jul 2 • 7.41k • 59 • 1

lmms-lab/PerceptionTest_Val

Viewer • Updated Jun 5, 2024 • 19.1k • 688 • 1

lmms-lab/PerceptionTest

Viewer • Updated Jun 4, 2024 • 30.7k • 227 • 1

VLM2Vec/NExTQA

Viewer • Updated May 31, 2024 • 60.6k • 207 • 1

yifanzhang114/Thyme-RL

Viewer • Updated Aug 9 • 55.2k • 117 • 1

liked a dataset 2 months ago

Leyo/ActivityNet_Captions

Viewer • Updated Jul 1, 2022 • 19.8k • 121 • 2