MIKHAIL BURTSEV's picture

7 14 8

MIKHAIL BURTSEV

mbur

·

AI & ML interests

None yet

Recent Activity

upvoted an article 26 days ago

Arc Virtual Cell Challenge: A Primer

upvoted a paper about 2 months ago

AutoIntent: AutoML for Text Classification

upvoted a paper 3 months ago

Limitations of Normalization in Attention Mechanism

View all activity

Organizations

authored 2 papers 3 months ago

Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling

Paper • 2508.16745 • Published Aug 22 • 28

Limitations of Normalization in Attention Mechanism

Paper • 2508.17821 • Published Aug 25 • 7

authored a paper 9 months ago

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published Feb 18 • 72

authored a paper 10 months ago

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published Jan 22 • 69

authored 12 papers over 1 year ago

The Second Conversational Intelligence Challenge (ConvAI2)

Paper • 1902.00098 • Published Jan 31, 2019

ConvAI3: Generating Clarifying Questions for Open-Domain Dialogue Systems (ClariQ)

Paper • 2009.11352 • Published Sep 23, 2020

Knowledge Distillation of Russian Language Models with Reduction of Vocabulary

Paper • 2205.02340 • Published May 4, 2022

Scaling Transformer to 1M tokens and beyond with RMT

Paper • 2304.11062 • Published Apr 19, 2023 • 3

Recurrent Memory Transformer

Paper • 2207.06881 • Published Jul 14, 2022 • 1

Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information

Paper • 2311.01326 • Published Nov 2, 2023 • 3

Uncertainty Guided Global Memory Improves Multi-Hop Question Answering

Paper • 2311.18151 • Published Nov 29, 2023

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5, 2024 • 36

AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents

Paper • 2407.04363 • Published Jul 5, 2024 • 33

Complexity of Symbolic Representation in Working Memory of Transformer Correlates with the Complexity of a Task

Paper • 2406.14213 • Published Jun 20, 2024 • 21

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Paper • 2406.10149 • Published Jun 14, 2024 • 52

In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss

Paper • 2402.10790 • Published Feb 16, 2024 • 42