Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling Paper • 2508.16745 • Published Aug 22, 2025 • 28
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Paper • 2502.13063 • Published Feb 18, 2025 • 72
The Second Conversational Intelligence Challenge (ConvAI2) Paper • 1902.00098 • Published Jan 31, 2019
ConvAI3: Generating Clarifying Questions for Open-Domain Dialogue Systems (ClariQ) Paper • 2009.11352 • Published Sep 23, 2020
Knowledge Distillation of Russian Language Models with Reduction of Vocabulary Paper • 2205.02340 • Published May 4, 2022
Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information Paper • 2311.01326 • Published Nov 2, 2023 • 3
Uncertainty Guided Global Memory Improves Multi-Hop Question Answering Paper • 2311.18151 • Published Nov 29, 2023
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents Paper • 2407.04363 • Published Jul 5, 2024 • 33
Complexity of Symbolic Representation in Working Memory of Transformer Correlates with the Complexity of a Task Paper • 2406.14213 • Published Jun 20, 2024 • 21
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack Paper • 2406.10149 • Published Jun 14, 2024 • 52
In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss Paper • 2402.10790 • Published Feb 16, 2024 • 42