MIKHAIL BURTSEV's picture

5 10 7

MIKHAIL BURTSEV

mbur

·

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

You could have designed state of the art positional encoding

liked a Space about 2 months ago

RMT-team/babilong

authored a paper 5 months ago

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

View all activity

Organizations

authored a paper 5 months ago

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published Feb 18 • 73

authored a paper 6 months ago

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published Jan 22 • 68

authored 8 papers 12 months ago

The Second Conversational Intelligence Challenge (ConvAI2)

Paper • 1902.00098 • Published Jan 31, 2019

ConvAI3: Generating Clarifying Questions for Open-Domain Dialogue Systems (ClariQ)

Paper • 2009.11352 • Published Sep 23, 2020

Knowledge Distillation of Russian Language Models with Reduction of Vocabulary

Paper • 2205.02340 • Published May 4, 2022

Scaling Transformer to 1M tokens and beyond with RMT

Paper • 2304.11062 • Published Apr 19, 2023 • 3

Recurrent Memory Transformer

Paper • 2207.06881 • Published Jul 14, 2022 • 1

Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information

Paper • 2311.01326 • Published Nov 2, 2023 • 2

Uncertainty Guided Global Memory Improves Multi-Hop Question Answering

Paper • 2311.18151 • Published Nov 29, 2023

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5, 2024 • 37

authored 3 papers about 1 year ago

AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents

Paper • 2407.04363 • Published Jul 5, 2024 • 34

Complexity of Symbolic Representation in Working Memory of Transformer Correlates with the Complexity of a Task

Paper • 2406.14213 • Published Jun 20, 2024 • 21

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Paper • 2406.10149 • Published Jun 14, 2024 • 53

authored a paper over 1 year ago

In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss

Paper • 2402.10790 • Published Feb 16, 2024 • 43