Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models Paper • 2506.06395 • Published Jun 5, 2025 • 127
Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts Paper • 2506.05229 • Published Jun 5, 2025 • 37
Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA Paper • 2505.21115 • Published May 27, 2025 • 135
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents Paper • 2505.20411 • Published May 26, 2025 • 87
Exploring the Latent Capacity of LLMs for One-Step Text Generation Paper • 2505.21189 • Published May 27, 2025 • 62
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper • 2504.06261 • Published Apr 8, 2025 • 110
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published Mar 12, 2025 • 72
MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections Paper • 2502.12170 • Published Feb 13, 2025 • 12
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20, 2025 • 175
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published Feb 20, 2025 • 91
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Paper • 2502.13063 • Published Feb 18, 2025 • 73
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 models. • 15 items • Updated Dec 6, 2024 • 622
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Paper • 2412.06531 • Published Dec 9, 2024 • 73
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned variants in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Apr 28 • 626
MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale Paper • 2409.00134 • Published Aug 29, 2024 • 2