Andrey Sokolov

shmpanski

AI & ML interests

llm, translation, summarization and semantic models

Recent Activity

liked a model about 1 month ago

deepvk/USER2-base

shmpanski's activity

upvoted a paper 27 days ago

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published Apr 8 • 109

upvoted a paper 3 months ago

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published Feb 13 • 38

upvoted a collection 3 months ago

RuModernBERT

Modernized BERT for Russian • 2 items • Updated Feb 19 • 5

upvoted a paper 3 months ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 115

upvoted a collection 6 months ago

Cultura-Ru-Edu

Our dataset for enhancing LLM training with educational content in the Russian language. • 2 items • Updated Nov 26, 2024 • 5

upvoted a paper 8 months ago

METR: Image Watermarking with Large Number of Unique Messages

Paper • 2408.08340 • Published Aug 15, 2024 • 5

upvoted a collection 9 months ago

Vision-Language Modeling

Our datasets and models for Visual-Language Modeling • 5 items • Updated Nov 25, 2024 • 6

upvoted a collection 11 months ago

USER

Collection of models and dataset for sentence encoder task • 4 items • Updated about 1 month ago • 7

upvoted 2 papers about 1 year ago

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 87

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 82