Yan Varakin

ZDPLI

https://www.researchgate.net/profile/Yan-Varakin

AI & ML interests

All areas of NLP, computational mathematics, reinforcement learning, robotics.

ZDPLI's activity

upvoted 2 articles 4 days ago

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Apr 5, 2023

• 36

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

• 53

upvoted a paper 4 days ago

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published 15 days ago • 43

upvoted 4 papers 6 days ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published 14 days ago • 30

upvoted a paper 7 days ago

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published 16 days ago • 86

upvoted 5 papers 4 months ago

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents

Paper • 2501.11858 • Published Jan 21 • 7

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 74

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published Jan 24 • 28

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 92

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7 • 28

upvoted 2 papers 5 months ago

VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models

Paper • 2412.01822 • Published Dec 2, 2024 • 15

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

Paper • 2411.15139 • Published Nov 22, 2024 • 15

upvoted a paper 6 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 125

upvoted a collection 6 months ago

Medical QA Datasets

Collection

A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22 • 38

upvoted a paper 6 months ago

Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning

Paper • 2410.21845 • Published Oct 29, 2024 • 14