Vince

bolerovt

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Scaling Law for Quantization-Aware Training

upvoted a paper 4 days ago

MMaDA: Multimodal Large Diffusion Language Models

upvoted a paper 4 days ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Organizations

None yet

bolerovt's activity

upvoted 20 papers 4 days ago

Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance

Paper • 2505.16348 • Published May 22 • 51

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Paper • 2505.17894 • Published about 1 month ago • 217

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published 28 days ago • 102

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published 26 days ago • 122

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published 25 days ago • 92

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published 24 days ago • 127

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 21 days ago • 160

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published 24 days ago • 200

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published 18 days ago • 61

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published 27 days ago • 129

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Paper • 2506.07491 • Published 14 days ago • 38

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published 14 days ago • 81

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published 15 days ago • 105

Reinforcement Pre-Training

Paper • 2506.08007 • Published 14 days ago • 220

Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation

Paper • 2506.09991 • Published 12 days ago • 55

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published 13 days ago • 89

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published 18 days ago • 121