Microsoft Research

company

https://www.microsoft.com/en-us/research/

AI & ML interests

None defined yet.

Recent Activity

ZongqianLi submitted a paper 4 days ago

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

ZongqianLi submitted a paper 4 days ago

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

ACID23333 submitted a paper 4 days ago

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

View all activity

Papers

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

View all Papers

MicrosoftResearch 's Papers 27

Submitted by

Zongqian Li

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

MicrosoftResearch

Microsoft Research

Submitted by

Zongqian Li

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

MicrosoftResearch

Microsoft Research

Submitted by

Akshay Nambi

Scaling Agentic Capabilities, Not Context: Efficient Reinforcement Finetuning for Large Toolspaces

MicrosoftResearch

Microsoft Research

Submitted by

ZD

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

MicrosoftResearch

Microsoft Research

Submitted by

taesiri

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

MicrosoftResearch

Microsoft Research

5

Submitted by

Akshay Nambi

Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use

MicrosoftResearch

Microsoft Research

Submitted by

Baolin Peng

Reinforcement World Model Learning for LLM-based Agents

MicrosoftResearch

Microsoft Research

Submitted by

junchao-cuhk

LIVE: Long-horizon Interactive Video World Modeling

MicrosoftResearch

Microsoft Research

Submitted by

Baohao Liao

Self-Hinting Language Models Enhance Reinforcement Learning

MicrosoftResearch

Microsoft Research

Submitted by

Xiao Liang

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

MicrosoftResearch

Microsoft Research

Submitted by

Hang Guo

Efficient Autoregressive Video Diffusion with Dummy Head

MicrosoftResearch

Microsoft Research

Submitted by

Daixuan Cheng

LLM-in-Sandbox Elicits General Agentic Intelligence

MicrosoftResearch

Microsoft Research

Submitted by

Li Dong

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

MicrosoftResearch

Microsoft Research

Submitted by

taesiri

X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests

MicrosoftResearch

Microsoft Research

Kascade: A Practical Sparse Attention Method for Long-Context LLM Inference

MicrosoftResearch

Microsoft Research

Submitted by

Jialuo Li

Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding

MicrosoftResearch

Microsoft Research

Submitted by

ytz

Black-Box On-Policy Distillation of Large Language Models

MicrosoftResearch

Microsoft Research

Submitted by

Tim Davidson

The Collaboration Gap

MicrosoftResearch

Microsoft Research

Submitted by

Akshay Nambi

Magentic Marketplace: An Open-Source Environment for Studying Agentic Markets

MicrosoftResearch

Microsoft Research

Submitted by

Jack

Code Aesthetics with Agentic Reward Feedback

MicrosoftResearch

Microsoft Research

Submitted by

Li Lyna Zhang

LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts

MicrosoftResearch

Microsoft Research

5

Submitted by

HUANG SHAOHAN

BitNet Distillation

MicrosoftResearch

Microsoft Research

Submitted by

Junpeng Liu

DocReward: A Document Reward Model for Structuring and Stylizing

MicrosoftResearch

Microsoft Research

Submitted by

Martina Vilas

Tracing the Traces: Latent Temporal Signals for Efficient and Accurate Reasoning

MicrosoftResearch

Microsoft Research

2

Submitted by

Zijian Li

PixelCraft: A Multi-Agent System for High-Fidelity Visual Reasoning on Structured Images

MicrosoftResearch

Microsoft Research

Submitted by

Yifei Shen

Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective

MicrosoftResearch

Microsoft Research

Submitted by

Li Dong

VibeVoice Technical Report

MicrosoftResearch

Microsoft Research