Shrey Pandit's picture

Shrey Pandit

SP2001

·

https://sites.google.com/view/shrey-pandit/home

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

SkillOrchestra: Learning to Route Agents via Skill Transfer

liked a dataset about 1 month ago

Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b

authored a paper about 1 month ago

Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts

View all activity

Organizations

upvoted a paper 17 days ago

SkillOrchestra: Learning to Route Agents via Skill Transfer

Paper • 2602.19672 • Published 18 days ago • 55

upvoted 2 papers about 2 months ago

Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts

Paper • 2601.17111 • Published Jan 23 • 5

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 201

upvoted a paper 2 months ago

Step-DeepResearch Technical Report

Paper • 2512.20491 • Published Dec 23, 2025 • 86

upvoted 2 papers 4 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 62

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 128

upvoted 4 papers 5 months ago

LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the Wild

Paper • 2510.14240 • Published Oct 16, 2025 • 13

Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms

Paper • 2510.13913 • Published Oct 15, 2025 • 4

Hard2Verify: A Step-Level Verification Benchmark for Open-Ended Frontier Math

Paper • 2510.13744 • Published Oct 15, 2025 • 6

Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

Paper • 2510.06499 • Published Oct 7, 2025 • 33

upvoted 3 papers 6 months ago

SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents

Paper • 2509.06283 • Published Sep 8, 2025 • 17

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 196

Open Data Synthesis For Deep Research

Paper • 2509.00375 • Published Aug 30, 2025 • 72

upvoted 3 papers 7 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 214

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Paper • 2508.06600 • Published Aug 8, 2025 • 41

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published Aug 11, 2025 • 111

upvoted 4 papers 8 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 319

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22, 2025 • 64

WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization

Paper • 2507.15061 • Published Jul 20, 2025 • 60

GTA1: GUI Test-time Scaling Agent

Paper • 2507.05791 • Published Jul 8, 2025 • 27