40 92 101

Somshubra Majumdar

smajumdar94

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

upvoted a paper 3 days ago

HardTests: Synthesizing High-Quality Test Cases for LLM Coding

liked a model 7 days ago

deepseek-ai/DeepSeek-R1-0528

View all activity

Organizations

smajumdar94's activity

upvoted 2 papers 3 days ago

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published 6 days ago • 84

HardTests: Synthesizing High-Quality Test Cases for LLM Coding

Paper • 2505.24098 • Published 7 days ago • 41

upvoted a paper 13 days ago

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published 16 days ago • 60

upvoted an article 14 days ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

•

16 days ago

• 18

upvoted a paper 15 days ago

Reward Reasoning Model

Paper • 2505.14674 • Published 16 days ago • 34

upvoted an article 22 days ago

Article

Blazingly fast whisper transcriptions with Inference Endpoints

and 5 others •

24 days ago

• 67

upvoted 2 collections about 1 month ago

OpenMathReasoning

Collection

Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated about 4 hours ago • 40

OpenCodeReasoning

Collection

Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding • 7 items • Updated about 4 hours ago • 16

upvoted an article about 1 month ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

•

Apr 25

• 267

upvoted a paper about 2 months ago

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published Apr 14 • 84

upvoted an article about 2 months ago

Article

Introducing smolagents: simple agents that write actions in code.

and 2 others •

Dec 31, 2024

• 1.06k

upvoted 7 papers 2 months ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published Apr 3 • 54

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published Mar 31 • 62

upvoted an article 3 months ago

Article

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

and 4 others •

Mar 18

• 41

upvoted a paper 3 months ago

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published Mar 13 • 29