-
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 89 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 98 -
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper • 2501.11425 • Published • 99 -
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 25
Shyam Sunder Kumar
theainerd
AI & ML interests
Natural Language Processing
Recent Activity
liked
a model
about 16 hours ago
nvidia/canary-1b-flash
liked
a model
about 18 hours ago
deepseek-ai/DeepSeek-V3-0324
Organizations
Collections
4
-
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 79 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 59 -
Evolving Deeper LLM Thinking
Paper • 2501.09891 • Published • 111 -
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Paper • 2501.12599 • Published • 110
models
2
datasets
None public yet