-
VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
Paper • 2504.05118 • Published • 26 -
T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models
Paper • 2504.04718 • Published • 41 -
SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement
Paper • 2504.03561 • Published • 18 -
Concept Lancet: Image Editing with Compositional Representation Transplant
Paper • 2504.02828 • Published • 17
Yeshua W Brian
HariharaIII
AI & ML interests
Seeking to put my unique knowledge into its absolution to bring Automated Intelligence the depth & light of a wisdom of its own. Harmonizing all life & consciousness towards the process of continuum.
Organizations
None yet