wongyukim's picture

wongyukim

wongyukim

·

kimwongyuda

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

upvoted a paper 1 day ago

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

upvoted a paper 3 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

View all activity

Organizations

None yet

upvoted 2 papers 1 day ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published 4 days ago • 25

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published 4 days ago • 72

upvoted a paper 3 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published 17 days ago • 102

upvoted 3 papers 4 days ago

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Paper • 2508.09834 • Published 10 days ago • 46

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Paper • 2508.13142 • Published 5 days ago • 31

Ovis2.5 Technical Report

Paper • 2508.11737 • Published 8 days ago • 99

upvoted 4 papers 5 days ago

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published 9 days ago • 53

Thyme: Think Beyond Images

Paper • 2508.11630 • Published 8 days ago • 75

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published 9 days ago • 87

DINOv3

Paper • 2508.10104 • Published 10 days ago • 179

upvoted a paper 8 days ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published 9 days ago • 135

upvoted 5 papers 9 days ago

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

Paper • 2508.09726 • Published 11 days ago • 11

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Paper • 2508.05613 • Published 16 days ago • 17

AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving

Paper • 2508.09889 • Published 10 days ago • 32

Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery

Paper • 2508.08401 • Published 12 days ago • 38

Story2Board: A Training-Free Approach for Expressive Storyboard Generation

Paper • 2508.09983 • Published 10 days ago • 63

upvoted 4 papers 10 days ago

Adversarial Video Promotion Against Text-to-Video Retrieval

Paper • 2508.06964 • Published 14 days ago • 9

Train Long, Think Short: Curriculum Learning for Efficient Reasoning

Paper • 2508.08940 • Published 11 days ago • 22

Complex Logical Instruction Generation

Paper • 2508.09125 • Published 11 days ago • 38

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

Paper • 2508.05748 • Published 16 days ago • 117