HKUST NLP Group

university

AI & ML interests

None defined yet.

Recent Activity

ksshumab submitted a paper about 2 months ago

Qwen3-Coder-Next Technical Report

AndrewZeng authored a paper 2 months ago

LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth

yuzhen17 authored a paper 2 months ago

LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth

Papers

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth

Collections 12

models 66

datasets 32

hkust-nlp/drkernel-validation-data

Viewer • Updated Feb 6 • 100 • 70 • 1

hkust-nlp/drkernel-rl-data

Viewer • Updated Feb 6 • 72k • 118

hkust-nlp/drkernel-coldstart-8k

Viewer • Updated Feb 6 • 8.92k • 274 • 2

hkust-nlp/Toolathlon-Trajectories

Preview • Updated Dec 5, 2025 • 3.23k • 19

hkust-nlp/WebExplorer-QA

Viewer • Updated Nov 22, 2025 • 100 • 36 • 7

hkust-nlp/CodeIO-PyEdu-Reasoning-Raw

Updated Jun 18, 2025 • 30 • 2

hkust-nlp/CodeIO-PyEdu-Reasoning

Preview • Updated Jun 18, 2025 • 146 • 57

hkust-nlp/rl-verifier-pitfalls_hacking_data

Viewer • Updated May 28, 2025 • 6.12k • 13 • 1

hkust-nlp/deepscaler_simplelr

Viewer • Updated May 28, 2025 • 40.3k • 21

hkust-nlp/Laser-Deepscaler-Dataset

Viewer • Updated May 21, 2025 • 40.8k • 36

Collections 12

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

hkust-nlp/drkernel-14b

hkust-nlp/drkernel-8b

hkust-nlp/drkernel-14b-coldstart

hkust-nlp/Toolathlon-Trajectories

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

models 66

hkust-nlp/drkernel-8b-coldstart

hkust-nlp/drkernel-14b-coldstart

hkust-nlp/drkernel-14b

hkust-nlp/drkernel-8b

hkust-nlp/WebExplorer-8B

hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier

hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B

hkust-nlp/Qwen-2.5-7B-Verifier-HF

hkust-nlp/R1-Distill-Verifier-1.5B

hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B

Team members 15
