VerlTool

community

https://github.com/TIGER-AI-Lab/verl-tool

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

DongfuJiang updated a model 29 days ago

VerlTool/acecoder-fsdp_agent-qwen_qwen2.5-coder-1.5b-instruct-grpo-69k-sys12-mtrl-d1fo-535-step

DongfuJiang published a model 29 days ago

VerlTool/acecoder-fsdp_agent-qwen_qwen2.5-coder-1.5b-instruct-grpo-69k-sys12-mtrl-d1fo-535-step

DongfuJiang updated a model 29 days ago

VerlTool/acecoder-fsdp_agent-qwen_qwen2.5-coder-1.5b-instruct-grpo-69k-sys12-mtrl-d1fo-280-step

View all activity

DongfuJiang

updated a model 29 days ago

VerlTool/acecoder-fsdp_agent-qwen_qwen2.5-coder-1.5b-instruct-grpo-69k-sys12-mtrl-d1fo-535-step

2B • Updated 29 days ago • 9

DongfuJiang

published a model 29 days ago

VerlTool/acecoder-fsdp_agent-qwen_qwen2.5-coder-1.5b-instruct-grpo-69k-sys12-mtrl-d1fo-535-step

2B • Updated 29 days ago • 9

DongfuJiang

updated a model 29 days ago

VerlTool/acecoder-fsdp_agent-qwen_qwen2.5-coder-1.5b-instruct-grpo-69k-sys12-mtrl-d1fo-280-step

2B • Updated 29 days ago • 12

ZhuofengLi

updated a model about 1 month ago

VerlTool/torl-deep_math-fsdp_agent-qwen2.5-math-7b-grpo-n16-b128-t1.0-lr1e-6-310-step

8B • Updated about 1 month ago • 36

ZhuofengLi

published a model about 1 month ago

VerlTool/torl-deep_math-fsdp_agent-qwen2.5-math-7b-grpo-n16-b128-t1.0-lr1e-6-310-step

8B • Updated about 1 month ago • 36

ZhuofengLi

updated a model about 1 month ago

VerlTool/torl-deep_math-fsdp_agent-qwen2.5-math-1.5b-grpo-n16-b128-t1.0-lr1e-6-320-step

2B • Updated May 27 • 32

ZhuofengLi

published a model about 1 month ago

VerlTool/torl-deep_math-fsdp_agent-qwen2.5-math-1.5b-grpo-n16-b128-t1.0-lr1e-6-320-step

2B • Updated May 27 • 32

DongfuJiang

authored a paper about 1 month ago

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Paper • 2505.20139 • Published May 26 • 18

ZhuofengLi

authored a paper about 1 month ago

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Paper • 2505.20139 • Published May 26 • 18

DongfuJiang

authored a paper about 1 month ago

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Paper • 2505.16175 • Published May 22 • 40

ZhuofengLi

updated a model about 1 month ago

VerlTool/torl-deep_math-fsdp-qwen2.5-math-1.5b-grpo-n16-b128-t1.0-lr1e-6-830-step

2B • Updated May 25 • 11

ZhuofengLi

published a model about 1 month ago

VerlTool/torl-deep_math-fsdp-qwen2.5-math-1.5b-grpo-n16-b128-t1.0-lr1e-6-830-step

2B • Updated May 25 • 11

DongfuJiang

authored a paper about 1 month ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20 • 22

ZhuofengLi

authored 2 papers about 1 month ago

TEG-DB: A Comprehensive Dataset and Benchmark of Textual-Edge Graphs

Paper • 2406.10310 • Published Jun 14, 2024 • 1

VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation

Paper • 2505.14640 • Published May 20 • 14

DongfuJiang

published a model about 1 month ago

VerlTool/acecoder-fsdp_agent-qwen_qwen2.5-coder-1.5b-instruct-grpo-69k-sys12-mtrl-d1fo-280-step

2B • Updated 29 days ago • 12

DongfuJiang

updated 2 datasets about 1 month ago

VerlTool/AceCoderV2-69K-cleaned

Viewer • Updated May 18 • 69k • 50

VerlTool/AceCoderV2-122K-cleaned

Viewer • Updated May 18 • 123k • 42

DongfuJiang

published 2 datasets about 1 month ago

VerlTool/AceCoderV2-122K-cleaned

Viewer • Updated May 18 • 123k • 42

VerlTool/AceCoderV2-69K-cleaned

Viewer • Updated May 18 • 69k • 50