This is excellent, @ngxson. No errors and very crisp.
Pratik Bhavsar PRO
pratikbhavsar
AI & ML interests
LLM agents, evaluation & reasoning
Recent Activity
commented on
their
article
4 days ago
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios
updated
a Space
7 days ago
galileo-ai/agent-leaderboard
pratikbhavsar's activity

commented on
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios
4 days ago

commented on
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios
4 days ago
Thank you Erin! We will continue to update this further with more LLMs :)

published
an
article
9 days ago
Article
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios
By
and 1 other
12
upvoted
an
article
22 days ago
Article
Open-R1: a fully open reproduction of DeepSeek-R1 • 768
bespokelabs/Bespoke-Stratos-17k • Viewer • Updated • 16.7k • 92.5k • 277
open-thoughts/OpenThoughts-114k • Viewer • Updated • 228k • 97.4k • 571
PrimeIntellect/NuminaMath-QwQ-CoT-5M • Viewer • Updated • 5.14M • 3.73k • 48
ServiceNow-AI/R1-Distill-SFT • Viewer • Updated • 1.85M • 6.91k • 253
cognitivecomputations/dolphin-r1 • Viewer • Updated • 814k • 5.35k • 261

upvoted
a
collection
22 days ago
Does this have tooling support? • 4
#7 opened about 1 month ago by xceptor


upvoted
a
collection
6 months ago