Introducing the Open Chain of Thought Leaderboard
Building breakthrough AI to solve the world's biggest problems.
Display and filter reward model evaluation data
Render a leaderboard for model evaluation
Embed and use ZeroEval for evaluation tasks
Chat with advanced language models