Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zk67 's Collections
LLM Evaluation
Foundation Models and AGI
Model Architecture
Instruction Tuning
Agent AI
Training
LLM Data
inference optimization
Ilya Papers
LLM Reasoning Papers
LLM Tech Report
LLM Post Training
LLM Pre-Train

LLM Evaluation

updated Jan 20
Upvote
-

  • Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

    Paper • 2306.05685 • Published Jun 9, 2023 • 36

    Note MT-Bench and Arena MT-Bench-101 https://arxiv.org/abs/2402.14762

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs