FrenchBench Evaluation datasets Collection These datasets are used to evaluate models on French performance using: https://github.com/EleutherAI/lm-evaluation-harness (from CroissantLLM paper) • 11 items • Updated Jun 7, 2024 • 7
Running on CPU Upgrade 12.6k 12.6k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots