Clémentine Fourrier

clefourrier

clefourrier's activity

New activity in demo-leaderboard-backend/leaderboard 8 days ago
New activity in gaia-benchmark/leaderboard 8 days ago
New activity in open-llm-leaderboard/open_llm_leaderboard about 1 month ago

failed open llm leaderboard benchmark

2
#890 opened about 1 month ago by legolasyiu
New activity in open-llm-leaderboard/requests about 1 month ago

Failed LLM benchmark test

2
#55 opened about 1 month ago by legolasyiu
New activity in gaia-benchmark/GAIA about 1 month ago

How can I check my test result?

1
#13 opened about 1 month ago by cheung
New activity in demo-leaderboard/gpt2-demo about 1 month ago

Create README.md

#1 opened about 1 month ago by woodaries888
New activity in open-llm-leaderboard/open_llm_leaderboard about 1 month ago

Failed LLM benchmark request

3
#882 opened about 1 month ago by legolasyiu
New activity in open-llm-leaderboard/requests about 1 month ago

EpistemeAI

2
#50 opened about 1 month ago by legolasyiu
New activity in demo-leaderboard-backend/leaderboard about 1 month ago

🚩 Report: Not working

3
#8 opened about 1 month ago by not-lain
New activity in gaia-benchmark/leaderboard about 2 months ago

F GAIA, runtime error

#19 opened 2 months ago by lauralex
New activity in gaia-benchmark/results_public about 2 months ago

Upload dataset

#4 opened about 2 months ago by clefourrier

Upload dataset

#3 opened about 2 months ago by clefourrier

Delete '2023' config

#2 opened about 2 months ago by clefourrier
New activity in open-llm-leaderboard/open_llm_leaderboard about 2 months ago

Model deleted from Pending

20
#850 opened 2 months ago by dnhkng

Failed evaluation for model

7
#865 opened about 2 months ago by Pretergeek

Gemma-2-9B-it scores

8
#843 opened 2 months ago by saishf

Add fewshot_as_multiturn column

2
#868 opened about 2 months ago by djstrong

Leaderboard not updating?

5
#863 opened about 2 months ago by Pretergeek

Llama-3.1-70B fine-tuned failed

1
#864 opened about 2 months ago by MaziyarPanahi

What are "raw" metrics?

3
#856 opened 2 months ago by aginart-salesforce

Jamba model FAILED

6
#854 opened 2 months ago by devingulliver

voting-system-update

19
#844 opened 2 months ago by alozowski