Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Andrew Jardine's picture

3 1

Andrew Jardine

2legit2overfit

tiandilou's profile picture

Daduerding's profile picture

thomwolf's profile picture

·

AI & ML interests

None yet

Organizations

2legit2overfit 's collections 4

miqudev/miqu-1-70b

69B • Updated Feb 4, 2024 • 476 • 986
Qwen/Qwen1.5-72B-Chat

Text Generation • 72B • Updated Oct 8, 2024 • 11.6k • 217
segolilylabs/Lily-Cybersecurity-7B-v0.2

Text Generation • 7B • Updated Jan 22, 2024 • 1.14k • 108
nomic-ai/nomic-embed-text-v1

Sentence Similarity • 0.1B • Updated Mar 31 • 1.12M • 531

My leaderboards

Running

222

222

AI2 WildBench Leaderboard (V2)

🦁

Display and explore model leaderboards and chat history
Running

4.5k

4.5k

Chatbot Arena Leaderboard

🏆

Show chatbot performance leaderboard
Running on CPU Upgrade

13.3k

13.3k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade

5.97k

5.97k

MTEB Leaderboard

🥇

Embedding Leaderboard

JudgeLM: Fine-tuned Large Language Models are Scalable Judges

Paper • 2310.17631 • Published Oct 26, 2023 • 35
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 55
Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 110
BitDelta: Your Fine-Tune May Only Be Worth One Bit

Paper • 2402.10193 • Published Feb 15, 2024 • 23

My Fav datasets

berkeley-nest/Nectar

Viewer • Updated Mar 20, 2024 • 183k • 641 • 291
HuggingFaceTB/cosmopedia

Viewer • Updated Aug 12, 2024 • 31.1M • 5.39k • 624
Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 11.2k • 1.37k
openbmb/UltraFeedback

Viewer • Updated Dec 29, 2023 • 64k • 2.28k • 371

miqudev/miqu-1-70b

69B • Updated Feb 4, 2024 • 476 • 986
Qwen/Qwen1.5-72B-Chat

Text Generation • 72B • Updated Oct 8, 2024 • 11.6k • 217
segolilylabs/Lily-Cybersecurity-7B-v0.2

Text Generation • 7B • Updated Jan 22, 2024 • 1.14k • 108
nomic-ai/nomic-embed-text-v1

Sentence Similarity • 0.1B • Updated Mar 31 • 1.12M • 531

JudgeLM: Fine-tuned Large Language Models are Scalable Judges

Paper • 2310.17631 • Published Oct 26, 2023 • 35
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 55
Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 110
BitDelta: Your Fine-Tune May Only Be Worth One Bit

Paper • 2402.10193 • Published Feb 15, 2024 • 23

My leaderboards

Running

222

222

AI2 WildBench Leaderboard (V2)

🦁

Display and explore model leaderboards and chat history
Running

4.5k

4.5k

Chatbot Arena Leaderboard

🏆

Show chatbot performance leaderboard
Running on CPU Upgrade

13.3k

13.3k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade

5.97k

5.97k

MTEB Leaderboard

🥇

Embedding Leaderboard

My Fav datasets

berkeley-nest/Nectar

Viewer • Updated Mar 20, 2024 • 183k • 641 • 291
HuggingFaceTB/cosmopedia

Viewer • Updated Aug 12, 2024 • 31.1M • 5.39k • 624
Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 11.2k • 1.37k
openbmb/UltraFeedback

Viewer • Updated Dec 29, 2023 • 64k • 2.28k • 371

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs