2 2

Rebecca Qian

RebeccaQian1

AI & ML interests

None yet

Recent Activity

liked a Space 27 days ago

PatronusAI/TRAIL

new activity 11 months ago

QuantFactory/Llama-3-Patronus-Lynx-8B-Instruct-GGUF:Update README.md

updated a model 11 months ago

PatronusAI/Llama-3-Patronus-Lynx-70B-Instruct-Q4_K_M-GGUF

View all activity

Organizations

RebeccaQian1's activity

liked a Space 27 days ago

TRAIL Leaderboard

🥇

Trace Reasoning and Agentic Issue Localization Leaderboard

New activity in QuantFactory/Llama-3-Patronus-Lynx-8B-Instruct-GGUF 11 months ago

Update README.md

#1 opened 11 months ago by

RebeccaQian1

updated 2 models 11 months ago

PatronusAI/Llama-3-Patronus-Lynx-70B-Instruct-Q4_K_M-GGUF

Updated Jul 17, 2024 • 20

PatronusAI/Llama-3-Patronus-Lynx-8B-Instruct-Q4_K_M-GGUF

Updated Jul 17, 2024 • 128 • 24

reacted to clefourrier's post with ❤️ over 1 year ago

Post

🔥 New LLM leaderboard on the hub: an Enterprise Scenarios Leaderboard!

This work evaluates LLMs on several real world use cases (Finance documents, Legal confidentiality, Customer support, ...), which makes it grounded, and interesting for companies! 🏢
Bonus: the test set is private, so it's hard to game 🔥
PatronusAI/enterprise_scenarios_leaderboard

Side note: I discovered through this benchmark that you could evaluate "Engagingness" of an LLM, which could also be interesting for our LLM fine-tuning community out there.

Read more about their different tasks and metrics in the intro blog: https://huggingface.co/blog/leaderboards-on-the-hub-patronus

Congrats to @sunitha98 who led the leaderboard implementation, and to @rebeccaqian and @anandnk24 , all at Patronus AI !

2 replies

liked a dataset over 1 year ago

PatronusAI/financebench

Viewer • Updated Nov 17, 2024 • 150 • 1.17k • 101