Kelly Chiu's picture

1 2 4

Kelly Chiu PRO

kellycyy

·

https://kellycyy.github.io/

kellychiuyy

AI & ML interests

None yet

Recent Activity

updated a collection about 4 hours ago

updated a collection about 4 hours ago

updated a collection about 4 hours ago

View all activity

Organizations

kellycyy's activity

updated 3 collections about 4 hours ago

DailyDilemmas

2 items • Updated about 4 hours ago

AIRiskDilemmas

2 items • Updated about 4 hours ago

CulturalBench

A Robust, Diverse and Challegning Benchmark for Measuring Cultural Knowledge of LLMs • 6 items • Updated about 4 hours ago

updated a dataset 16 days ago

kellycyy/AIRiskDilemmas

Viewer • Updated 16 days ago • 42.6k • 188

upvoted a paper 16 days ago

Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas

Paper • 2505.14633 • Published 17 days ago • 3

commented a paper 16 days ago

Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas

Paper • 2505.14633 • Published 17 days ago • 3 •

published a dataset 23 days ago

kellycyy/AIRiskDilemmas

Viewer • Updated 16 days ago • 42.6k • 188

updated a dataset 8 months ago

kellycyy/daily_dilemmas

Viewer • Updated Oct 15, 2024 • 17.7k • 141 • 3

updated a Space 8 months ago

CulturalBench

Display leaderboard for model evaluation

updated a dataset 8 months ago

kellycyy/CulturalBench

Viewer • Updated Oct 14, 2024 • 6.14k • 700 • 4

authored a paper 8 months ago

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Paper • 2410.02677 • Published Oct 3, 2024

updated a collection 8 months ago

CulturalBench

A Robust, Diverse and Challegning Benchmark for Measuring Cultural Knowledge of LLMs • 6 items • Updated about 4 hours ago