Josh Harris
jah242
AI & ML interests
None yet
Recent Activity
upvoted a paper 4 days ago
Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information
upvoted a paper 11 months ago
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
upvoted a paper 11 months ago
Are We Done with MMLU?
Organizations
models 0
None public yet
datasets 0
None public yet