Jay Gil
JayGilDS
AI & ML interests
NLP, LLMs, IA in healthcare
Organizations
None yet
Benchmarks
-
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
Paper • 2409.02813 • Published • 31 -
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Paper • 2404.16006 • Published -
Running4.64k4.64k
LMArena Leaderboard
🏆Display LMArena Leaderboard
Models
datasets
Benchmarks
-
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
Paper • 2409.02813 • Published • 31 -
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Paper • 2404.16006 • Published -
Running4.64k4.64k
LMArena Leaderboard
🏆Display LMArena Leaderboard