3 5

Amit Agarwal

amitbcp

AI & ML interests

Computer Vision

Recent Activity

authored a paper 2 days ago

BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation

upvoted a paper 3 days ago

BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation

authored a paper 7 days ago

Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy

View all activity

Organizations

amitbcp's activity

authored a paper 2 days ago

BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation

Paper • 2506.00482 • Published 9 days ago • 8

upvoted a paper 3 days ago

BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation

Paper • 2506.00482 • Published 9 days ago • 8

authored 3 papers 7 days ago

Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy

Paper • 2412.17759 • Published Dec 23, 2024

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 98

SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use

Paper • 2505.17332 • Published 17 days ago • 31

upvoted a paper 10 days ago

FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding

Paper • 2505.17330 • Published 17 days ago • 22

commented a paper 10 days ago

FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding

Paper • 2505.17330 • Published 17 days ago • 22 •

upvoted a paper 10 days ago

Hard Negative Mining for Domain-Specific Retrieval in Enterprise Systems

Paper • 2505.18366 • Published 16 days ago • 25

commented a paper 10 days ago

Hard Negative Mining for Domain-Specific Retrieval in Enterprise Systems

Paper • 2505.18366 • Published 16 days ago • 25 •

upvoted a paper 13 days ago

SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use

Paper • 2505.17332 • Published 17 days ago • 31

upvoted a paper 3 months ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 98

authored a paper 5 months ago

MVTamperBench: Evaluating Robustness of Vision-Language Models

Paper • 2412.19794 • Published Dec 27, 2024 • 2

updated a dataset 10 months ago

amitbcp/muir_tsv

Viewer • Updated Aug 20, 2024 • 2.6k • 11

updated a dataset about 1 year ago

amitbcp/nomir

Updated May 15, 2024 • 101