Bhavish Pahwa
bpHigh
AI & ML interests
Hate Speech, Domain Identification, NER, Summarisation, Performance metrics for various NLP tasks
Recent Activity
authored
a paper
about 2 months ago
MMTEB: Massive Multilingual Text Embedding Benchmark
authored
a paper
9 months ago
Data Contamination Report from the 2024 CONDA Shared Task
updated
a dataset
11 months ago
bpHigh/BIRCO_Doris_Mae_Without_Task_Awareness
Organizations
bpHigh's activity
GPT-3.5 Spider contamination based on https://arxiv.org/pdf/2402.08100
3
#18 opened about 1 year ago
by
bpHigh

Should indirect data leakages be included in the Data Contamination Database?
2
#19 opened about 1 year ago
by
bpHigh
