Ghosh

Sreyan88

AI & ML interests

None yet

Recent Activity

commented on a paper 3 days ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

liked a Space 3 days ago

nvidia/audio-flamingo-2

upvoted a collection 2 months ago

Cosmos

Organizations

None yet

Sreyan88's activity

commented on a paper 3 days ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published 4 days ago • 18

liked a Space 3 days ago

Audio Flamingo 2

🏃

Audio Flamingo 2 Demo

upvoted a collection 2 months ago

Cosmos

Collection

The collection of Cosmos models • 31 items • Updated Jan 17 • 268

authored 4 papers 4 months ago

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24, 2024 • 19

Do Audio-Language Models Understand Linguistic Variations?

Paper • 2410.16505 • Published Oct 21, 2024 • 1

Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation

Paper • 2410.13198 • Published Oct 17, 2024 • 10

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Paper • 2409.09213 • Published Sep 13, 2024 • 13

commented on a paper 4 months ago

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24, 2024 • 19

liked a Space 4 months ago

Synthio Stable Audio Open

📚

Stable audio open model from Synthio paper.

liked a model 4 months ago

sonalkum/synthio-stable-audio-open

Updated Oct 19, 2024 • 2

commented on a paper 5 months ago

Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation

Paper • 2410.13198 • Published Oct 17, 2024 • 10

upvoted a paper 5 months ago

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Paper • 2410.02056 • Published Oct 2, 2024 • 6

commented on a paper 5 months ago

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Paper • 2410.02056 • Published Oct 2, 2024 • 6

commented on a paper 6 months ago

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Paper • 2409.09213 • Published Sep 13, 2024 • 13

liked 2 Spaces 8 months ago

GAMA-IT

🏆

Analyze audio and answer questions about it

GAMA

🌍

Answer questions about audio

upvoted a paper 8 months ago

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Paper • 2406.11768 • Published Jun 17, 2024 • 20

authored 3 papers 9 months ago

data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup

Paper • 2211.01246 • Published Nov 2, 2022

CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP

Paper • 2404.00415 • Published Mar 30, 2024

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Paper • 2406.11768 • Published Jun 17, 2024 • 20