3 3 28

Ayaan Sharif

Ayaan-Sharif

https://shariif.tech

AI & ML interests

LLM, multimodals, reinforcement learning

Recent Activity

liked a model about 2 months ago

ZuluVision/MoviiGen1.1

liked a model about 2 months ago

parthiv11/indic_whisper_hi_multi_gpu

liked a Space 2 months ago

3DAIGC/MotionShop2

View all activity

Organizations

liked 2 models about 2 months ago

ZuluVision/MoviiGen1.1

Text-to-Video • Updated 26 days ago • 1.38k • 92

parthiv11/indic_whisper_hi_multi_gpu

Automatic Speech Recognition • Updated Feb 28, 2024 • 44 • 5

liked a Space 2 months ago

155

MotionShop2

🏃

Replace characters in a video with characters in photos

liked 2 Spaces 3 months ago

Vevo for Zero-shot VC, TTS, and More

🐠

Controllable Zero-Shot Voice Imitation

809

Sesame CSM

🌱

Conversational speech generation

liked a model 3 months ago

sesame/csm-1b

Text-to-Speech • 2B • Updated May 27 • 30.5k • • 2.13k

updated a model 3 months ago

Ayaan-Sharif/qwen2-7b-instruct-trl-sft-ChartQA

Updated Apr 10

published a model 3 months ago

Ayaan-Sharif/qwen2-7b-instruct-trl-sft-ChartQA

Updated Apr 10

liked a model 5 months ago

ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4

Reinforcement Learning • 8B • Updated Mar 26 • 3.24k • 217

New activity in sanchit-gandhi/whisper-jax 5 months ago

The whisper jax demo is not working. Error messages

👍 3

#18 opened about 2 years ago by

ray608

liked a model 6 months ago

MiniMaxAI/MiniMax-VL-01

Image-Text-to-Text • 456B • Updated 11 days ago • 25k • 275

liked a dataset 6 months ago

DAMO-NLP-SG/multimodal_textbook

Updated Mar 17 • 2.06k • 145

liked 2 models 6 months ago

dphn/dolphin-2.9-llama3-8b

Text Generation • 8B • Updated May 20, 2024 • 1.94k • 449

dphn/Dolphin3.0-Llama3.2-1B

1B • Updated Apr 25 • 1.81k • 30

replied to sanchit-gandhi's post 6 months ago

what if we segment the audio first and then transcribe tho its some extra compute to throw in but imo it would resul tin better result !

liked 3 Spaces 6 months ago

Turn any ebook into audiobook, 1107+ languages supported!

liked a model 7 months ago

huggyllama/llama-7b

Text Generation • 7B • Updated Jul 2, 2024 • 65.4k • 336

commented a paper 7 months ago

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published Dec 13, 2024 • 18 •

Ayaan Sharif

AI & ML interests

Recent Activity

Organizations

Ayaan-Sharif's activity

MotionShop2

Vevo for Zero-shot VC, TTS, and More

Sesame CSM

The whisper jax demo is not working. Error messages

ClipVideo

Whisper JAX

Ebook2audiobook v25.7.12