MoshiVis v0.1 (Collection): a vision-speech model built as a perceptually augmented version of Moshi v0.1 for conversing about image inputs. 8 items.
Training and Inference Efficiency of Encoder-Decoder Speech Models (Paper, arXiv:2503.05931).
Cosmos Transfer1 (Collection): a world foundation model for domain transfer. 5 items.
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM (Article).
LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! (Article).
Jamba 1.6 (Collection): the AI21 Jamba family of hybrid SSM-Transformer foundation models, outperforming open-model competitors on quality and speed. 2 items.
C4AI Aya Vision (Collection): a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. 5 items.
C4AI Aya Expanse (Collection): an open-weight research release of a model with highly advanced multilingual capabilities. 4 items.
A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality (Article).
DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion (Paper, arXiv:2503.01183).