Aritra Roy Gosthipaty's picture

Aritra Roy Gosthipaty PRO

ariG23498

AI & ML interests

Deep Representation Learning

Recent Activity

Organizations

Hugging Face's profile picture Google's profile picture Notebooks-explorers's profile picture 🧨Diffusers's profile picture PyTorch Image Models's profile picture Keras's profile picture Cohere Labs's profile picture Hugging Test Lab's profile picture Hugging Face Fellows's profile picture Probing ViTs's profile picture TrystAI's profile picture PyImageSearch's profile picture Keras Dreambooth Event's profile picture Hugging Face OSS Metrics's profile picture Blog-explorers's profile picture ZeroGPU Explorers's profile picture kotol's profile picture gg-hf's profile picture MLX Community's profile picture IBM Granite's profile picture Open Generative Fill's profile picture Social Post Explorers's profile picture Hugging Face Discord Community's profile picture nltpt's profile picture nltpt-q's profile picture qrias's profile picture Hugging Face Science's profile picture open/ acc's profile picture wut?'s profile picture LLM from Scratch's profile picture s0225's profile picture gg-hf-g's profile picture llrehf's profile picture University of Science and Technology of China's profile picture Model Metadata's profile picture all things vision LMs's profile picture

ariG23498's activity

commented on KV Cache from scratch in nanoVLM 1 day ago
posted an update 1 day ago
view post
Post
713
🚨 Implement KV Cache from scratch in pure PyTorch. 🚨

We have documented all of our learning while implementing KV Cache to nanoVLM. Joint work with @kashif @lusxvr @andito @pcuenq

Blog: hf.co/blog/kv-cache
  • 1 reply
·
upvoted an article 1 day ago
reacted to danielhanchen's post with 🔥 2 days ago
upvoted an article 2 days ago
view article
Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By danaaubakirova and 8 others
87
published an article 2 days ago
published an article 3 days ago
view article
Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By danaaubakirova and 8 others
87