
Devin Thang

winvswon78

devininthelab

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

Aligning Latent Spaces with Flow Priors

upvoted an article 24 days ago

KV Cache from scratch in nanoVLM

updated a dataset 29 days ago

lmms-lab/TOMATO



upvoted a paper 22 days ago

Aligning Latent Spaces with Flow Priors

Paper • 2506.05240 • Published 23 days ago • 25

upvoted an article 24 days ago

KV Cache from scratch in nanoVLM

Article • and 4 others • 25 days ago • 79

upvoted an article about 1 month ago

Vision Language Models (Better, Faster, Stronger)

Article • and 4 others • May 12 • 465

upvoted a paper about 1 month ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20 • 130

upvoted an article about 1 month ago

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Article • and 6 others • May 21 • 174

upvoted a collection about 2 months ago

Aero-1-Audio

Collection • 2 items • Updated May 1 • 1

upvoted a collection 3 months ago

Qwen2.5-Omni

Collection • End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated May 21 • 145

upvoted a paper 4 months ago

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10 • 30

upvoted a collection 4 months ago

Ola

Collection • Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment • 4 items • Updated Feb 21 • 3

upvoted a paper 4 months ago

Lost in the Middle: How Language Models Use Long Contexts

Paper • 2307.03172 • Published Jul 6, 2023 • 40

upvoted 2 articles 5 months ago

Mixture of Experts Explained

Article • and 5 others • Dec 11, 2023 • 701

How NuminaMath Won the 1st AIMO Progress Prize

Article • and 7 others • Jul 11, 2024 • 121

upvoted a paper 5 months ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 26

upvoted a paper 6 months ago

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 18
