kevineen

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

retrieva-jp/amber-large

liked a model 6 days ago

tencent/HunyuanVideo-I2V

upvoted a collection 6 days ago

Japanese Novel Reward Model

kevineen's activity

upvoted a collection 6 days ago

Japanese Novel Reward Model

Collection

Japanese Novel Reward Model/日本語小説評価モデル • 5 items • Updated 9 days ago • 2

upvoted an article 9 days ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 225

upvoted a paper 19 days ago

FlexiViT: One Model for All Patch Sizes

Paper • 2212.08013 • Published Dec 15, 2022 • 1

upvoted a paper 20 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 21 days ago • 129

upvoted an article 20 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

21 days ago

• 205

upvoted an article 21 days ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

22 days ago

• 65

upvoted a paper 22 days ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published 23 days ago • 56

upvoted a paper 25 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 28 days ago • 143

upvoted an article 25 days ago

Article

We now support VLMs in smolagents!

Jan 24

• 92

upvoted an article 26 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 151

upvoted an article 29 days ago

Article

Build awesome datasets for video generation

29 days ago

• 27

upvoted a paper about 1 month ago

CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers

Paper • 2502.06527 • Published about 1 month ago • 11

upvoted an article about 1 month ago

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

Jan 20

• 63

upvoted a paper about 1 month ago

MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation

Paper • 2502.04299 • Published Feb 6 • 18

upvoted 2 collections about 1 month ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 8 items • Updated 17 days ago • 394

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 208

upvoted a paper about 1 month ago

DextrAH-G: Pixels-to-Action Dexterous Arm-Hand Grasping with Geometric Fabrics

Paper • 2407.02274 • Published Jul 2, 2024 • 1

upvoted 3 articles about 1 month ago

Article

State of open video generation models in Diffusers

Jan 27

• 50

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 427

Article

FineWeb2-C: Help Build Better Language Models in Your Language

and 5 others

Dec 23, 2024

• 18