PZ's picture

PZ PRO

philipp-zettl

·

philsupertramp

AI & ML interests

NLP/CV/Multimodal learning

Recent Activity

updated a dataset 16 days ago

philipp-zettl/mtg_cards-2025-04-04

liked a model 16 days ago

black-forest-labs/FLUX.1-Kontext-dev

updated a Space 23 days ago

philipp-zettl/NSFW_MASTER_FLUX

View all activity

Organizations

upvoted an article 27 days ago

Article

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

By

and 6 others •

Jun 12

• 113

upvoted a collection 2 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 568

upvoted a collection 3 months ago

LLäMmlein 🐑

https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/ • 13 items • Updated 3 days ago • 10

upvoted an article 3 months ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

By

and 6 others •

Apr 5

• 145

upvoted 4 papers 3 months ago

Less-to-More Generalization: Unlocking More Controllability by In-Context Generation

Paper • 2504.02160 • Published Apr 2 • 37

MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

Paper • 2502.18924 • Published Feb 26 • 13

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Paper • 2504.00595 • Published Apr 1 • 36

Your ViT is Secretly an Image Segmentation Model

Paper • 2503.19108 • Published Mar 24 • 22

upvoted a paper 4 months ago

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published Mar 26 • 53

upvoted an article 4 months ago

Article

LeRobot goes to driving school: World’s largest open-source self-driving dataset

By

and 1 other •

Mar 11

• 95

upvoted 3 papers 4 months ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 152

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 111

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

Paper • 2503.09641 • Published Mar 12 • 40

upvoted an article 4 months ago

Article

Welcome to Inference Providers on the Hub 🔥

By

and 6 others •

Jan 28

• 483

upvoted a paper 4 months ago

Auditing Prompt Caching in Language Model APIs

Paper • 2502.07776 • Published Feb 11 • 5

upvoted an article 4 months ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

By

and 6 others •

Feb 20

• 283

upvoted a collection 4 months ago

Phi-4

Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated 3 days ago • 168

upvoted a paper 5 months ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 149

upvoted a collection 5 months ago

Express 🚅

Express Tiny LLM's • 7 items • Updated Mar 24 • 3

upvoted a paper 5 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 235