Article cocogold: training Marigold for text-grounded segmentation By pcuenq • 15 days ago • 27
PS3: Scaling Vision Pre-Training to 4K Resolution Collection Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/ • 4 items • Updated 2 days ago • 2
Running on Zero 88 VLM Object Understanding 🦀 Explore object detection, visual grounding, keypoint Detecti
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published 27 days ago • 61
Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 By tomaarsen and 1 other • 22 days ago • 105
Running on Zero 158 Chat with Kimi-VL-A3B-Thinking-2506 🤔 Chat with images, videos, or PDFs to generate text