Unlocking Creativity with Text-to-Image Generation: Exploring LoRA Models and Styles [Generative Vision] Aug 8, 2024 • 14
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding Paper • 2501.07783 • Published 4 days ago • 7
MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents Paper • 2501.08828 • Published 3 days ago • 24
Multimodal Models Collection Multimodal models with leading performance. • 17 items • Updated 1 day ago • 28
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 3 days ago • 98
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference 2 days ago • 42
view article Article How to Automate Reddit Comment Generation with AI Agents in KaibanJS By darielnoel • 11 days ago • 4
view article Article **N-Queens Problem Based Monte Carlo Algorithm** By prithivMLmods • 7 days ago • 7
Blaze.1 🔥 Collection Text Generation, Vision Language, Image Generation • 5 items • Updated 2 days ago • 5
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 9 days ago • 46
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives Paper • 2501.04003 • Published 11 days ago • 23
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 9 days ago • 77
GWQ Collection Open Eval < 500, long chain of thought and keyword based answering, reasoning • 6 items • Updated 7 days ago • 8
view article Article Announcing NVIDIA Cosmos World Foundation Models By mingyuliutw • 11 days ago • 22
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated 12 days ago • 123