Running 3 3 OpenThoughts Benchmark Explorer 📊 Explore model performance through benchmark correlations
Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets Paper • 2506.04598 • Published 24 days ago • 5
Running 3 3 OpenThoughts Benchmark Explorer 📊 Explore model performance through benchmark correlations
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models Paper • 2406.02061 • Published Jun 4, 2024 • 1
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17, 2024 • 53
Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets Paper • 2506.04598 • Published 24 days ago • 5
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published 23 days ago • 42
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations Paper • 2408.12590 • Published Aug 22, 2024 • 37
xLAM: A Family of Large Action Models to Empower AI Agent Systems Paper • 2409.03215 • Published Sep 5, 2024 • 5
SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant Paper • 2403.11299 • Published Mar 17, 2024 • 1
LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer Paper • 2212.09877 • Published Dec 19, 2022
Trust but Verify: Programmatic VLM Evaluation in the Wild Paper • 2410.13121 • Published Oct 17, 2024 • 3