Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Paper • 2504.13169 • Published 20 days ago • 39
Long Reasoning Collection Datasets with reasoning traces for math and code (Train + Eval) • 49 items • Updated Mar 21 • 1
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 61
🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 21 items • Updated 22 days ago • 136
M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework Paper • 2411.06176 • Published Nov 9, 2024 • 46
Direct Preference Optimization Datasets Collection Datasets suitable for DPO based on having 'chosen', 'rejected', and 'prompt' columns. Created using librarian-bots/dataset-column-search-api • 5520 items • Updated Apr 6 • 6