Dual Caption Preference Optimization for Diffusion Models Paper • 2502.06023 • Published Feb 9 • 9
Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images Paper • 2405.15961 • Published May 24, 2024
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models Paper • 2408.02231 • Published Aug 5, 2024 • 2
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models Paper • 2408.02231 • Published Aug 5, 2024 • 2
On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation Paper • 2404.08540 • Published Apr 12, 2024 • 12
Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts Paper • 2403.11092 • Published Mar 17, 2024
To Find Waldo You Need Contextual Cues: Debiasing Who's Waldo Paper • 2203.16682 • Published Mar 30, 2022
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks Paper • 2404.14723 • Published Apr 23, 2024 • 10
Getting it Right: Improving Spatial Consistency in Text-to-Image Models Paper • 2404.01197 • Published Apr 1, 2024 • 31