Hanfu-Bench: A Multimodal Benchmark on Cross-Temporal Cultural Understanding and Transcreation Paper • 2506.01565 • Published Jun 2, 2025 • 3
FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture Paper • 2406.11030 • Published Jun 16, 2024
Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning Paper • 2406.02265 • Published Jun 4, 2024 • 7