STAIR • Collection • Datasets and Models for STAIR (Improving Safety Alignment with Introspective Reasoning) • 7 items • Updated Feb 26 • 1
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training • Paper • 2503.08525 • Published Mar 11 • 15
Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study • Paper • 2406.07057 • Published Jun 11, 2024 • 17