LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 126
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning Paper • 2410.14972 • Published Oct 19, 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization Paper • 2402.14528 • Published Feb 22, 2024
Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning? Paper • 2307.07837 • Published Jul 15, 2023
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization Paper • 2310.19668 • Published Oct 30, 2023 • 3