Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes
Abstract
Ctrl-Crash, a controllable car crash video generation model using classifier-free guidance, achieves top performance in video quality and realism compared to existing diffusion-based methods.
Video diffusion techniques have advanced significantly in recent years; however, they struggle to generate realistic imagery of car crashes due to the scarcity of accident events in most driving datasets. Improving traffic safety requires realistic and controllable accident simulations. To tackle the problem, we propose Ctrl-Crash, a controllable car crash video generation model that conditions on signals such as bounding boxes, crash types, and an initial image frame. Our approach enables counterfactual scenario generation where minor variations in input can lead to dramatically different crash outcomes. To support fine-grained control at inference time, we leverage classifier-free guidance with independently tunable scales for each conditioning signal. Ctrl-Crash achieves state-of-the-art performance across quantitative video quality metrics (e.g., FVD and JEDi) and qualitative measurements based on a human-evaluation of physical realism and video quality compared to prior diffusion-based methods.
Community
๐ฅ Check out our website for more video demos!
๐ Website: https://anthonygosselin.github.io/Ctrl-Crash-ProjectPage/
๐ Paper: https://arxiv.org/abs/2506.00227
๐ GitHub: https://github.com/AnthonyGosselin/Ctrl-Crash/tree/master
๐ค HuggingFace: https://huggingface.co/AnthonyGosselin/Ctrl-Crash
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Simulating the Unseen: Crash Prediction Must Learn from What Did Not Happen (2025)
- Safety-Critical Traffic Simulation with Guided Latent Diffusion Model (2025)
- CrashAgent: Crash Scenario Generation via Multi-modal Reasoning (2025)
- LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation (2025)
- Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis (2025)
- Challenger: Affordable Adversarial Driving Video Generation (2025)
- RADE: Learning Risk-Adjustable Driving Environment via Multi-Agent Conditional Diffusion (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 1
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper