Challenger: Affordable Adversarial Driving Video Generation

Zhiyuan Xu1,2*, Bohan Li3,4*, Huan-ang Gao1, Mingju Gao1
Yong Chen5, Ming Liu5, Chenxu Yan5, Hang Zhao6, Shuo Feng7, Hao Zhao1†
1 Institute for AI Industry Research (AIR), Tsinghua University   2 University of Chinese Academy of Sciences
3 Shanghai Jiao Tong University   4 Eastern Institute of Technology(EIT), Ningbo   5 Geely Auto
6 Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University  7 Department of Automation, Tsinghua University

*Equal Contribution   Corresponding author

Introductory Video

Abstract

Generating photorealistic driving videos has seen significant progress recently, but current methods largely focus on ordinary, non-adversarial scenarios. Meanwhile, efforts to generate adversarial driving scenarios often operate on abstract trajectory or BEV representations, falling short of delivering realistic sensor data that can truly stress-test autonomous driving (AD) systems. In this work, we introduce Challenger, a framework that produces physically plausible yet photorealistic adversarial driving videos. Generating such videos poses a fundamental challenge: it requires jointly optimizing over the space of traffic interactions and high-fidelity sensor observations. Challenger makes this affordable through two techniques: (1) a physics-aware multi-round trajectory refinement process that narrows down candidate adversarial maneuvers, and (2) a tailored trajectory scoring function that encourages realistic yet adversarial behavior while maintaining compatibility with downstream video synthesis. As tested on the nuScenes dataset, Challenger generates a diverse range of aggressive driving scenarios—including cut-ins, sudden lane changes, tailgating, and blind spot intrusions—and renders them into multiview photorealistic videos. Extensive evaluations show that these scenarios significantly increase the collision rate of state-of-the-art end-to-end AD models (UniAD, VAD, SparseDrive, and DiffusionDrive), and importantly, adversarial behaviors discovered for one model often transfer to others.