Text-Driven Traffic Anomaly Detection With Temporal High-Frequency Modeling in Driving Videos

被引:1
作者
Liang, Rongqin [1 ]
Li, Yuanman [1 ]
Zhou, Jiantao [2 ,3 ]
Li, Xia [1 ]
机构
[1] Shenzhen Univ, Guangdong Key Lab Intelligent Informat Proc, Coll Elect & Informat Engn, Shenzhen, Guangdong, Peoples R China
[2] Univ Macau, State Key Lab Internet Things Smart City, Macau, Peoples R China
[3] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
关键词
Traffic anomaly detection; multi-modality learning; high frequency; attention; PREDICTION;
D O I
10.1109/TCSVT.2024.3390173
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Traffic anomaly detection (TAD) in driving videos is critical for ensuring the safety of autonomous driving and advanced driver assistance systems. Previous single-stage TAD methods primarily rely on frame prediction, making them vulnerable to interference from dynamic backgrounds induced by the rapid movement of the dashboard camera. While two-stage TAD methods appear to be a natural solution to mitigate such interference by pre-extracting background-independent features (such as bounding boxes and optical flow) using perceptual algorithms, they are susceptible to the performance of first-stage perceptual algorithms and may result in error propagation. In this paper, we introduce TTHF, a novel single-stage method aligning video clips with text prompts, offering a new perspective on traffic anomaly detection. Unlike previous approaches, the supervised signal of our method is derived from languages rather than orthogonal one-hot vectors, providing a more comprehensive representation. Further, concerning visual representation, we propose to model the high frequency of driving videos in the temporal domain. This modeling captures the dynamic changes of driving scenes, enhances the perception of driving behavior, and significantly improves the detection of traffic anomalies. In addition, to better perceive various types of traffic anomalies, we carefully design an attentive anomaly focusing mechanism that visually and linguistically guides the model to adaptively focus on the visual context of interest, thereby facilitating the detection of traffic anomalies. It is shown that our proposed TTHF achieves promising performance, outperforming state-of-the-art competitors by +5.4% AUC on the DoTA dataset and achieving high generalization on the DADA dataset.
引用
收藏
页码:8684 / 8697
页数:14
相关论文
共 56 条
[1]   Zero-Shot Composed Image Retrieval with Textual Inversion [J].
Baldrati, Alberto ;
Agnolucci, Lorenzo ;
Bertini, Marco ;
Del Bimbo, Alberto .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :15292-15301
[2]   Abnormal Event Detection in Videos Using Spatiotemporal Autoencoder [J].
Chong, Yong Shean ;
Tay, Yong Haur .
ADVANCES IN NEURAL NETWORKS, PT II, 2017, 10262 :189-196
[3]   A Review of Motion Planning for Highway Autonomous Driving [J].
Claussmann, Laurene ;
Revilloud, Marc ;
Gruyer, Dominique ;
Glaser, Sebastien .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (05) :1826-1848
[4]  
Dosovitskiy A., 2021, arXiv
[5]   DADA: Driver Attention Prediction in Driving Accident Scenarios [J].
Fang, Jianwu ;
Yan, Dingxin ;
Qiao, Jiahuan ;
Xue, Jianru ;
Yu, Hongkai .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) :4959-4971
[6]   Traffic Accident Detection via Self-Supervised Consistency Learning in Driving Scenarios [J].
Fang, Jianwu ;
Qiao, Jiahuan ;
Bai, Jie ;
Yu, Hongkai ;
Xue, Jianru .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) :9601-9614
[7]   A Background-Agnostic Framework With Adversarial Training for Abnormal Event Detection in Video [J].
Georgescu, Mariana Iuliana ;
Ionescu, Radu Tudor ;
Khan, Fahad Shahbaz ;
Popescu, Marius ;
Shah, Mubarak .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) :4505-4523
[8]   Memorizing Normality to Detect Anomaly: Memory-augmented Deep Autoencoder for Unsupervised Anomaly Detection [J].
Gong, Dong ;
Liu, Lingqiao ;
Le, Vuong ;
Saha, Budhaditya ;
Mansour, Moussa Reda ;
Venkatesh, Svetha ;
van den Hengel, Anton .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1705-1714
[9]  
Gong Y., 2022, P IEEE INT C MULT EX, P1
[10]  
Gu X, 2022, P INT C LEARN REPR