Audiovisual Dependency Attention for Violence Detection in Videos

被引:4
|
作者
Pang, Wenfeng [1 ]
Xie, Wei [1 ]
He, Qianhua [1 ]
Li, Yanxiong [1 ]
Yang, Jichen [2 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510640, Peoples R China
[2] Guangdong Polytech Normal Univ, Sch Cyberspace Secur, Speech Informat Secur Lab, Guangzhou 510640, Peoples R China
基金
中国国家自然科学基金;
关键词
Audiovisual dependency attention; dependency map; violence detection; SCENES; MOVIES; FUSION;
D O I
10.1109/TMM.2022.3184533
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Violence detection in videos can help maintain public order, detect crimes, or provide timely assistance. In this paper, we aim to leverage multimodal information to determine whether successive frames contain violence. Specifically, we propose an audiovisual dependency attention (AVD-attention) module modified from the co-attention architecture to fuse visual and audio information, unlike commonly used methods such as the feature concatenation, addition, and score fusion. Because the AVD-attention module's dependency map contains sufficient fusion information, we argue that it should be applied more sufficiently. A combination pooling method is utilized to convert the dependency map to an attention vector, which can be considered a new feature that includes fusion information or a mask of the attention feature map. Since some information in the input feature might be lost after processing by attention modules, we employ a multimodal low-rank bilinear method that considers all pairwise interactions among two features in each time step to complement the original information for output features of the module. AVD-attention outperformed co-attention in experiments on the XD-Violence dataset. Our system outperforms state-of-the-art systems.
引用
收藏
页码:4922 / 4932
页数:11
相关论文
共 50 条
  • [1] Revisiting vision-based violence detection in videos: A critical analysis
    Kaur, Gurmeet
    Singh, Sarbjeet
    NEUROCOMPUTING, 2024, 597
  • [2] Violence Detection in Surveillance Videos with Deep Network using Transfer Learning
    Mumtaz, Aqib
    Sargano, Allah Bux
    Habib, Zulfiqar
    2018 2ND EUROPEAN CONFERENCE ON ELECTRICAL ENGINEERING AND COMPUTER SCIENCE (EECS 2018), 2018, : 558 - 563
  • [3] A dataset for automatic violence detection in videos
    Bianculli, Miriana
    Falcionelli, Nicola
    Sernani, Paolo
    Tomassini, Selene
    Contardo, Paolo
    Lombardi, Mara
    Dragoni, Aldo Franco
    DATA IN BRIEF, 2020, 33
  • [4] Violence detection in videos for an intelligent surveillance system using MoBSIFT and movement filtering algorithm
    Febin, I. P.
    Jayasree, K.
    Joy, Preetha Theresa
    PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (02) : 611 - 623
  • [5] VIOLENCE DETECTION IN VIDEOS BASED ON FUSING VISUAL AND AUDIO INFORMATION
    Pang, Wen-Feng
    He, Qian-Hua
    Hu, Yong-jian
    Li, Yan-Xiong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2260 - 2264
  • [6] Bidirectional Convolutional LSTM for the Detection of Violence in Videos
    Hanson, Alex
    Koutilya, P. N. V. R.
    Krishnagopal, Sanjukta
    Davis, Larry
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 280 - 295
  • [7] A Comprehensive Review on Vision-Based Violence Detection in Surveillance Videos
    Ullah, Fath U. Min
    Obaidat, Mohammad S.
    Ullah, Amin
    Muhammad, Khan
    Hijji, Mohammad
    Baik, Sung Wook
    ACM COMPUTING SURVEYS, 2023, 55 (10)
  • [8] Autocorrelation of gradients based violence detection in surveillance videos
    Deepak, K.
    Vignesh, L. K. P.
    Chandrakala, S.
    ICT EXPRESS, 2020, 6 (03): : 155 - 159
  • [9] Violence Detection from Videos using HOG Features
    Das, Sunanda
    Sarker, Amlan
    Mahmud, Tareq
    2019 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT), 2019,
  • [10] Detection of Violence in Cartoon Videos Using Visual Features
    Khalil, Tahira
    Bangash, Javed Iqbal
    Khan, Abdul Waheed
    Lashari, Saima Anwar
    Khan, Abdullah
    Ramli, Dzati Athiar
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 4962 - 4971