Armed boundary sabotage: A case study of human malicious behaviors identification with computer vision and explainable reasoning methods

Citations: 0
Authors
Li, Zhan [1]
Song, Xingyu [1]
Chen, Shi [1]
Demachi, Kazuyuki [1]
Affiliations
[1] Univ Tokyo, Sch Engn, Dept Nucl Engn & Management, Tokyo, Japan
Keywords
Human malicious behaviors identification; Armed boundary sabotage; Computer vision; Human-object interaction analysis; Data-based reasoning method; Language-based reasoning method; KNOWLEDGE; SYSTEM
DOI
10.1016/j.compeleceng.2024.109924
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Computer vision (CV) technologies offer a labor-saving and convenient way to identify human malicious behaviors. However, existing approaches often neglect the robustness, generalization, and interpretability of their computational frameworks. This paper presents a case study of armed boundary sabotage, a very common but sometimes difficult-to-detect scenario, built on a computer vision module (CVM) and a reasoning module (RM). The CVM extracts key information from raw videos, while the RM produces the final reasoning results. Because such scenarios are transient and easily confused, a dedicated human-object interaction analysis process with a soft constraint is proposed in the CVM. In addition, two reasoning methods, a data-based method and a language-based method, are implemented in the RM. The results show that the human-object interaction analysis process with the soft constraint is effective and practical, reaching an optimal testing accuracy of 0.7871. Furthermore, both proposed reasoning methods are promising for identifying human malicious behaviors. The advanced language-based reasoning method outperforms the others, with the highest precision of 0.8750 and a perfect recall of 1.0000. These proposals are also verified to perform well in the other external intrusion scenarios from our previous work. Finally, the research obtains state-of-the-art results in comparison with other related works.
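As a reading aid for the reported metrics, the following minimal sketch shows how precision, recall, and F1 relate. The confusion counts (7 true positives, 1 false positive, 0 false negatives) are hypothetical values chosen only because they reproduce a precision of 0.8750 and a recall of 1.0000; the abstract does not state the actual counts, and the function names are illustrative.

```python
def precision(tp: int, fp: int) -> float:
    """Fraction of predicted positives that are truly positive."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def recall(tp: int, fn: int) -> float:
    """Fraction of actual positives that were detected."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def f1_score(p: float, r: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * p * r / (p + r) if (p + r) else 0.0


# Hypothetical counts (an assumption, not taken from the paper).
tp, fp, fn = 7, 1, 0

p = precision(tp, fp)
r = recall(tp, fn)
print(f"precision={p:.4f} recall={r:.4f} f1={f1_score(p, r):.4f}")
```

A recall of 1.0000 means no malicious event was missed (zero false negatives), while a precision of 0.8750 means one in eight alarms was false under these assumed counts.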
Pages: 17