ASYv3: Attention-enabled pooling embedded Swin transformer-based YOLOv3 for obscenity detection

Cited by: 3
Authors
Samal, Sonali [1]
Zhang, Yu-Dong [2, 8]
Gadekallu, Thippa Reddy [3, 4, 5, 6, 7]
Balabantaray, Bunil Kumar [1]
Affiliations
[1] Natl Inst Technol Meghalaya, Dept Comp Sci & Engn, Shillong, India
[2] Univ Leicester, Sch Comp Sci, Leicester, Leicestershire, England
[3] Zhongda Grp, Haiyan Cty, Jiaxing, Peoples R China
[4] Lebanese Amer Univ, Dept Elect & Comp Engn, Byblos, Lebanon
[5] Vellore Inst Technol, Sch Informat Technol & Engn, Vellore, India
[6] Jiaxing Univ, Coll Informat Sci & Engn, Jiaxing, Peoples R China
[7] Lovely Profess Univ, Div Res & Dev, Phagwara, India
[8] Univ Leicester, Sch Comp Sci, Leicester LE1 7RH, Leicestershire, England
Keywords
attention-based pooling; obscene detection; Swin transformer; YOLOv3
DOI
10.1111/exsy.13337
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The rampant spread of explicit content across social media can damage society, so detecting and curtailing sexually explicit material is essential to curb its dissemination and safeguard digital communities from its harmful effects. In this article, we propose attention-enabled pooling (ABP) embedded Swin transformer-based YOLOv3 (ASYv3), a technique that detects obscene areas in images and draws a bounding box around each offensive region. ASYv3 uses a two-step approach to improve obscenity detection. In the first step, a scalable and efficient Swin transformer block is integrated, using self-attention and model parallelism to train massive models effectively. In the second step, the embedding layer of the Swin transformer is replaced with ABP, which mitigates disruption of feature context. ABP projects raw-valued features into linear form while attending to feature context information at specified locations, resulting in optimized feature extraction. ASYv3 was trained on the annotated obscene images (AOI) dataset. It surpassed state-of-the-art methods, achieving 97% testing accuracy, 96.62% precision, 97.40% sensitivity, a 3.48% false positive rate (FPR), a 97.37% negative predictive value (NPV), and 95.59% mean average precision (mAP).
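The classification metrics named in the abstract (accuracy, precision, sensitivity, FPR, and NPV) all follow from the four cells of a binary confusion matrix. The Python sketch below shows the standard definitions only; the `ConfusionCounts` type, the `binary_metrics` helper, and the example counts are hypothetical illustrations and are not taken from the paper or its AOI dataset. mAP is left out because it additionally requires matching predicted and ground-truth boxes by overlap.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Cells of a binary confusion matrix (hypothetical container)."""
    tp: int  # obscene samples predicted obscene
    fp: int  # non-obscene samples predicted obscene
    tn: int  # non-obscene samples predicted non-obscene
    fn: int  # obscene samples predicted non-obscene


def _ratio(numerator: int, denominator: int) -> float:
    # Assumption: report 0.0 when a denominator is empty instead of raising.
    return numerator / denominator if denominator else 0.0


def binary_metrics(c: ConfusionCounts) -> dict:
    """Standard textbook definitions of the metrics listed in the abstract."""
    total = c.tp + c.fp + c.tn + c.fn
    return {
        "accuracy": _ratio(c.tp + c.tn, total),
        "precision": _ratio(c.tp, c.tp + c.fp),
        "sensitivity": _ratio(c.tp, c.tp + c.fn),  # also called recall
        "fpr": _ratio(c.fp, c.fp + c.tn),          # false positive rate
        "npv": _ratio(c.tn, c.tn + c.fn),          # negative predictive value
    }


# Made-up counts, chosen only to exercise the formulas.
counts = ConfusionCounts(tp=90, fp=10, tn=80, fn=20)
metrics = binary_metrics(counts)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Note that FPR and specificity always sum to 1, so a low FPR such as the one reported above corresponds to a high specificity.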
Pages: 18