Vision Transformer-based Real-Time Camouflaged Object Detection System at Edge

被引:2
作者
Putatunda, Rohan [1 ]
Khan, Md Azim [1 ]
Gangopadhyay, Aryya [1 ]
Wang, Jianwu [1 ]
Busart, Carl [2 ]
Erbacher, Robert F. [2 ]
机构
[1] Univ Maryland Baltimore Cty, Dept Informat Syst, Baltimore, MD 21228 USA
[2] DEVCOM Army Res Lab, Adelphi, MD USA
来源
2023 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING, SMARTCOMP | 2023年
关键词
Camouflaged Object Detection; Multi-Modality; Vision Transformer; GRAD-CAM;
D O I
10.1109/SMARTCOMP58114.2023.00029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Camouflaged object detection is a challenging task in computer vision that involves identifying objects that are intentionally or unintentionally hidden in their surrounding environment. Vision Transformer mechanisms play a critical role in improving the performance of deep learning models by focusing on the most relevant features that help object detection under camouflaged conditions. In this paper, we utilized a vision transformer (VT) in two phases, a) By integrating VT with a deep learning architecture for efficient monocular depth map generation for camouflaged objects and b) By embedding VT multiclass object detection model with multimodal feature input (RGB with RGB-D) that increases the visual cues and provides more representational information to the model for performance enhancement. Additionally, we performed an ablation study to understand the role of the vision transformer in camouflaged object detection and incorporated GRAD-CAM on top of the model to visualize the performance improvement achieved by embedding the VT in the model architecture. We deployed the model on resource-constrained edge devices for real-time object detection to realistically test the performance of the trained model.
引用
收藏
页码:90 / 97
页数:8
相关论文
共 50 条
[31]   RT-DEKT: real-time object detector with KAN-Transformer [J].
Jin, Zhanao ;
Li, Changlu ;
Lei, Zhichun .
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (06)
[32]   Camouflaged Object Detection Based on Ternary Cascade Perception [J].
Jiang, Xinhao ;
Cai, Wei ;
Ding, Yao ;
Wang, Xin ;
Yang, Zhiyong ;
Di, Xingyu ;
Gao, Weijie .
REMOTE SENSING, 2023, 15 (05)
[33]   A Camouflaged Object Detection Model Based on Deep Learning [J].
Wang, Yong ;
Li, Ling ;
Yang, Xin ;
Wang, Xinxin ;
Liu, Hui .
PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020, :150-153
[34]   Vision-Inspired Boundary Perception Network for Lightweight Camouflaged Object Detection [J].
Chen, Chunyuan ;
Liang, Weiyun ;
Wang, Donglin ;
Wang, Bin ;
Xu, Jing .
IEEE SIGNAL PROCESSING LETTERS, 2025, 32 :1176-1180
[35]   Fast Camouflaged Object Detection via Edge-based Reversible Re-calibration Network [J].
Ji, Ge-Peng ;
Zhu, Lei ;
Zhuge, Mingchen ;
Fu, Keren .
PATTERN RECOGNITION, 2022, 123
[36]   Edge-awareness and feature decoupling enhancement network for camouflaged object detection [J].
Xiang, Tao ;
Yang, Jinfu ;
Cai, Shu ;
Bai, Jinglei .
VISUAL COMPUTER, 2025,
[37]   Shrink and Expose: Locate Edge Coarse-to-Fine for Camouflaged Object Detection [J].
Kang, Kejun ;
Liu, Yixiu ;
Sun, Yaoqi ;
Zhu, Shangdong ;
Ge, Fawei ;
Wang, Wei ;
Yan, Chenggang ;
Zheng, Zhigao .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2025, 71 (01) :785-795
[38]   Edge-guided Contextual Attention Fusion Network for Camouflaged Object Detection [J].
Hu, Bo ;
Chen, Sibao .
PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, :108-112
[39]   An edge-aware high-resolution framework for camouflaged object detection [J].
Ma, Jingyuan ;
Chen, Tianyou ;
Xiao, Jin ;
Hu, Xiaoguang ;
Wang, Yingxun .
IMAGE AND VISION COMPUTING, 2025, 157
[40]   Transformer-Based Optimized Multimodal Fusion for 3D Object Detection in Autonomous Driving [J].
Alaba, Simegnew Yihunie ;
Ball, John E. .
IEEE ACCESS, 2024, 12 :50165-50176