共 7 条
UnionFormer: Unified-Learning Transformer with Multi-View Representation for Image Manipulation Detection and Localization
被引:3
|作者:
Li, Shuaibo
[1
,2
]
Ma, Wei
[1
]
Guo, Jianwei
[2
]
Xu, Shibiao
[3
]
Li, Benchong
[1
]
Zhan, Xiaopeng
[2
]
机构:
[1] Beijing Univ Technol, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Automat, MAIS, Beijing, Peoples R China
[3] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
来源:
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)
|
2024年
基金:
中国国家自然科学基金;
关键词:
NETWORKS;
D O I:
10.1109/CVPR52733.2024.01190
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
We present UnionFormer, a novel framework that integrates tampering clues across three views by unified learning for image manipulation detection and localization. Specifically, we construct a BSFI-Net to extract tampering features from RGB and noise views, achieving enhanced responsiveness to boundary artifacts while modulating spatial consistency at different scales. Additionally, to explore the inconsistency between objects as a new view of clues, we combine object consistency modeling with tampering detection and localization into a three-task unified learning process, allowing them to promote and improve mutually. Therefore, we acquire a unified manipulation discriminative representation under multi-scale supervision that consolidates information from three views. This integration facilitates highly effective concurrent detection and localization of tampering. We perform extensive experiments on diverse datasets, and the results show that the proposed approach outperforms state-of-the-art methods in tampering detection and localization.
引用
收藏
页码:12523 / 12533
页数:11
相关论文