Machine Learning Based Detection of Digital Documents Maliciously Recaptured from Displays

Cited by: 0
Authors
Gholam-Zadeh, Saleh [1]
Upenik, Evgeniy [1]
Hatarsi, Guy [2]
Ebrahimi, Touradj [1]
Affiliations
[1] Ecole Polytech Fed Lausanne EPFL, Multimedia Signal Proc Grp MMSPG, CH-1015 Lausanne, Switzerland
[2] Quantum Integr SA, EPFL Innovat Pk, Batiment C, CH-1015 Lausanne, Switzerland
Source
APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIII | 2020 / Vol. 11510
Keywords
image manipulation; image forgery; forgery detection; manipulation detection; KYC; image falsification; falsification detection; machine learning;
DOI
10.1117/12.2569256
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
We used to say that "seeing is believing"; this is no longer true. Digitization is changing all aspects of life and business. One of the more noticeable impacts is on how business documents are authored, exchanged, and processed. Many documents, such as passports and IDs, are first created in paper form but are immediately scanned, digitized, and further processed in electronic form. Widely available photo editing software makes image manipulation quite literally child's play, tremendously increasing the amount of forged content. With growing concerns over the authenticity and integrity of scanned and image-based documents such as passports and IDs, there is an urgent need to quickly validate scanned and photographic documents. The same machine learning that is behind some of the most successful content manipulation solutions can also be used as a countermeasure to detect them. In this paper, we describe an efficient detector of recaptured digital documents based on machine learning. The core of the system is a binary classifier based on a support vector machine (SVM), trained on authentic and recaptured digital passports. The detector flags a digital document that is the result of photographing another digital document displayed on an LCD monitor. To assess the proposed detector, a dedicated dataset of authentic and recaptured passports, captured with a number of different cameras, was created. Several experiments were set up to assess the overall performance of the detector as well as its efficacy in special situations, such as when the machine learning engine is trained on a specific type of camera or when it encounters a new type of camera for which it was not trained. Results show that the detector's accuracy remains above 90 percent in the large majority of cases.
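The evaluation protocol described in the abstract (train a binary SVM on authentic and recaptured samples, then test on a camera the model has never seen) can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the two-dimensional feature vectors, the camera names, and all hyperparameters are hypothetical, and a linear SVM trained by full-batch subgradient descent on the hinge loss stands in for whatever SVM variant and features the paper actually uses.

```python
# Hypothetical toy data: (camera_id, feature_vector, label).
# label +1 = recaptured from a display, -1 = authentic.
# The features are placeholders, not the features used in the paper.
SAMPLES = [
    ("cam_a", (0.1, 0.2), -1), ("cam_a", (0.2, 0.1), -1),
    ("cam_a", (0.9, 0.8), +1), ("cam_a", (0.8, 0.9), +1),
    ("cam_b", (0.2, 0.2), -1), ("cam_b", (0.1, 0.3), -1),
    ("cam_b", (0.8, 0.7), +1), ("cam_b", (0.9, 0.9), +1),
    ("cam_c", (0.3, 0.1), -1), ("cam_c", (0.2, 0.3), -1),
    ("cam_c", (0.7, 0.8), +1), ("cam_c", (1.0, 0.9), +1),
]


def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=500):
    """Full-batch subgradient descent on the L2-regularised hinge loss."""
    dim = len(X[0])
    w = [0.0] * dim
    b = 0.0
    n = len(X)
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]  # gradient of the regulariser
        grad_b = 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1.0:  # sample violates the margin
                for j in range(dim):
                    grad_w[j] -= yi * xi[j] / n
                grad_b -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def predict(model, x):
    """Return +1 (recaptured) or -1 (authentic) for one feature vector."""
    w, b = model
    score = sum(wj * xj for wj, xj in zip(w, x)) + b
    return 1 if score >= 0 else -1


def leave_one_camera_out(samples):
    """Train on all cameras but one, then test on the held-out camera."""
    cameras = sorted({cam for cam, _, _ in samples})
    results = {}
    for held_out in cameras:
        train = [(x, y) for cam, x, y in samples if cam != held_out]
        test = [(x, y) for cam, x, y in samples if cam == held_out]
        model = train_linear_svm([x for x, _ in train], [y for _, y in train])
        correct = sum(predict(model, x) == y for x, y in test)
        results[held_out] = correct / len(test)
    return results


if __name__ == "__main__":
    for cam, acc in leave_one_camera_out(SAMPLES).items():
        print(f"held-out {cam}: accuracy {acc:.2f}")
```

Holding out an entire camera, rather than a random subset of images, is what measures the "new type of camera" case mentioned in the abstract; a random split would leak camera-specific characteristics into the training set.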
Pages: 11