Deep Learning of Visual and Textual Data for Region Detection Applied to Item Coding

被引:0
|
作者
Arroyo, Roberto [1 ]
Tovar, Javier [1 ]
Delgado, Francisco J. [1 ]
Almazan, Emilio J. [1 ]
Serrador, Diego G. [1 ]
Hurtado, Antonio [1 ]
机构
[1] Nielsen Connect AI, Calle Salvador de Madariaga 1, Madrid 28027, Spain
来源
PATTERN RECOGNITION AND IMAGE ANALYSIS, PT I | 2020年 / 11867卷
关键词
Deep learning; CNNs; OCR; Text-maps; Text regions detection; Item coding; Market studies;
D O I
10.1007/978-3-030-31332-6_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose a deep learning approach that combines visual appearance and text information in a Convolutional Neural Network (CNN), with the aim of detecting regions of different textual categories. We define a novel visual representation of the semantic meaning of text that allows a seamless integration in a standard CNN architecture. This representation, referred to as text-map, is integrated with the actual image to provide a much richer input to the network. Text-maps are colored with different intensities depending on the relevance of the words recognized over the image. More specifically, these words are previously extracted using Optical Character Recognition (OCR) and they are colored according to the probability of belonging to a textual category of interest. In this sense, the presented solution is especially relevant in the context of item coding for supermarket products, where different types of textual categories must be identified (e.g., ingredients or nutritional facts). We evaluated our approach in the proprietary item coding dataset of Nielsen Brandbank, which is composed of more than 10,000 images for train and 2,000 images for test. The reported results demonstrate that our method focused on visual and textual data outperforms state-of-the-art algorithms only based on appearance, such as standard Faster R-CNN. These improvements are exhibited in precision and recall, which are enhanced in 42 and 33 points respectively.
引用
收藏
页码:329 / 341
页数:13
相关论文
共 50 条
  • [21] Deep Learning Applied to Automatic Reclosers Detection in Power Grid
    Marques, Francisco
    Pinto, Alano
    Bastos, Arthur
    Goncalves, Ana
    Pereira, Gilherbson
    Reis, Flavio
    2021 IEEE 45TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2021), 2021, : 1861 - 1866
  • [22] Deep learning methods applied to electronic monitoring data: automated catch event detection for longline fishing
    Qiao, Maoying
    Wang, Dadong
    Tuck, Geoffrey N.
    Little, L. Richard
    Punt, Andre E.
    Gerner, Mike
    ICES JOURNAL OF MARINE SCIENCE, 2021, 78 (01) : 25 - 35
  • [23] Severe aortic stenosis detection by deep learning applied to echocardiography
    Holste, Gregory
    Oikonomou, Evangelos K.
    Mortazavi, Bobak J.
    Coppi, Andreas
    Faridi, Kamil F.
    Miller, Edward J.
    Forrest, John K.
    McNamara, Robert L.
    Ohno-Machado, Lucila
    Yuan, Neal
    Gupta, Aakriti
    Ouyang, David
    Krumholz, Harlan M.
    Wang, Zhangyang
    Khera, Rohan
    EUROPEAN HEART JOURNAL, 2023, 44 (43) : 4592 - 4604
  • [24] DEEP LEARNING BASED SENSITIVE DATA DETECTION
    Chong, Peng
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [25] Deep Learning Poison Data Attack Detection
    Chacon, Henry
    Silva, Samuel Henrique
    Rad, Paul
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 971 - 978
  • [26] Sinkhole Detection by Deep Learning and Data Association
    Nam Vu Hoai
    Nguyen Manh Dung
    Ro, Soonghwan
    2019 ELEVENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2019), 2019, : 211 - 213
  • [27] Hybrid context enriched deep learning model for fine-grained sentiment analysis in textual and visual semiotic modality social data
    Kumar, Akshi
    Srinivasan, Kathiravan
    Cheng Wen-Huang
    Zomaya, Albert Y.
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (01)
  • [28] When Dictionary Learning Meets Deep Learning: Deep Dictionary Learning and Coding Network for Image Recognition With Limited Data
    Tang, Hao
    Liu, Hong
    Xiao, Wei
    Sebe, Nicu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 2129 - 2141
  • [29] BBC-FND: An ensemble of deep learning framework for textual fake news detection
    Palani, Balasubramanian
    Elango, Sivasankar
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [30] Loop Closure Detection for Visual SLAM Based on Deep Learning
    Hu, Hang
    Zhang, Yunzhou
    Duan, Qiang
    Hu, Meiyu
    Pang, Linzhuo
    2017 IEEE 7TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2017, : 1214 - 1219