Deep Learning of Visual and Textual Data for Region Detection Applied to Item Coding

被引:0
|
作者
Arroyo, Roberto [1 ]
Tovar, Javier [1 ]
Delgado, Francisco J. [1 ]
Almazan, Emilio J. [1 ]
Serrador, Diego G. [1 ]
Hurtado, Antonio [1 ]
机构
[1] Nielsen Connect AI, Calle Salvador de Madariaga 1, Madrid 28027, Spain
来源
PATTERN RECOGNITION AND IMAGE ANALYSIS, PT I | 2020年 / 11867卷
关键词
Deep learning; CNNs; OCR; Text-maps; Text regions detection; Item coding; Market studies;
D O I
10.1007/978-3-030-31332-6_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose a deep learning approach that combines visual appearance and text information in a Convolutional Neural Network (CNN), with the aim of detecting regions of different textual categories. We define a novel visual representation of the semantic meaning of text that allows a seamless integration in a standard CNN architecture. This representation, referred to as text-map, is integrated with the actual image to provide a much richer input to the network. Text-maps are colored with different intensities depending on the relevance of the words recognized over the image. More specifically, these words are previously extracted using Optical Character Recognition (OCR) and they are colored according to the probability of belonging to a textual category of interest. In this sense, the presented solution is especially relevant in the context of item coding for supermarket products, where different types of textual categories must be identified (e.g., ingredients or nutritional facts). We evaluated our approach in the proprietary item coding dataset of Nielsen Brandbank, which is composed of more than 10,000 images for train and 2,000 images for test. The reported results demonstrate that our method focused on visual and textual data outperforms state-of-the-art algorithms only based on appearance, such as standard Faster R-CNN. These improvements are exhibited in precision and recall, which are enhanced in 42 and 33 points respectively.
引用
收藏
页码:329 / 341
页数:13
相关论文
共 50 条
  • [41] Multi-Model Fusion Framework Using Deep Learning for Visual-Textual Sentiment Classification
    Al-Tameemi, Israa K. Salman
    Feizi-Derakhshi, Mohammad-Reza
    Pashazadeh, Saeed
    Asadpour, Mohammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (02): : 2145 - 2177
  • [42] Multi-Model Fusion Framework Using Deep Learning for Visual-Textual Sentiment Classification
    Salman Al-Tameemi I.K.
    Feizi-Derakhshi M.-R.
    Pashazadeh S.
    Asadpour M.
    Computers, Materials and Continua, 2023, 76 (02) : 2145 - 2177
  • [43] Deep learning object detection applied to defect recognition of memory modules
    Jung-Tang Huang
    Chien-Hung Ting
    The International Journal of Advanced Manufacturing Technology, 2022, 121 : 8433 - 8445
  • [44] Deep learning object detection applied to defect recognition of memory modules
    Huang, Jung-Tang
    Ting, Chien-Hung
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 121 (11-12) : 8433 - 8445
  • [45] Deep Learning Methods applied to Intrusion Detection: Survey, Taxonomy and Challenges
    Lifandali, Oumaima
    Abghour, Noreddine
    2021 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATION (DASA), 2021,
  • [46] Deep learning applied to electroencephalogram data in mental disorders: A systematic review
    de Bardeci, Mateo
    Ip, Cheng Teng
    Olbrich, Sebastian
    BIOLOGICAL PSYCHOLOGY, 2021, 162
  • [47] Neovascularization Detection on Optic Disc Region Using Deep Learning
    Carrillo-Gomez, Cesar
    Nakano, Mariko
    Leon, Ana Gonzalez-H.
    Romo-Aguas, Juan Carlos
    Quiroz-Mercado, Hugo
    Lopez-Garcia, Osvaldo
    PATTERN RECOGNITION (MCPR 2021), 2021, 12725 : 111 - 120
  • [48] Deep Learning Applied to Mobile Phone Data for Individual Income Classification
    Sundsoy, Pal
    Bjelland, Johannes
    Reme, Bjorn-Atle
    Iqbal, Asif M.
    Jahani, Eaman
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2016, 127
  • [49] Deep learning for anomaly detection in log data: A survey
    Landauer, Max
    Onder, Sebastian
    Skopik, Florian
    Wurzenberger, Markus
    MACHINE LEARNING WITH APPLICATIONS, 2023, 12
  • [50] Deep Learning Based Data Race Detection Approach
    Zhang Y.
    Qiao L.
    Dong C.
    Gao H.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (09): : 1914 - 1928