Multi-Class Document Image Classification using Deep Visual and Textual Features

被引:2
|
作者
Sevim, Semih [1 ]
Ekinci, Ekin [2 ]
Omurca, Sevinc Ilhan [3 ]
Edinc, Eren Berk [3 ]
Eken, Suleyman [4 ]
Erdem, Turkucan [4 ]
Sayar, Ahmet [3 ]
机构
[1] Bandirma Onyedi Eylul Univ, Comp Engn Dept, Balikesir, Turkey
[2] Sakarya Univ Appl Sci, Comp Engn Dept, Sakarya, Turkey
[3] Kocaeli Univ, Comp Engn Dept, Kocaeli, Turkey
[4] Kocaeli Univ, Informat Syst Engn Dept, Kocaeli, Turkey
关键词
Document analysis and recognition; document classification; text mining; deep learning; NETWORKS;
D O I
10.1142/S1469026822500134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The digitalization era has brought digital documents with it, and the classification of document images has become an important need as in classical text documents. Document images, in which text documents are stored as images, contain both text and visual features, unlike images. Therefore, it is possible to use both text and visual features while classifying such data. Considering this situation, in this study, it is aimed to classify document images by using both text and visual features and to determine which feature type is more successful in classification. In the text-based approach, each document/class is labeled with the keywords associated with that document/class and the classification is realized according to whether the document contains the related key-words or not. For visual-based classification, we use four deep learning models namely CNN, NASNet-Large, InceptionV3, and EfficientNetB3. Experimental study is carried out on document images obtained from applicants of the Kocaeli University. As a result, it is seen ii that EfficientNetB3 is the most superior among all with 0.8987 F-score.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Multi-Class Skin Lesions Classification Using Deep Features
    Usama, Muhammad
    Naeem, M. Asif
    Mirza, Farhaan
    SENSORS, 2022, 22 (21)
  • [2] Visual and Textual Deep Feature Fusion for Document Image Classification
    Bakkali, Souhail
    Ming, Zuheng
    Coustaty, Mickael
    Rusinol, Marcal
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2394 - 2403
  • [3] Multi-class Enhanced Image Mining of Heterogeneous Textual Images Using Multiple Image Features
    Chitrakala, S.
    Shamini, P.
    Manjula, D.
    2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 496 - +
  • [4] Bag-of-Visual-Words codebook generation using deep features for effective classification of imbalanced multi-class image datasets
    Manisha Saini
    Seba Susan
    Multimedia Tools and Applications, 2021, 80 : 20821 - 20847
  • [5] Bag-of-Visual-Words codebook generation using deep features for effective classification of imbalanced multi-class image datasets
    Saini, Manisha
    Susan, Seba
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (14) : 20821 - 20847
  • [6] Deep Decision Network for Multi-Class Image Classification
    Murthy, Venkatesh N.
    Singh, Vivek
    Chen, Terrence
    Manmatha, R.
    Comaniciu, Dorin
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2240 - 2248
  • [7] Multi-class Alzheimer's disease classification using image and clinical features
    Altaf, Tooba
    Anwar, Syed Muhammad
    Gul, Nadia
    Majeed, Muhammad Nadeem
    Majid, Muhammad
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2018, 43 : 64 - 74
  • [8] Multi-Class Document Classification Using Lexical Ontology-Based Deep Learning †
    Yelmen, Ilkay
    Gunes, Ali
    Zontul, Metin
    APPLIED SCIENCES-BASEL, 2023, 13 (10):
  • [9] Multi-Class Retinopathy classification in Fundus Image using Deep Learning Approaches
    Wankhade, Nisha R.
    Bhoyar, Kishor K.
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2021, 12 (05): : 807 - 816
  • [10] Using natural class hierarchies in multi-class visual classification
    Autio, Ilkka
    PATTERN RECOGNITION, 2006, 39 (07) : 1290 - 1299