Multi-Class Document Image Classification using Deep Visual and Textual Features

被引：2

作者：

Sevim, Semih ^{[1
]}

Ekinci, Ekin ^{[2
]}

Omurca, Sevinc Ilhan ^{[3
]}

Edinc, Eren Berk ^{[3
]}

Eken, Suleyman ^{[4
]}

Erdem, Turkucan ^{[4
]}

Sayar, Ahmet ^{[3
]}

机构：

[1] Bandirma Onyedi Eylul Univ, Comp Engn Dept, Balikesir, Turkey

[2] Sakarya Univ Appl Sci, Comp Engn Dept, Sakarya, Turkey

[3] Kocaeli Univ, Comp Engn Dept, Kocaeli, Turkey

[4] Kocaeli Univ, Informat Syst Engn Dept, Kocaeli, Turkey

来源：

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS | 2022年 / 21卷 / 02期

关键词：

Document analysis and recognition; document classification; text mining; deep learning; NETWORKS;

D O I：

10.1142/S1469026822500134

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The digitalization era has brought digital documents with it, and the classification of document images has become an important need as in classical text documents. Document images, in which text documents are stored as images, contain both text and visual features, unlike images. Therefore, it is possible to use both text and visual features while classifying such data. Considering this situation, in this study, it is aimed to classify document images by using both text and visual features and to determine which feature type is more successful in classification. In the text-based approach, each document/class is labeled with the keywords associated with that document/class and the classification is realized according to whether the document contains the related key-words or not. For visual-based classification, we use four deep learning models namely CNN, NASNet-Large, InceptionV3, and EfficientNetB3. Experimental study is carried out on document images obtained from applicants of the Kocaeli University. As a result, it is seen ii that EfficientNetB3 is the most superior among all with 0.8987 F-score.

引用

页数：15

共 50 条

[1] Multi-Class Skin Lesions Classification Using Deep Features
Usama, Muhammad
Naeem, M. Asif
Mirza, Farhaan
SENSORS, 2022, 22 (21)
[2] Visual and Textual Deep Feature Fusion for Document Image Classification
Bakkali, Souhail
Ming, Zuheng
Coustaty, Mickael
Rusinol, Marcal
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2394 - 2403
[3] Multi-class Enhanced Image Mining of Heterogeneous Textual Images Using Multiple Image Features
Chitrakala, S.
Shamini, P.
Manjula, D.
2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 496 - +
[4] Bag-of-Visual-Words codebook generation using deep features for effective classification of imbalanced multi-class image datasets
Manisha Saini
Seba Susan
Multimedia Tools and Applications, 2021, 80 : 20821 - 20847
[5] Bag-of-Visual-Words codebook generation using deep features for effective classification of imbalanced multi-class image datasets
Saini, Manisha
Susan, Seba
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (14) : 20821 - 20847
[6] Deep Decision Network for Multi-Class Image Classification
Murthy, Venkatesh N.
Singh, Vivek
Chen, Terrence
Manmatha, R.
Comaniciu, Dorin
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2240 - 2248
[7] Multi-class Alzheimer's disease classification using image and clinical features
Altaf, Tooba
Anwar, Syed Muhammad
Gul, Nadia
Majeed, Muhammad Nadeem
Majid, Muhammad
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2018, 43 : 64 - 74
[8] Multi-Class Document Classification Using Lexical Ontology-Based Deep Learning †
Yelmen, Ilkay
Gunes, Ali
Zontul, Metin
APPLIED SCIENCES-BASEL, 2023, 13 (10):
[9] Multi-Class Retinopathy classification in Fundus Image using Deep Learning Approaches
Wankhade, Nisha R.
Bhoyar, Kishor K.
INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2021, 12 (05): : 807 - 816
[10] Using natural class hierarchies in multi-class visual classification
Autio, Ilkka
PATTERN RECOGNITION, 2006, 39 (07) : 1290 - 1299

← 1 2 3 4 5 →