Manuscripts Image Retrieval Using Deep Learning Incorporating a Variety of Fusion Levels

被引：6

作者：

Khayyat, Manal M. ^{[1
,2
]}

Elrefaei, Lamiaa A. ^{[1
,3
]}

机构：

[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Comp Sci, Jeddah 21589, Saudi Arabia

[2] Umm Al Qura Univ, Preparatory Year Joint Med Track, Dept Comp Sci, Mecca 21955, Saudi Arabia

[3] Benha Univ, Dept Elect Engn, Fac Engn Shoubra, Cairo 11629, Egypt

来源：

IEEE ACCESS | 2020年 / 8卷

关键词：

Image retrieval; Feature extraction; Machine learning; Visualization; Image segmentation; Image color analysis; Semantics; fusion methods; similarity measurement; deep learning (DL); convolutional neural networks (CNN); long short-term memory (LSTM); DECISION LEVEL; SCORE; CLASSIFICATION; RECOGNITION; INFORMATION; FEATURES; MODEL; REPRESENTATION; ATTENTION; NETWORKS;

D O I：

10.1109/ACCESS.2020.3010882

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The instantaneous search and retrieval of the most relevant images to a specific query image is a desirable application for all digital libraries. The automatic extraction and classification according to the most distinguishable features, is a crucial step to detect the similarities among images successfully. This study introduces a novel approach that utilizes a fusion model for classifying and retrieving historical Arabic manuscripts' images. To accomplish our goal, the images are first classified according to their extracted deep learning visual features utilizing a pre-trained convolutional neural network. Then, the texts written in the manuscripts' images are extracted and pre-processed to classify the images according to their textual features using an optimized bidirectional LSTM deep learning model with attention and batch normalization layers. Finally, both the visual and textual deep learning models are fused at three different fusion-levels named: decision-level, features-level, and score-level. The score-level fusion model resulted in a considerable improvement of each model used individually. Extensive experimentation and evaluation of the proposed fusion method on the collected ancient Arabic manuscripts dataset proved its robustness against other state-of-the-art methods recording 99% classification accuracy and 98% mean accuracy on the top-10 image retrieval.

引用

页码：136460 / 136486

页数：27

共 86 条

[41] Koch G., 2015, P 32 INT C MACH LEAR, V37, P1
[42] Improving Traffic Flow Prediction With Weather Information in Connected Cars: A Deep Learning Approach
Koesdwiady, Arief
Soua, Ridha
Karray, Fakhreddine
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2016, 65 (12) : 9508 - 9517
[43] Adapting contentz-based image retrieval techniques for the semantic annotation of medical images
Kumar, Ashnil
Dyer, Shane
Kim, Jinman
Li, Changyang
Leong, Philip H. W.
Fulham, Michael
Feng, Dagan
[J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2016, 49 : 37 - 45
[44] Robust Face Recognition Using the Deep C2D-CNN Model Based on Decision-Level Fusion
Li, Jing
Qiu, Tao
Wen, Chang
Xie, Kai
Wen, Fang-Qing
[J]. SENSORS, 2018, 18 (07)
[45] Lip CC, 2012, ADV INTEL SOFT COMPU, V133, P941
[46] Bidirectional LSTM with attention mechanism and convolutional layer for text classification
Liu, Gang
Guo, Jiabao
[J]. NEUROCOMPUTING, 2019, 337 : 325 - 338
[47] High-Performance Solar Steam Device with Layered Channels: Artificial Tree with a Reversed Design
Liu, He
Chen, Chaoji
Chen, Guang
Kuang, Yudi
Zhao, Xinpeng
Song, Jianwei
Jia, Chao
Xu, Xu
Hitz, Emily
Xie, Hua
Wang, Sha
Jiang, Feng
Li, Tian
Li, Yiju
Gong, Amy
Yang, Ronggui
Das, Siddhartha
Hu, Liangbing
[J]. ADVANCED ENERGY MATERIALS, 2018, 8 (08)
[48] Fusion of Deep Learning and Compressed Domain Features for Content-Based Image Retrieval
Liu, Peizhong
Guo, Jing-Ming
Wu, Chi-Yi
Cai, Danlin
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (12) : 5706 - 5717
[49] Recurrent networks with attention and convolutional networks for sentence representation and classification
Liu, Tengfei
Yu, Shuangyuan
Xu, Baomin
Yin, Hongfeng
[J]. APPLIED INTELLIGENCE, 2018, 48 (10) : 3797 - 3806
[50] Marshall A. M., 2014, P CIET ECE DEP C, P1

← 1 2 3 4 5 6 7 8 9 →