Document Images Indexing with Relevance Feedback : an Application to Industrial Context

被引:3
|
作者
Augereau, O. [1 ]
Journet, N. [1 ]
Domenger, J. -P. [1 ]
机构
[1] Univ Bordeaux, Lab Bordelais Rech Informat LaBRI, Talence, France
来源
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011) | 2011年
关键词
document image clustering; document retrieval; feature selection; relevance feedback; industrial application; FEATURE-SELECTION;
D O I
10.1109/ICDAR.2011.240
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents a new method to index document images. This work is done in an industrial context where thousands of document images are daily digitized, these images have to be sorted in different classes like payroll, various bills, information letters. We propose a software method which aims to accelerate this task. Usually, the number of document classes is a priori unknown. In this paper, we propose an automatic estimation of this class number. According to this class number, we use a clustering algorithm in order to group document images. After this step, we propose an assisted classification tool based on content based image retrieval method (CBIR). For each cluster, a reference image is automatically selected then considering a similarity measure, the other images are sorted and shown to the user. By interacting with the process, the user can reject wrong images. The user feedback is automatically taken into account to enhance the similarity measure by selecting features. The first tests show that, on average, databases are indexed 3 times faster with our assisted classification method than with a standard manual classification process.
引用
收藏
页码:1190 / 1194
页数:5
相关论文
共 45 条
  • [31] Music-Inspired Optimization Algorithm: Harmony-Tabu for Document Retrieval Using Relevance Feedback
    Latha, K.
    Manivelu, R.
    INFORMATION PROCESSING AND MANAGEMENT, 2010, 70 : 385 - +
  • [32] Application of keyword map-based relevance feedback to interactive Blog search
    Takama, Y
    Kajinami, T
    Matsumura, A
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON ACTIVE MEDIA TECHNOLOGY (AMT 2005), 2005, : 112 - 115
  • [33] Content-Based Image Retrieval Based on Relevance Feedback and Reinforcement Learning for Medical Images
    Lakdashti, Abolfazl
    Ajorloo, Hossein
    ETRI JOURNAL, 2011, 33 (02) : 240 - 250
  • [34] Pattern Extraction in Segmented Satellite Images By Reducing Semantic Gap Using Relevance Feedback Mechanism
    Deepika, N. P.
    Subha, Lekshmi M. S.
    Gopal, Viji
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 1809 - 1816
  • [35] A Fast Adaptive Content-based Retrieval System of Satellite Images Database using Relevance Feedback
    Ezzat Mahmoud, Hanan Mahmoud
    Hefnawy, Alaa Abd El Fatah
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 13, 2006, 13 : 298 - 302
  • [36] ImageRank: A Novel Sorting Algorithm with Relevance Feedback in Application of National Costume Image Retrieval
    Ma, Baiyou
    Xu, Tianwei
    Zhou, Juxiang
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2017, : 166 - 171
  • [37] Distance selection based on relevance feedback in the context of CBIR using the SFS meta-heuristic with one round
    Mosbah, Mawloud
    Boucheham, Bachir
    EGYPTIAN INFORMATICS JOURNAL, 2017, 18 (01) : 1 - 9
  • [38] Integrated probability function and its application to content-based image retrieval by relevance feedback
    King, I
    Jin, Z
    PATTERN RECOGNITION, 2003, 36 (09) : 2177 - 2186
  • [39] Application of texture-based features for text non-text classification in printed document images with novel feature selection algorithm
    Soulib Ghosh
    S. K. Khalid Hassan
    Ali Hussain Khan
    Ankur Manna
    Showmik Bhowmik
    Ram Sarkar
    Soft Computing, 2022, 26 : 891 - 909
  • [40] Application of texture-based features for text non-text classification in printed document images with novel feature selection algorithm
    Ghosh, Soulib
    Hassan, S. K. Khalid
    Khan, Ali Hussain
    Manna, Ankur
    Bhowmik, Showmik
    Sarkar, Ram
    SOFT COMPUTING, 2022, 26 (02) : 891 - 909