Document Images Indexing with Relevance Feedback: an Application to Industrial Context

Cited by: 3
Authors
Augereau, O. [1]
Journet, N. [1]
Domenger, J.-P. [1]
Affiliations
[1] Univ Bordeaux, Lab Bordelais Rech Informat LaBRI, Talence, France
Source
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011) | 2011
Keywords
document image clustering; document retrieval; feature selection; relevance feedback; industrial application
DOI
10.1109/ICDAR.2011.240
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article presents a new method for indexing document images. The work is set in an industrial context where thousands of document images are digitized daily and must be sorted into classes such as payroll documents, various bills, and information letters. We propose a software method that aims to accelerate this task. The number of document classes is usually unknown a priori, so we propose an automatic estimate of it. Given this estimate, a clustering algorithm groups the document images. After this step, we propose an assisted classification tool based on content-based image retrieval (CBIR). For each cluster, a reference image is selected automatically; the other images are then sorted according to a similarity measure and shown to the user. By interacting with the process, the user can reject wrong images. This feedback is taken into account automatically to improve the similarity measure through feature selection. Initial tests show that, on average, databases are indexed 3 times faster with our assisted classification method than with a standard manual classification process.
Pages: 1190-1194
Page count: 5
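The per-cluster loop described in the abstract (pick a reference image, sort the other images by similarity, let the user reject wrong ones, then select features) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the abstract does not specify the features, the similarity measure, the reference-selection rule, or the feature-selection criterion. The sketch assumes each image is already a numeric feature vector, uses a weighted Euclidean distance, takes the cluster medoid as the reference, and keeps a feature only if rejected images differ from the reference on it more than accepted images do. All function names and the toy data are hypothetical.

```python
from math import sqrt


def weighted_distance(a, b, weights):
    """Weighted Euclidean distance between two feature vectors (assumed measure)."""
    return sqrt(sum(w * (x - y) ** 2 for x, y, w in zip(a, b, weights)))


def pick_reference(cluster, weights):
    """Return the index of the medoid: the image with the smallest total
    distance to all images in the cluster (assumed selection rule)."""
    return min(
        range(len(cluster)),
        key=lambda i: sum(weighted_distance(cluster[i], v, weights) for v in cluster),
    )


def rank_by_similarity(cluster, ref_idx, weights):
    """Return the other image indices, most similar to the reference first."""
    others = [i for i in range(len(cluster)) if i != ref_idx]
    return sorted(
        others,
        key=lambda i: weighted_distance(cluster[ref_idx], cluster[i], weights),
    )


def select_features(cluster, ref_idx, rejected, weights):
    """Binary feature selection from user feedback (hypothetical criterion).

    A feature is kept (weight 1.0) when rejected images lie farther from the
    reference on that feature, on average, than accepted images do.
    """
    ref = cluster[ref_idx]
    accepted = [i for i in range(len(cluster)) if i != ref_idx and i not in rejected]
    if not accepted or not rejected:
        return list(weights)  # no usable feedback: leave the measure unchanged
    new_weights = []
    for k in range(len(ref)):
        acc = sum(abs(cluster[i][k] - ref[k]) for i in accepted) / len(accepted)
        rej = sum(abs(cluster[i][k] - ref[k]) for i in rejected) / len(rejected)
        new_weights.append(1.0 if rej > acc else 0.0)
    # Never drop every feature; fall back to the previous weights instead.
    return new_weights if any(new_weights) else list(weights)


# Toy cluster with two made-up features; image 3 belongs to another class.
cluster = [[0.0, 0.5], [0.1, 0.0], [0.05, 1.0], [0.4, 0.5], [0.1, 0.6]]
weights = [1.0, 1.0]

ref = pick_reference(cluster, weights)             # 4
before = rank_by_similarity(cluster, ref, weights) # [0, 3, 2, 1]
weights = select_features(cluster, ref, {3}, weights)  # user rejects image 3
after = rank_by_similarity(cluster, ref, weights)  # [1, 2, 0, 3]
print(ref, before, weights, after)
```

In this toy run the second feature is noisy, so the wrong image initially ranks second; after the user rejects it, only the first feature is kept and the rejected image drops to the end of the list. The class-number estimation and clustering steps mentioned in the abstract are outside the scope of this sketch.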