Natural-language retrieval of images based on descriptive captions

被引:38
|
作者
Guglielmo, EJ [1 ]
Rowe, NC [1 ]
机构
[1] USN,POSTGRAD SCH,MONTEREY,CA 93943
关键词
algorithms; experimentation; human factors; performance; captions; multimedia database; type hierarchy; INFORMATION-RETRIEVAL; SYSTEM; ACCESS;
D O I
10.1145/230538.230539
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We describe a prototype intelligent information retrieval system that uses natural-language understanding to efficiently locate captioned data. Multimedia data generally require captions to explain their features and significance. Such descriptive captions often rely on long nominal compounds (strings of consecutive nouns) which create problems of disambiguating word sense. In our system, captions and user queries are parsed and interpreted to produce a logical form, using a detailed theory of the meaning of nominal compounds. A fine-grain match can then compare the logical form of the query to the logical forms for each caption. To improve system efficiency, we first perform a coarse-grain match with index files, using nouns and verbs extracted from the query. Our experiments with randomly selected queries and captions from an existing image library show an increase of 30% in precision and 50% in recall over the keyphrase approach currently used. Our processing times have a median of seven seconds as compared to eight minutes for the existing system, and our system is much easier to use.
引用
收藏
页码:237 / 267
页数:31
相关论文
共 50 条
  • [31] Automated residential layout generation and editing using natural language and images
    Zeng, Pengyu
    Gao, Wen
    Li, Jizhizi
    Yin, Jun
    Chen, Jiling
    Lu, Shuai
    AUTOMATION IN CONSTRUCTION, 2025, 174
  • [32] Word-Based Self-Indexes for Natural Language Text
    Farina, Antonio
    Brisaboa, Nieves R.
    Navarro, Gonzalo
    Claude, Francisco
    Places, Angeles S.
    Rodriguez, Eduardo
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2012, 30 (01)
  • [33] Keyword based Information Retrieval System for Urdu Document Images
    Hussain, Raashid
    Khan, Haris Ahmad
    Siddiqi, Imran
    Khurshid, Khurram
    Masood, Asif
    2015 11TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS), 2015, : 27 - 33
  • [34] Research on Chinese information retrieval based on a hybrid language modeling
    Zheng, De-Quan
    Zhao, Tie-Jun
    Yu, Feng
    Li, Sheng
    Yu, Hao
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 2586 - +
  • [35] On smoothing and scaling language model for sentiment based information retrieval
    Fatma Najar
    Nizar Bouguila
    Advances in Data Analysis and Classification, 2023, 17 : 725 - 744
  • [36] On smoothing and scaling language model for sentiment based information retrieval
    Najar, Fatma
    Bouguila, Nizar
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2023, 17 (03) : 725 - 744
  • [37] Content-Based Retrieval of Medical Images: from Context to Perception
    Bugatti, Pedro H.
    Ponciano-Silva, Marcelo
    Traina, Agma J. M.
    Traina, Caetano, Jr.
    Marques, Paulo M. A.
    2009 22ND IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, 2009, : 374 - +
  • [38] Content-Based Medical Image Retrieval for Medical Radiology Images
    Barac, Dario
    Manojlovic, Teo
    Napravnik, Mateja
    Hrzic, Franko
    Saracevic, Mihaela Mamula
    Miletic, Damir
    Stajduhar, Ivan
    ARTIFICIAL INTELLIGENCE IN MEDICINE, PT II, AIME 2024, 2024, 14845 : 45 - 59
  • [39] BIRAM: A content-based image retrieval framework for medical images
    Moreno, Ramon A.
    Furuie, Sergio S.
    MEDICAL IMAGING 2006: PACS AND IMAGING INFORMATICS, 2006, 6145
  • [40] Detected text-based image retrieval approach for textual images
    Unar, Salahuddin
    Wang, Xingyuan
    Zhang, Chuan
    Wang, Chunpeng
    IET IMAGE PROCESSING, 2019, 13 (03) : 515 - 521