LitAI: Enhancing Multimodal Literature Understanding and Mining with Generative AI

被引:0
|
作者
Medisetti, Gowtham [1 ]
Compson, Zacchaeus [1 ]
Fan, Heng [1 ]
Yang, Huaxiao [1 ]
Feng, Yunhe [1 ]
机构
[1] Univ North Texas, Denton, TX 76205 USA
关键词
Literature Mining; OCR; Generative AI; Prompt Engineering; ChatGPT; GPT-4;
D O I
10.1109/MIPR62202.2024.00080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information processing and retrieval in literature are critical for advancing scientific research and knowledge discovery. The inherent multimodality and diverse literature formats, including text, tables, and figures, present significant challenges in literature information retrieval. This paper introduces LitAI, a novel approach that employs readily available generative AI tools to enhance multimodal information retrieval from literature documents. By integrating tools such as optical character recognition (OCR) with generative AI services, LitAI facilitates the retrieval of text, tables, and figures from PDF documents. We have developed specific prompts that leverage in-context learning and prompt engineering within Generative AI to achieve precise information extraction. Our empirical evaluations, conducted on datasets from the ecological and biological sciences, demonstrate the superiority of our approach over several established baselines including Tesseract-OCR and GPT-4. The implementation of LitAI is accessible at https://github.com/ResponsibleAILab/LitAI.
引用
收藏
页码:471 / 476
页数:6
相关论文
共 50 条
  • [1] Integrating Multimodal Generative AI and Blockchain for Enhancing Generative Design in the Early Phase of Architectural Design Process
    Fitriawijaya, Adam
    Jeng, Taysheng
    BUILDINGS, 2024, 14 (08)
  • [2] Neurosymbolic AI for Enhancing Instructability in Generative AI
    Sheth, Amit
    Pallagani, Vishal
    Roy, Kaushik
    IEEE INTELLIGENT SYSTEMS, 2024, 39 (05) : 5 - 11
  • [3] VizChat: Enhancing Learning Analytics Dashboards with Contextualised Explanations Using Multimodal Generative AI Chatbots
    Yan, Lixiang
    Zhao, Linxuan
    Echeverria, Vanessa
    Jin, Yueqiao
    Alfredo, Riordan
    Li, Xinyu
    Gasevi'c, Dragan
    Martinez-Maldonado, Roberto
    ARTIFICIAL INTELLIGENCE IN EDUCATION, PT II, AIED 2024, 2024, 14830 : 180 - 193
  • [4] Multimodal generative AI for medical image interpretation
    Vishwanatha M. Rao
    Michael Hla
    Michael Moor
    Subathra Adithan
    Stephen Kwak
    Eric J. Topol
    Pranav Rajpurkar
    Nature, 2025, 639 (8056) : 888 - 896
  • [5] Generative AI for scalable feedback to multimodal exercises
    Jurgensmeier, Lukas
    Skiera, Bernd
    INTERNATIONAL JOURNAL OF RESEARCH IN MARKETING, 2024, 41 (03) : 468 - 488
  • [6] A multimodal generative AI copilot for human pathology
    Lu, Ming Y.
    Chen, Bowen
    Williamson, Drew F. K.
    Chen, Richard J.
    Zhao, Melissa
    Chow, Aaron K.
    Ikemura, Kenji
    Kim, Ahrong
    Pouli, Dimitra
    Patel, Ankush
    Soliman, Amr
    Chen, Chengkuan
    Ding, Tong
    Wang, Judy J.
    Gerber, Georg
    Liang, Ivy
    Le, Long Phi
    Parwani, Anil V.
    Weishaupt, Luca L.
    Mahmood, Faisal
    NATURE, 2024, 634 (8033) : 466 - 473
  • [7] Enhancing children's understanding of algorithmic biases in and with text-to-image generative AI
    Vartiainen, Henriikka
    Kahila, Juho
    Tedre, Matti
    Lopez-Pernas, Sonsoles
    Pope, Nicolas
    NEW MEDIA & SOCIETY, 2024,
  • [8] Understanding generative AI and its applications
    Lovati, Stefano
    Electronic Products, 2024, 66 (01): : 4 - 5
  • [9] Enhancing User Experience With a Generative AI Chatbot
    Kim, Jeong Soo
    Kim, Minseong
    Baek, Tae Hyun
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2025, 41 (01) : 651 - 663
  • [10] Multimodal Image Synthesis and Editing: The Generative AI Era
    Zhan, Fangneng
    Yu, Yingchen
    Wu, Rongliang
    Zhang, Jiahui
    Lu, Shijian
    Liu, Lingjie
    Kortylewski, Adam
    Theobalt, Christian
    Xing, Eric
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15098 - 15119