LitAI: Enhancing Multimodal Literature Understanding and Mining with Generative AI

被引:0
|
作者
Medisetti, Gowtham [1 ]
Compson, Zacchaeus [1 ]
Fan, Heng [1 ]
Yang, Huaxiao [1 ]
Feng, Yunhe [1 ]
机构
[1] Univ North Texas, Denton, TX 76205 USA
关键词
Literature Mining; OCR; Generative AI; Prompt Engineering; ChatGPT; GPT-4;
D O I
10.1109/MIPR62202.2024.00080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information processing and retrieval in literature are critical for advancing scientific research and knowledge discovery. The inherent multimodality and diverse literature formats, including text, tables, and figures, present significant challenges in literature information retrieval. This paper introduces LitAI, a novel approach that employs readily available generative AI tools to enhance multimodal information retrieval from literature documents. By integrating tools such as optical character recognition (OCR) with generative AI services, LitAI facilitates the retrieval of text, tables, and figures from PDF documents. We have developed specific prompts that leverage in-context learning and prompt engineering within Generative AI to achieve precise information extraction. Our empirical evaluations, conducted on datasets from the ecological and biological sciences, demonstrate the superiority of our approach over several established baselines including Tesseract-OCR and GPT-4. The implementation of LitAI is accessible at https://github.com/ResponsibleAILab/LitAI.
引用
收藏
页码:471 / 476
页数:6
相关论文
共 50 条
  • [21] Understanding and enhancing the selectivity of multimodal protein chromatography
    Cramer, Steven
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2013, 246
  • [22] Enhancing Academic Performance with Generative AI -Based Quiz Platform
    Chang, Chia-Kai
    Chien, Lee-Chia-Tung
    2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, ICALT 2024, 2024, : 193 - 195
  • [23] Beyond ChatGPT: Multimodal generative AI for L2 writers
    Kang, Joohoon
    Yi, Youngjoo
    JOURNAL OF SECOND LANGUAGE WRITING, 2023, 62
  • [24] Enhancing Programming Error Messages in Real Time with Generative AI
    Kimmel, Bailey
    Geisert, Austin Lee
    Yaro, Lily
    Gipson, Brendan
    Hotchkiss, Ronald Taylor
    Osae-Asante, Sidney Kwame
    Vaught, Hunter
    Wininger, Grant
    Yamaguchi, Chase
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [25] Enhancing fieldwork readiness in occupational therapy students with generative AI
    Mansour, Tara
    Wong, John
    FRONTIERS IN MEDICINE, 2024, 11
  • [26] Exploring the scope of generative AI in literature review development
    Schryen, Guido
    Marrone, Mauricio
    Yang, Jiaqi
    ELECTRONIC MARKETS, 2025, 35 (01)
  • [27] Enhancing Generative AI Chatbot Accuracy Using Knowledge Graph
    Bandi, Ajay
    Babu, Jameer
    Zeng, Ruida
    Muthyala, Sai Ram
    SOFTWARE AND DATA ENGINEERING, SEDE 2024, 2025, 2244 : 157 - 167
  • [28] Ironies of Generative AI: Understanding and Mitigating Productivity Loss in Human-AI Interaction
    Simkute, Auste
    Tankelevitch, Lev
    Kewenig, Viktor
    Scott, Ava Elizabeth
    Sellen, Abigail
    Rintel, Sean
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2025, 41 (05) : 2898 - 2919
  • [29] Understanding the Ethics of Generative AI: Established and New Ethical Principles
    Laine, Joakim
    Minkkinen, Matti
    Mantymaki, Matti
    COMMUNICATIONS OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2025, 56
  • [30] Understanding the Impact of Using Generative AI Tools in a Database Course
    Osorio, Valeria Ramirez
    Bernuy, Angela Zavaleta
    Simion, Bogdan
    Liut, Michael
    PROCEEDINGS OF THE 56TH ACM TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, SIGCSE TS 2025, VOL 2, 2025, : 959 - 965