Text Processing

被引:0
作者
Couto, Francisco M. [1 ]
机构
[1] Univ Lisbon, Fac Ciencias, Dept Informat, LASIGE, Lisbon, Portugal
来源
DATA AND TEXT PROCESSING FOR HEALTH AND LIFE SCIENCES | 2019年 / 1137卷
关键词
NLP: Natural Language Processing; Text mining; Pattern matching; String matching; Word matching; Evaluation metrics; Regular expressions; Tokenization; NER: Named-Entity Recognition; Relation extraction;
D O I
10.1007/978-3-030-13845-5_4
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the previous chapter we were able to automatically process structured data to retrieve biomedical text about any chemical compound, such as caffeine. This chapter will provide a step-by-step introduction to how we can process that text using shell script commands, specifically extract information about diseases related to caffeine. The goal is to equip the reader with an essential set of skills to extract meaningful information from any text.
引用
收藏
页码:45 / 60
页数:16
相关论文
共 50 条
  • [21] A term normalization method for efficient knowledge acquisition through text processing
    Hwang, Myunggwon
    Jeong, Do-Heon
    Kim, Jinhyung
    Song, Sa-Kwang
    Jung, Hanmin
    Shin, Juhyun
    Kim, Pankoo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 65 (01) : 75 - 91
  • [22] A term normalization method for efficient knowledge acquisition through text processing
    Myunggwon Hwang
    Do-Heon Jeong
    Jinhyung Kim
    Sa-Kwang Song
    Hanmin Jung
    Juhyun Shin
    Pankoo Kim
    Multimedia Tools and Applications, 2013, 65 : 75 - 91
  • [23] Analysis of Document Pre-Processing Effects in Text and Opinion Mining
    Eler, Danilo Medeiros
    Grosa, Denilson
    Pola, Ives
    Garcia, Rogerio
    Correia, Ronaldo
    Teixeira, Jaqueline
    INFORMATION, 2018, 9 (04)
  • [24] Keyword selection and processing strategy for applying text mining to patent analysis
    Noh, Heeyong
    Jo, Yeongran
    Lee, Sungjoo
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (09) : 4348 - 4360
  • [25] A study on the impact of pre-processing techniques in Spanish and English text classification over short and large text documents
    Orellana, Gerardo
    Arias, Belen
    Orellana, Marcos
    Saquicela, Victor
    Baculima, Fernando
    Piedra, Nelson
    PROCEEDINGS 3RD INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND COMPUTER SCIENCE (INCISCOS 2018), 2018, : 277 - 283
  • [26] Automating the generation of lexical patterns for processing free text in clinical documents
    Meng, Frank
    Morioka, Craig
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2015, 22 (05) : 980 - 986
  • [27] Low-level natural language technique for arabic text processing
    Awajan, A
    COMPUTERS AND THEIR APPLICATIONS, 2001, : 387 - 390
  • [28] Text Mining and Analysis of Treatise on Febrile Diseases Based on Natural Language Processing
    Kai Zhao
    Na Shi
    Zhen Sa
    Hua-Xing Wang
    Chun-Hua Lu
    Xiao-Ying Xu
    WorldJournalofTraditionalChineseMedicine, 2020, 6 (01) : 67 - 73
  • [29] Impact of Text Pre-processing and Ensemble Learning on Arabic Sentiment Analysis
    Oussous, Ahmed
    Lahcen, Ayoub Ait
    Belfkih, Samir
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEMS & SECURITY (NISS19), 2019,
  • [30] Text mining and analysis of treatise on febrile diseases based on natural language processing
    Zhao, Kai
    Shi, Na
    Sa, Zhen
    Wang, Hua-Xing
    Lu, Chun-Hua
    Xu, Xiao-Ying
    WORLD JOURNAL OF TRADITIONAL CHINESE MEDICINE, 2020, 6 (01) : 67 - 73