Extraction and representation of contextual information for knowledge discovery in texts

被引:25
|
作者
Perrin, P [1 ]
Petry, FE
机构
[1] Merck Res Labs, Med Chem Mol Syst, Rahway, NJ 07065 USA
[2] Tulane Univ, Dept Elect Engn & Comp Sci, New Orleans, LA 70118 USA
关键词
text mining; text feature construction; extraction and selection; collocational expressions; text representation; first-order logic;
D O I
10.1016/S0020-0255(02)00400-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the role of lexical contextual relations for the problem of unsupervised knowledge discovery in full texts. Narrative texts have inherent structure dictated by language usage in generating them. We suggest that the relative distance of terms within a text gives sufficient information about its structure and its relevant content. Furthermore, this structure can be used to discover implicit knowledge embedded in the text, therefore serving as a good candidate to represent effectively the text content for knowledge elicitation tasks. We qualitatively demonstrate that a useful text structure and content can be systematically extracted by collocational lexical analysis without the need to encode any supplemental sources of knowledge. We present an algorithm that systematically extracts the most relevant facts in the texts and labels them by their overall theme, dictated by local contextual information. It exploits domain independent lexical frequencies and mutual information measures to find the relevant Contextual units in the texts. We report results from experiments in a real-world textual database of psychiatric evaluation reports. (C) 2002 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:125 / 152
页数:28
相关论文
共 50 条
  • [41] Knowledge representation for information integration
    Rousset, MC
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2002, 2366 : 1 - 3
  • [42] Knowledge representation for information integration
    Rousset, MC
    Reynaud, C
    INFORMATION SYSTEMS, 2004, 29 (01) : 3 - 22
  • [43] Lexical knowledge extraction from technical texts
    Blank, I
    ADVANCES IN INTELLIGENT SYSTEMS: CONCEPTS, TOOLS AND APPLICATIONS, 1999, 21 : 155 - 166
  • [44] Knowledge Extraction From Texts Based onWikidata
    Shimorina, Anastasia
    Heinecke, Johannes
    Herledan, Frederic
    2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022, : 297 - 304
  • [45] A Deep Manifold Representation for Information Discovery
    Gao, Lei
    Guan, Ling
    2020 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2020,
  • [46] Contextual information extraction in brain tumour segmentation
    Zia, Muhammad Sultan
    Baig, Usman Ali
    Rehman, Zaka Ur
    Yaqub, Muhammad
    Ahmed, Shahzad
    Zhang, Yudong
    Wang, Shuihua
    Khan, Rizwan
    IET IMAGE PROCESSING, 2023, 17 (12) : 3371 - 3391
  • [47] Knowledge Discovery in Texts for Constructing Decision Support Systems
    Stanley Loh
    José Palazzo M. de Oliveira
    Mauricio A. Gameiro
    Applied Intelligence, 2003, 18 : 357 - 366
  • [48] CLASITEX+:: A tool for knowledge discovery from texts
    Trinidad, JFM
    Martínez, BB
    Arenas, AG
    Shulcloper, JR
    PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 1510 : 459 - 467
  • [49] Knowledge discovery in texts for constructing decision support systems
    Loh, S
    De Oliveira, JPM
    Gameiro, MA
    APPLIED INTELLIGENCE, 2003, 18 (03) : 357 - 366
  • [50] Natural language processing for drug information extraction : Advancing knowledge discovery in biomedical literature
    Koparde, A. A.
    Jadhav, Pradnya A.
    Annie, G. Jisha
    Goyal, Dinesh
    JOURNAL OF STATISTICS AND MANAGEMENT SYSTEMS, 2024, 27 (02) : 383 - 393