A general framework for subjective information extraction from unstructured English text

被引:10
|
作者
Mangassarian, Hratch [1 ]
Artail, Hassan [1 ]
机构
[1] Amer Univ Beirut, Dept Elect & Comp Engn, Beirut, Lebanon
关键词
information extraction; natural language processing; text evaluation; intelligent systems; financial analysis;
D O I
10.1016/j.datak.2006.10.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present an information extraction (IE) strategy for handling subjective information from unstructured text. The presented methodology is general: it can be useful in many real-life applications that could potentially benefit from an automatic IE system that makes human-like decisions. We test our methodology in the sphere of company news evaluation with respect to the potential effect of the news on the company's stock prices. The described general framework comprises four sequential processing steps: part-of-speech tagging, syntactic parsing, relation generation, and criteria evaluation. The first two steps perform generic NLP tasks, while the last two phases are application-specific and require a thorough understanding of the application domain. We describe each stage and illustrate the flow of the modus operandi. We keep up with the company news evaluation example throughout the paper. Due to the inherent subjectivity of the envisaged problem, results cannot be categorically justified. However, comparing the system's evaluation of company news to our own, the results were very encouraging. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:352 / 367
页数:16
相关论文
共 50 条
  • [1] Extraction of protein interaction information from unstructured text using a link grammar parser
    Seoud, Rania A. Abul
    Youssef, Abou-Bakr M.
    Kadah, Yasser M.
    2007 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS: ICCES '07, 2007, : 70 - +
  • [2] A Framework for Relationship Extraction from Unstructured Text via Link Grammar Parsing
    Samuel, Kenneth
    Savas, Onur
    Manikonda, Vikram
    NEXT-GENERATION ANALYST VI, 2018, 10653
  • [3] Information Extraction from Unstructured Recipe Data
    Silva, Nuno
    Ribeiro, David
    Ferreira, Liliana
    PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND TECHNOLOGY APPLICATIONS (ICCTA 2019), 2019, : 165 - 168
  • [4] CyNER: Information Extraction from Unstructured Text of CTI Sources with Noncontextual IOCs
    Fujii, Shota
    Kawaguchi, Nobutaka
    Shigemoto, Tomohiro
    Yamauchi, Toshihiro
    ADVANCES IN INFORMATION AND COMPUTER SECURITY, IWSEC 2022, 2022, 13504 : 85 - 104
  • [5] GeoXTag: Relative Spatial Information Extraction and Tagging of Unstructured Text
    Syed, Mehtab Alam
    Arsevska, Elena
    Roche, Mathieu
    Teisseire, Maguelonne
    25TH AGILE CONFERENCE ON GEOGRAPHIC INFORMATION SCIENCE ARTIFICIAL INTELLIGENCE IN THE SERVICE OF GEOSPATIAL TECHNOLOGIES, 2022, 3
  • [6] Mathematical Expression Extraction from Unstructured Plain Text
    Fernando, Kulakshi
    Ranathunga, Surangika
    Dias, Gihan
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2019), 2019, 11608 : 312 - 320
  • [7] Automatic spatiotemporal and semantic information extraction from unstructured geoscience reports using text mining techniques
    Qiu, Qinjun
    Xie, Zhong
    Wu, Liang
    Tao, Liufeng
    EARTH SCIENCE INFORMATICS, 2020, 13 (04) : 1393 - 1410
  • [8] Automatic spatiotemporal and semantic information extraction from unstructured geoscience reports using text mining techniques
    Qinjun Qiu
    Zhong Xie
    Liang Wu
    Liufeng Tao
    Earth Science Informatics, 2020, 13 : 1393 - 1410
  • [9] Automatic problem extraction and analysis from unstructured text in IT tickets
    Agarwal, S.
    Aggarwal, V.
    Akula, A. R.
    Dasgupta, G. B.
    Sridhara, G.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2017, 61 (01) : 41 - 52
  • [10] CBCP: A Method of Causality Extraction from Unstructured Financial Text
    Cao, Lang
    Zhang, Shihua
    Chen, Juxing
    2021 5TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2021, 2021, : 135 - 140