A general framework for subjective information extraction from unstructured English text

Cited by: 10
Authors
Mangassarian, Hratch [1]
Artail, Hassan [1]
Affiliations
[1] Amer Univ Beirut, Dept Elect & Comp Engn, Beirut, Lebanon
Keywords
information extraction; natural language processing; text evaluation; intelligent systems; financial analysis;
DOI
10.1016/j.datak.2006.10.001
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we present an information extraction (IE) strategy for handling subjective information from unstructured text. The presented methodology is general: it can be useful in many real-life applications that could potentially benefit from an automatic IE system that makes human-like decisions. We test our methodology in the sphere of company news evaluation with respect to the potential effect of the news on the company's stock prices. The described general framework comprises four sequential processing steps: part-of-speech tagging, syntactic parsing, relation generation, and criteria evaluation. The first two steps perform generic NLP tasks, while the last two phases are application-specific and require a thorough understanding of the application domain. We describe each stage and illustrate the flow of the modus operandi. We keep up with the company news evaluation example throughout the paper. Due to the inherent subjectivity of the envisaged problem, results cannot be categorically justified. However, comparing the system's evaluation of company news to our own, the results were very encouraging. © 2006 Elsevier B.V. All rights reserved.
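The four sequential steps named in the abstract (part-of-speech tagging, syntactic parsing, relation generation, criteria evaluation) can be pictured with a minimal Python sketch. This is not the authors' implementation: the abstract does not describe their tagger, parser, relation format, or criteria, so the toy lexicon `LEXICON`, the weights in `CRITERIA`, the `Relation` class, and every function name below are hypothetical stand-ins chosen only to show how the stages chain together.

```python
from dataclasses import dataclass

# Toy lexicon (assumption): word -> part-of-speech tag.
LEXICON = {
    "reports": "VERB", "announces": "VERB",
    "record": "ADJ", "heavy": "ADJ",
}

# Toy criteria (assumption): object word -> assumed effect on the stock price.
CRITERIA = {"profits": 1, "growth": 1, "losses": -1, "layoffs": -1}


@dataclass(frozen=True)
class Relation:
    subject: str
    verb: str
    obj: str


def pos_tag(sentence):
    """Step 1: tag each token; unknown words default to NOUN."""
    return [(w, LEXICON.get(w, "NOUN")) for w in sentence.lower().split()]


def parse(tagged):
    """Step 2: split around the first verb into subject / verb / object chunks."""
    for i, (_, tag) in enumerate(tagged):
        if tag == "VERB":
            return tagged[:i], tagged[i], tagged[i + 1:]
    return None  # no verb found, so there is no clause to analyse


def generate_relations(parsed):
    """Step 3: turn a parsed clause into a (subject, verb, object) relation."""
    if parsed is None:
        return []
    subj, verb, obj = parsed
    return [Relation(" ".join(w for w, _ in subj),
                     verb[0],
                     " ".join(w for w, _ in obj))]


def evaluate(relations):
    """Step 4: score the relations against the criteria and label the item."""
    score = sum(CRITERIA.get(w, 0) for r in relations for w in r.obj.split())
    return "positive" if score > 0 else "negative" if score < 0 else "neutral"


def evaluate_news(sentence):
    """Run the four steps in sequence on one headline."""
    return evaluate(generate_relations(parse(pos_tag(sentence))))
```

In this sketch the first two functions are domain-independent, while the relation format and the criteria table carry the domain knowledge, which mirrors the split between generic and application-specific stages described in the abstract. A call such as `evaluate_news("Acme reports record profits")` runs all four steps on a single made-up headline.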
Pages: 352-367
Page count: 16