A general framework for subjective information extraction from unstructured English text

被引:10
|
作者
Mangassarian, Hratch [1 ]
Artail, Hassan [1 ]
机构
[1] Amer Univ Beirut, Dept Elect & Comp Engn, Beirut, Lebanon
关键词
information extraction; natural language processing; text evaluation; intelligent systems; financial analysis;
D O I
10.1016/j.datak.2006.10.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present an information extraction (IE) strategy for handling subjective information from unstructured text. The presented methodology is general: it can be useful in many real-life applications that could potentially benefit from an automatic IE system that makes human-like decisions. We test our methodology in the sphere of company news evaluation with respect to the potential effect of the news on the company's stock prices. The described general framework comprises four sequential processing steps: part-of-speech tagging, syntactic parsing, relation generation, and criteria evaluation. The first two steps perform generic NLP tasks, while the last two phases are application-specific and require a thorough understanding of the application domain. We describe each stage and illustrate the flow of the modus operandi. We keep up with the company news evaluation example throughout the paper. Due to the inherent subjectivity of the envisaged problem, results cannot be categorically justified. However, comparing the system's evaluation of company news to our own, the results were very encouraging. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:352 / 367
页数:16
相关论文
共 50 条
  • [31] On Extraction of Event Information from Social Text Streams: An Unpretentious NLP Solution
    Iqbal, Kanwal
    Khan, Muhammad Yaseen
    Wasi, Shaukat
    Mahboob, Shumaila
    Ahmed, Tafseer
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2019, 19 (09): : 121 - 131
  • [32] Transformer based named entity recognition for place name extraction from unstructured text
    Berragan, Cillian
    Singleton, Alex
    Calafiore, Alessia
    Morley, Jeremy
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2023, 37 (04) : 747 - 766
  • [33] Information Extraction System for Transforming Unstructured Text Data in Fire Reports into Structured Forms: A Polish Case Study
    Mironczuk, Marcin Michal
    FIRE TECHNOLOGY, 2020, 56 (02) : 545 - 581
  • [34] Information Extraction System for Transforming Unstructured Text Data in Fire Reports into Structured Forms: A Polish Case Study
    Marcin Michał Mirończuk
    Fire Technology, 2020, 56 : 545 - 581
  • [35] Fusion of visual representations for multimodal information extraction from unstructured transactional documents
    Berke Oral
    Gülşen Eryiğit
    International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 187 - 205
  • [36] Fusion of visual representations for multimodal information extraction from unstructured transactional documents
    Oral, Berke
    Eryigit, Gulsen
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (3) : 187 - 205
  • [37] Automatic information extraction from unstructured mammography reports using distributed semantics
    Gupta, Anupama
    Banerjee, Imon
    Rubin, Daniel L.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 78 : 78 - 86
  • [38] CustFRE: An annotated dataset for extraction of family relations from English text
    Mumtaz, Raabia
    Qadir, Muhammad Abdul
    Saeed, Asif
    DATA IN BRIEF, 2022, 41
  • [39] A hybrid system for temporal information extraction from clinical text
    Tang, Buzhou
    Wu, Yonghui
    Jiang, Min
    Chen, Yukun
    Denny, Joshua C.
    Xu, Hua
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (05) : 828 - 835
  • [40] Extraction of Meaningful Information from Unstructured Clinical Notes Using Web Scraping
    Varshini, K. Sukanya
    Uthra, R. Annie
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (03)