Semantic annotation of unstructured and ungrammatical text

Authors
Michelson, Matthew [1]
Knoblock, Craig A. [1]
Affiliations
[1] University of Southern California, Information Sciences Institute, Marina del Rey, CA 90292, USA
Source
19th International Joint Conference on Artificial Intelligence (IJCAI-05) | 2005
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
There are vast amounts of free text on the internet that are neither grammatical nor formally structured, such as item descriptions on eBay or internet classifieds like Craigslist. These sources of data, called "posts," are full of useful information for agents scouring the Semantic Web, but they lack the semantic annotation to make them searchable. Annotating these posts is difficult since the text generally exhibits little formal grammar and the structure of the posts varies. However, by leveraging collections of known entities and their common attributes, called "reference sets," we can annotate these posts despite their lack of grammar and structure. To use this reference data, we align a post to a member of the reference set, and then exploit this matched member during information extraction. We compare this extraction approach to more traditional information extraction methods that rely on structural and grammatical characteristics, and we show that our approach outperforms traditional methods on this type of data.
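The two steps named in the abstract (align a post to a reference-set member, then use the matched member to drive extraction) can be illustrated with a toy sketch. This is not the authors' method: the car reference set, the function names `tokenize`, `align`, and `annotate`, and the exact-token-overlap scoring are all assumptions chosen for brevity. Real posts contain misspellings and abbreviations, so a practical system would need approximate matching in place of exact token equality.

```python
import re

# Hypothetical reference set: known entities with their common attributes.
REFERENCE_SET = [
    {"make": "Honda", "model": "Civic", "trim": "LX"},
    {"make": "Honda", "model": "Accord", "trim": "EX"},
    {"make": "Toyota", "model": "Corolla", "trim": "LE"},
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def align(post, reference_set):
    """Return the record sharing the most tokens with the post, or None."""
    post_tokens = set(tokenize(post))

    def overlap(record):
        record_tokens = set(tokenize(" ".join(record.values())))
        return len(post_tokens & record_tokens)

    best = max(reference_set, key=overlap)
    return best if overlap(best) > 0 else None


def annotate(post, reference_set):
    """Label post tokens with the attribute of the matched record they hit."""
    match = align(post, reference_set)
    if match is None:
        return {}
    annotations = {}
    for token in tokenize(post):
        for attribute, value in match.items():
            # Exact token equality is a simplifying assumption.
            if token in tokenize(value):
                annotations[attribute] = token
    return annotations


post = "honda civic lx 4dr, runs great!! $3000 obo"
print(align(post, REFERENCE_SET))
print(annotate(post, REFERENCE_SET))
```

In this sketch the matched record supplies the attribute names, so tokens such as "4dr" or "obo" that appear nowhere in the reference set are simply left unlabeled, and a post with no overlapping tokens yields no annotations at all.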
Pages: 1091-1098
Page count: 8