Online Reasoning for Semantic Error Detection in Text

被引:3
作者
Gutierrez F. [1 ]
Dou D. [1 ]
de Silva N. [1 ]
Fickas S. [1 ]
机构
[1] Department of Computer and Information Science, 1202 University of Oregon, Eugene, 97403, OR
来源
Dou, Dejing (dou@cs.uoregon.edu) | 1600年 / Springer Science and Business Media Deutschland GmbH卷 / 06期
基金
美国国家科学基金会;
关键词
Information extraction; Ontology; Semantic error detection;
D O I
10.1007/s13740-017-0079-6
中图分类号
学科分类号
摘要
Identifying incorrect content (i.e., semantic error) in text is a difficult task because of the ambiguous nature of written natural language and the many factors that can make a statement semantically erroneous. Current methods identify semantic errors in a sentence by determining whether it contradicts the domain to which the sentence belongs. However, because these methods are constructed on expected logic contradictions, they cannot handle new or unexpected semantic errors. In this paper, we propose a new method for detecting semantic errors that is based on logic reasoning. Our proposed method converts text into logic clauses, which are later analyzed against a domain ontology by an automatic reasoner to determine its consistency. This approach can provide a complete analysis of the text, since it can analyze a single sentence or sets of multiple sentences. When there are multiple sentences to analyze, in order to avoid the high complexity of reasoning over a large set of logic clauses, we propose rules that reduce the set of sentences to analyze, based on the logic relationships between sentences. In our evaluation, we have found that our proposed method can identify a significant percentage of semantic errors and, in the case of multiple sentences, it does so without significant computational cost. We have also found that both the quality of the information extraction output and modeling elements of the ontology (i.e., property domain and range) affect the capability of detecting errors. © 2017, Springer-Verlag GmbH Germany.
引用
收藏
页码:139 / 153
页数:14
相关论文
共 38 条
[1]  
Bechhofer S., van Harmelen F., Hendler J., Horrocks I., Patel-Schneider P.F., McGuinness D., Stein L., OWL Web Ontology Language, (2004)
[2]  
Bos J., Wide-coverage semantic analysis with boxer, Proceedings of the 2008 conference on semantics in text processing, STEP ’08, pp. 277-286, (2008)
[3]  
Open-domain semantic parsing with boxer. In: Proceedings of the 20th Nordic conference of computational linguistics, NODALIDA, pp. 301-304, (2015)
[4]  
Buitelaar P., Cimiano P., Racioppa S., Siegel M., Ontology-based information extraction with SOBA, Proceedings of the international conference on language resources and evaluation (LREC). ELRA, pp. 2321-2324, (2006)
[5]  
Carlson A., Betteridge J., Hruschka E.R., Coupling semi-supervised learning of categories and relations. In: Proceedings of the NAACL HLT 2009 workshop on semi-supervised learning for natural language processing (SemiSupLearn), pp 1–9, (2009)
[6]  
Fader A., Soderland S., Etzioni O., Identifying relations for open information extraction, Conference on empirical methods in natural language processing (EMNLP, pp. 1535-1545, (2011)
[7]  
Flouris G., Manakanatas D., Kondylakis H., Plexousakis D., Antoniou G., Ontology change: classification and survey, Knowl Eng Rev, 23, 2, pp. 117-152, (2008)
[8]  
Fuchs N.E., Kaljurand K., Schneider G., Attempto controlled English meets the challenges of knowledge representation, reasoning, interoperability and user interfaces, FLAIRS conference. AAAI Press, pp. 664-669, (2006)
[9]  
Gruber T.R., A translation approach to portable ontology specifications, Knowl Acquis, 5, 2, pp. 199-220, (1993)
[10]  
Gutierrez F., Dou D., Fickas S., Griffiths G., Providing grades and feedback for student summaries by ontology-based information extraction, Proceedings of the 21st ACM conference on information and knowledge management (CIKM), pp. 1722-1726, (2012)