Using Linked Data for polarity classification of patients' experiences

被引:15
作者
Noferesti, Samira [1 ]
Shamsfard, Mehrnoush [1 ]
机构
[1] Shahid Beheshti Univ Med Sci, Fac Comp Sci & Engn, Tehran, Iran
关键词
Sentiment analysis; Opinion mining; Patient experience mining; Polarity classification; Linked Data; Drug reviews; SENTIMENT;
D O I
10.1016/j.jbi.2015.06.017
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Polarity classification is the main subtask of sentiment analysis and opinion mining, well-known problems in natural language processing that have attracted increasing attention in recent years. Existing approaches mainly rely on the subjective part of text in which sentiment is expressed explicitly through specific words, called sentiment words. These approaches, however, are still far from being good in the polarity classification of patients' experiences since they are often expressed without any explicit expression of sentiment, but an undesirable or desirable effect of the experience implicitly indicates a positive or negative sentiment. This paper presents a method for polarity classification of patients' experiences of drugs using domain knowledge. We first build a knowledge base of polar facts about drugs, called FactNet, using extracted patterns from Linked Data sources and relation extraction techniques. Then, we extract generalized semantic patterns of polar facts and organize them into a hierarchy in order to overcome the missing knowledge issue. Finally, we apply the extracted knowledge, i.e., polar fact instances and generalized patterns, for the polarity classification task. Different from previous approaches for personal experience classification, the proposed method explores the potential benefits of polar facts in domain knowledge aiming to improve the polarity classification performance, especially in the case of indirect implicit experiences, i.e., experiences which express the effect of one entity on other ones without any sentiment words. Using our approach, we have extracted 9703 triplets of polar facts at a precision of 92.26 percent. In addition, experiments on drug reviews demonstrate that our approach can achieve 79.78 percent precision in polarity classification task, and outperforms the state-of-the-art sentiment analysis and opinion mining methods. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:6 / 19
页数:14
相关论文
共 56 条
[31]  
Inui Kentaro, 2008, Wl 2008. 2008 IEEE/WIC/ACM International Conference on Web Intelligence. IAT 2008. 2008 IEEE/WIC/ACM International Conference on Intelligent Agent Technology. Wl-IAT Workshop 2008 2008 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology Workshops, P314, DOI 10.1109/WIIAT.2008.373
[32]  
Jijkoun V., 2010, P NAACL HLT 2010 WOR, P17
[33]  
Jin-Cheon Na, 2012, The Outreach of Digital Libraries: A Globalized Resource Network. 14th International Conference on Asia-Pacific Digital Libraries. (ICADL 2012). Proceedings, P189, DOI 10.1007/978-3-642-34752-8_25
[34]  
Miao Q., 2012, PAC RIM INT C ART IN, P777, DOI [10.1007/978-3-642-32695-0_72, DOI 10.1007/978-3-642-32695-0_72]
[35]  
Mintz M., 2009, ACL, P1003, DOI DOI 10.3115/1690219.1690287
[36]   CROWDSOURCING A WORD-EMOTION ASSOCIATION LEXICON [J].
Mohammad, Saif M. ;
Turney, Peter D. .
COMPUTATIONAL INTELLIGENCE, 2013, 29 (03) :436-465
[37]  
Moro A., 2014, T ASS COMPUT LINGUIS, V2, p231 244, DOI 10.1162/tacl_a_00179
[38]  
Murphy W., 2014, USING SUPERVISED LEA
[39]   BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network [J].
Navigli, Roberto ;
Ponzetto, Simone Paolo .
ARTIFICIAL INTELLIGENCE, 2012, 193 :217-250
[40]   Thumbs up? Sentiment classification using machine learning techniques [J].
Pang, B ;
Lee, L ;
Vaithyanathan, S .
PROCEEDINGS OF THE 2002 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2002, :79-86