A deep learning approach for predicting the quality of online health expert question-answering services

被引:23
作者
Hu, Ze [1 ]
Zhang, Zhan [1 ]
Yang, Haiqin [2 ]
Chen, Qing [3 ]
Zuo, Decheng [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China
[2] Hang Seng Management Coll, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[3] Harbin Inst Technol, Res Ctr Satellite Technol, Harbin 150001, Heilongjiang, Peoples R China
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
Deep learning; Surface linguistic features; Multimodal learning; Deep belief network; Online health expert question-answering services; Social features;
D O I
10.1016/j.jbi.2017.06.012
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Recently, online health expert question-answering (HQA) services (systems) have attracted more and more health consumers to ask health-related questions everywhere at any time due to the convenience and effectiveness. However, the quality of answers in existing HQA systems varies in different situations. It is significant to provide effective tools to automatically determine the quality of the answers. Two main characteristics in HQA systems raise the difficulties of classification: (1) physicians' answers in an HQA system are usually written in short text, which yields the data sparsity issue; (2) HQA systems apply the quality control mechanism, which refrains the wisdom of crowd. The important information, such as the best answer and the number of users' votes, is missing. To tackle these issues, we prepare the first HQA research data set labeled by three medical experts in 90 days and formulate the problem of predicting the quality of answers in the system as a classification task. We not only incorporate the standard textual feature of answers, but also introduce a set of unique non-textual features, i.e., the popular used surface linguistic features and the novel social features, from other modalities. A multimodal deep belief network (DBN)-based learning framework is then proposed to learn the high-level hidden semantic representations of answers from both textual features and non-textual features while the learned joint representation is fed into popular classifiers to determine the quality of answers. Finally, we conduct extensive experiments to demonstrate the effectiveness of including the non-textual features and the proposed multimodal deep learning framework. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:241 / 253
页数:13
相关论文
共 59 条
  • [1] [Anonymous], 2008, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '08, DOI DOI 10.1145/1390334.1390417
  • [2] [Anonymous], 2008, P 2008 INT C WEB SEA
  • [3] [Anonymous], 2006, Proceedings of the 15th international conference on World Wide Web
  • [4] Beloborodov Alexander, 2013, Advances in Information Retrieval. 35th European Conference on IR Research, ECIR 2013. Proceedings, P680, DOI 10.1007/978-3-642-36973-5_59
  • [5] Beloborodov Alexander, 2014, INFORM ACCESS EVALUA, V8685, P7
  • [6] Bian Jiang., 2009, P 18 INT C WORLD WID, P51, DOI DOI 10.1145/1526709.1526717
  • [7] Cairns Brian L, 2011, AMIA Annu Symp Proc, V2011, P171
  • [8] Cal Y., 2013, INT J NEXT GEN COMPU, V4
  • [9] Campbell C., 2003, P 12 INT C INFORM KN, P528, DOI [DOI 10.1145/956863.956965, 10.1145/956863.956965]
  • [10] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)