Building a Question-Answering Corpus Using Social Media and News Articles

Cited by: 5
Authors
Cavalin, Paulo [1]
Figueiredo, Flavio [1]
de Bayser, Maira [1]
Moyano, Luis [1]
Candello, Heloisa [1]
Appel, Ana [1]
Souza, Renan [1]
Affiliations
[1] IBM Research, Sao Paulo, Brazil
Source
COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE (PROPOR 2016) | 2016 / Vol. 9727
Keywords
Question and Answer; Social media; Finance
DOI
10.1007/978-3-319-41552-9_36
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Is it possible to develop a reliable QA-Corpus using social media data? What challenges arise when attempting such a task? In this paper, we discuss these questions and present our findings from developing a QA-Corpus on the topic of Brazilian finance. To populate the corpus, we relied on opinions from experts on Brazilian finance who are active on Twitter. From these experts, we extracted information from news websites that is used as answers in the corpus. Moreover, to rank answers to questions effectively, we employ novel word-vector-based similarity measures between short sentences (a category that covers both questions and tweets). We validated our methods on a recently released dataset of similarity between short Portuguese sentences. Finally, we discuss the effectiveness of our approach when used to rank answers to questions from real users.
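The abstract does not specify the similarity measures themselves, so the following is only a minimal sketch of a common baseline for the general idea: each short sentence is represented by the average of its word vectors, and candidate answers are ranked by cosine similarity to the question. The vocabulary, the vector values, and the function names (`sentence_vector`, `cosine`, `rank_answers`) are hypothetical and chosen for illustration; they are not taken from the paper.

```python
import math

# Toy 3-dimensional word vectors. The words and values are hypothetical and
# serve only as an illustration; a real system would load trained embeddings.
TOY_VECTORS = {
    "juros": [0.9, 0.1, 0.0],
    "taxa": [0.8, 0.2, 0.1],
    "selic": [0.85, 0.15, 0.05],
    "dolar": [0.1, 0.9, 0.0],
    "cambio": [0.2, 0.8, 0.1],
    "futebol": [0.0, 0.1, 0.9],
}


def sentence_vector(sentence, vectors):
    """Average the vectors of the known words; return None if none is known."""
    known = [vectors[w] for w in sentence.lower().split() if w in vectors]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank_answers(question, candidates, vectors):
    """Return (score, text) pairs sorted from most to least similar."""
    q = sentence_vector(question, vectors)
    scored = []
    for text in candidates:
        c = sentence_vector(text, vectors)
        # Sentences with no known words get a neutral score of 0.0.
        score = cosine(q, c) if q is not None and c is not None else 0.0
        scored.append((score, text))
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    ranking = rank_answers(
        "taxa selic", ["dolar cambio", "juros taxa", "futebol"], TOY_VECTORS
    )
    for score, text in ranking:
        print(f"{score:.3f}  {text}")
```

Averaging ignores word order and weights every word equally, which is why it is usually treated as a baseline rather than a final measure for short, noisy texts such as tweets.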
Pages: 353-358
Page count: 6