Investigating human reading behavior during sentiment judgment

被引:2
作者
Chen, Xuesong [1 ]
Mao, Jiaxin [1 ]
Liu, Yiqun [1 ]
Zhang, Min [1 ]
Ma, Shaoping [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
User behavior; Eye movement; Sentiment judgment; Machine model; EYE-MOVEMENT CONTROL; MODEL;
D O I
10.1007/s13042-022-01523-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis is an essential task in natural language processing researches. Although existing works have gained much success with both statistical and neural-based solutions, little is known about the human decision process while performing this kind of complex cognitive task. Considering recent advances in human-inspired model design for NLP tasks, it is necessary to investigate the human reading and judging behavior in sentiment classification and adopt these findings to reconsider the sentiment analysis problem. In this paper, we carefully design a lab-based user study in which users' fine-grained reading behaviors during microblog sentiment classification are recorded with an eye-track device. Through systematic analysis of the collected data, we look into the differences between human and machine attention distributions and the differences in human attention while performing different tasks. We find that (1) sentiment judgment is more like an auxiliary task of content comprehension for humans. (2) people have different reading behavior patterns while reading microblog posts with varying labels of sentiment. Based on these findings, we build a human behavior-inspired sentiment prediction model for microblog posts. Experiment results on public-available benchmarks show that the proposed classification model outperforms existing solutions over 2.13% in terms of macro F1-score by introducing behavior features. Our findings may bring insight into the research of designing more effective and explainable sentiment analysis methods.
引用
收藏
页码:2283 / 2296
页数:14
相关论文
共 48 条
[1]  
Bahdanau D., 2014, CORR, DOI DOI 10.48550/ARXIV.1409.0473
[2]  
Barrett Maria, 2018, P 22 C COMP NAT LANG, P302, DOI 10.18653/v1/K18-1030
[3]  
Berger AL, 1996, COMPUT LINGUIST, V22, P39
[4]  
Bicknell K, 2010, ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, P1168
[5]   Do People and Neural Nets Pay Attention to the Same Words? Studying Eye-tracking Data for Non-factoid QA Evaluation [J].
Bolotova, Valeria ;
Blinov, Vladislav ;
Zheng, Yukun ;
Croft, W. Bruce ;
Scholer, Falk ;
Sanderson, Mark .
CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, :85-94
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
Craswell Nick, 2008, P INT C WEB SEARCH W, P87, DOI DOI 10.1145/1341531.1341545
[8]   SWIFT: A dynamical model of saccade generation during reading [J].
Engbert, R ;
Nuthmann, A ;
Richter, EM ;
Kliegl, R .
PSYCHOLOGICAL REVIEW, 2005, 112 (04) :777-813
[9]   Limitations of Transformers on Clinical Text Classification [J].
Gao, Shang ;
Alawad, Mohammed ;
Young, M. Todd ;
Gounley, John ;
Schaefferkoetter, Noah ;
Yoon, Hong Jun ;
Wu, Xiao-Cheng ;
Durbin, Eric B. ;
Doherty, Jennifer ;
Stroup, Antoinette ;
Coyle, Linda ;
Tourassi, Georgia .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (09) :3596-3607
[10]  
Granka L. A., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P478, DOI 10.1145/1008992.1009079