Slang feature extraction by analysing topic change on social media

被引:15
作者
Matsumoto, Kazuyuki [1 ]
Ren, Fuji [1 ]
Matsuoka, Masaya [1 ]
Yoshida, Minoru [1 ]
Kita, Kenji [1 ]
机构
[1] Tokushima Univ, Grad Sch Technol Ind & Social Sci, Minamijosanjima Cho 2-1, Tokushima 7708506, Japan
关键词
37;
D O I
10.1049/trit.2018.1060
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, the authors often see words such as youth slang, neologism and Internet slang on social networking sites (SNSs) that are not registered on dictionaries. Since the documents posted to SNSs include a lot of fresh information, they are thought to be useful for collecting information. It is important to analyse these words (hereinafter referred to as 'slang') and capture their features for the improvement of the accuracy of automatic information collection. This study aims to analyse what features can be observed in slang by focusing on the topic. They construct topic models from document groups including target slang on Twitter by latent Dirichlet allocation. With the models, they chronologically the analyse change of topics during a certain period of time to find out the difference in the features between slang and general words. Then, they propose a slang classification method based on the change of features.
引用
收藏
页码:64 / 71
页数:8
相关论文
共 32 条
[11]   Topic detection using paragraph vectors to support active learning in systematic reviews [J].
Hashimoto, Kazuma ;
Kontonatsios, Georgios ;
Miwa, Makoto ;
Ananiadou, Sophia .
JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 62 :59-65
[12]  
Hisano Y., 2013, IEICE TECHNICAL REPO
[13]  
Hong L., 2010, P 1 WORKSHOP SOCIAL, P80, DOI 10.1145/1964858.1964870
[14]   A probabilistic method for emerging topic tracking in Microblog stream [J].
Huang, Jiajia ;
Peng, Min ;
Wang, Hua ;
Cao, Jinli ;
Gao, Wang ;
Zhang, Xiuzhen .
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2017, 20 (02) :325-350
[15]  
Kimura T., 2015, IEICE T INF SYST, VJ98-D, P1151, DOI [10.14923/transinfj.2014JDP7142, DOI 10.14923/TRANSINFJ.2014JDP7142]
[16]  
Larochelle H., 2012, Adv. Neural Inf. Process. Syst, V25, P2708
[17]  
Lau J.H., 2012, P COLING 2012, P1519
[18]  
Liu Qian, 2018, P 27 INT C COMP LING, P2023
[19]  
Matsumoto K., 2017, INT J ADV INTELL, V9, P145
[20]  
Matsumoto K., 2016, INT J ADV INTELLIGEN, V8, P84