Semantic Augmented Topic Model over Short Text

Cited by: 0
Authors
Li, Lingyun [1, 2]
Sun, Yawei [1, 2]
Wang, Cong [1, 2]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Software Engn, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Trustworthy Distributed Comp & Serv, Beijing 100876, Peoples R China
Source
PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS) | 2018
Keywords
topic model; short text; latent semantic; bi-term topic model;
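DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract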
With the rapid development of the Internet and mobile devices, users produce a vast number of short texts, which pose great challenges to topic modeling because of their severe context sparsity. Traditional topic models perform poorly on short texts because such texts lack word co-occurrence patterns. An effective approach, the bi-term topic model (BTM), models word co-occurrence directly over the whole corpus and performs better than conventional topic models. However, BTM considers only the frequency of bi-terms and ignores the latent semantic information between them, so words with similar semantics run a high risk of being grouped under different topics. In this paper, we propose a latent semantic augmented bi-term topic model (LS-BTM), which incorporates semantic information as prior knowledge to infer topics more reasonably. Experimental results show that our model outperforms other short text topic models on a real-world dataset.
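To make the notion of corpus-level bi-term co-occurrence concrete, here is a minimal Python sketch of bi-term extraction and counting. It assumes that every unordered word pair within one short text forms a bi-term (the whole text is treated as a single context window); the function names and the toy corpus are hypothetical, and the sketch does not reproduce the semantic prior or the inference procedure of LS-BTM, which the abstract does not specify.

```python
from collections import Counter
from itertools import combinations


def extract_biterms(tokens):
    """Return every unordered word pair (bi-term) from one short text.

    Assumption: the whole short text is one context window.
    Each pair is sorted so that (a, b) and (b, a) count as the same bi-term.
    """
    return [tuple(sorted(pair)) for pair in combinations(tokens, 2)]


def corpus_biterm_counts(docs):
    """Aggregate bi-term frequencies over the whole corpus."""
    counts = Counter()
    for tokens in docs:
        counts.update(extract_biterms(tokens))
    return counts


# Hypothetical toy corpus of already tokenized short texts.
docs = [
    ["apple", "phone", "screen"],
    ["apple", "phone"],
    ["banana", "fruit"],
]
counts = corpus_biterm_counts(docs)
```

Counting pairs over the entire corpus rather than per document is what lets this family of models sidestep the sparsity of individual short texts; a frequency table like `counts` is the only signal plain BTM uses, which is the limitation the abstract points to.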
Pages: 652-656
Page count: 5