LDA-based term profiles for expert finding in a political setting

Cited by: 12
Authors
de Campos, Luis M. [1 ]
Fernandez-Luna, Juan M. [1 ]
Huete, Juan F. [1 ]
Redondo-Exposito, Luis [1 ]
Affiliations
[1] Univ Granada, CITIC UGR, Dept Ciencias Computac & Inteligencia Artificial, ETSI Informat & Telecomunicac, Periodista Daniel Saucedo Aranda S-N, Granada 18014, Spain
Keywords
Expert finding; User profiles; Topic selection; LDA; Politics; OF-THE-ART; RECOMMENDATION;
DOI
10.1007/s10844-021-00636-x
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A common task in many political institutions (e.g. a parliament) is to find politicians who are experts in a particular field. To tackle this problem, the first step is to obtain politician profiles that capture their interests, and these can be learned automatically from their speeches. As a politician may have several areas of expertise, one alternative is to use a set of subprofiles, each of which covers a different subject. In this study, we propose a novel approach to this task that uses latent Dirichlet allocation (LDA) to determine the main underlying topics of each political speech and to distribute the related terms among the different topic-based subprofiles. With this objective, we propose the use of fifteen distance and similarity measures to automatically determine the optimal number of topics discussed in a document, and we demonstrate that these measures converge into five strategies: Euclidean, Dice, Sorensen, Cosine and Overlap. Our experimental results show that, for expert recommendation tasks, the proposed strategies tend to score higher than the baselines on the different accuracy metrics, and that using an appropriate number of topics proves relevant.
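As a rough illustration of the five measure families named in the abstract, the sketch below computes each one between two term-weight vectors, such as the term distributions of two LDA topics. The formulas are common textbook definitions and are an assumption here: the abstract does not give the exact formulations used in the paper, nor how the measures are combined to choose the number of topics. All function names are hypothetical.

```python
import math

# Illustrative only: textbook definitions, not necessarily the paper's.
# p and q are equal-length lists of non-negative term weights.

def euclidean_distance(p, q):
    """Straight-line distance between the two weight vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def cosine_similarity(p, q):
    """Dot product divided by the product of the vector norms."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm if norm else 0.0

def dice_similarity(p, q):
    """Twice the dot product over the sum of squared norms."""
    dot = sum(a * b for a, b in zip(p, q))
    denom = sum(a * a for a in p) + sum(b * b for b in q)
    return 2 * dot / denom if denom else 0.0

def sorensen_distance(p, q):
    """Sum of absolute differences over the sum of all weights."""
    denom = sum(a + b for a, b in zip(p, q))
    return sum(abs(a - b) for a, b in zip(p, q)) / denom if denom else 0.0

def overlap_similarity(p, q):
    """Set-based overlap coefficient over terms with non-zero weight."""
    terms_p = {i for i, w in enumerate(p) if w > 0}
    terms_q = {i for i, w in enumerate(q) if w > 0}
    smaller = min(len(terms_p), len(terms_q))
    return len(terms_p & terms_q) / smaller if smaller else 0.0

if __name__ == "__main__":
    # Two made-up topic-term distributions over a four-word vocabulary.
    topic_a = [0.5, 0.3, 0.2, 0.0]
    topic_b = [0.1, 0.2, 0.3, 0.4]
    for measure in (euclidean_distance, cosine_similarity, dice_similarity,
                    sorensen_distance, overlap_similarity):
        print(measure.__name__, round(measure(topic_a, topic_b), 4))
```

Euclidean and Sorensen are distances here (lower means more alike), while Cosine, Dice and Overlap are similarities (higher means more alike), so any procedure that compares them needs to account for the direction of each score.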
Pages: 529 - 559
Page count: 31
Related papers
50 in total
  • [11] A personalized hashtag recommendation approach using LDA-based topic model in microblog environment
    Zhao, Feng
    Zhu, Yajun
    Jin, Hai
    Yang, Laurence T.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2016, 65 : 196 - 206
  • [13] A LDA-Based Social Media Data Mining Framework for Plastic Circular Economy
    Xue, Yangyimin
    Kambhampati, Chandrasekhar
    Cheng, Yongqiang
    Mishra, Nishikant
    Wulandhari, Nur
    Deutz, Pauline
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [14] An Empirical Study for PCA- and LDA-Based Feature Reduction for Gas Identification
    Akbar, Muhammad Ali
    Ali, Amine Ait Si
    Amira, Abbes
    Bensaali, Faycal
    Benammar, Mohieddine
    Hassan, Muhammad
    Bermak, Amine
    IEEE SENSORS JOURNAL, 2016, 16 (14) : 5734 - 5746
  • [15] An Empirical Study on Forensic Analysis of Urdu Text Using LDA-Based Authorship Attribution
    Anwar, Waheed
    Bajwa, Imran Sarwar
    Choudhary, M. Abbas
    Ramzan, Shabana
    IEEE ACCESS, 2019, 7 : 3224 - 3234
  • [16] Evaluating the Performance of Topic Modeling Techniques for Bibliometric Analysis Research: An LDA-based Approach
    Nguyen L.T.
    Chansanam W.
    Hunsapun N.
    Chaichuay V.
    Kanyacome S.
    Takhom A.
    Jaroenruen Y.
    Li C.
    HighTech and Innovation Journal, 2024, 5 (02) : 312 - 330
  • [17] High accuracy handwritten Chinese character recognition using LDA-based compound distances
    Gao, Tian-Fu
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2008, 41 (11) : 3442 - 3451
  • [18] Labeling Blog Posts with Wikipedia Entries through LDA-Based Topic Modeling of Wikipedia
    Makita, Kensaku
    Suzuki, Hiroko
    Koike, Daichi
    Utsuro, Takehito
    Kawada, Yasuhide
    Fukuhara, Tomohiro
    JOURNAL OF INTERNET TECHNOLOGY, 2013, 14 (02) : 297 - 306
  • [19] Mining Keywords from Short Text Based on LDA-Based Hierarchical Semantic Graph Model
    Chen, Wei
    Yu, Zhengtao
    Xian, Yantuan
    Wang, Zhenhan
    Wen, Yonghua
    INTERNATIONAL JOURNAL OF INFORMATION SYSTEMS IN THE SERVICE SECTOR, 2020, 12 (02) : 76 - 87
  • [20] Tweeting on COVID-19 pandemic in South Africa: LDA-based topic modelling approach
    Mutanga, Murimo Bethel
    Abayomi, Abdultaofeek
    AFRICAN JOURNAL OF SCIENCE TECHNOLOGY INNOVATION & DEVELOPMENT, 2022, 14 (01) : 163 - 172