Bert-Based Latent Semantic Analysis (Bert-LSA): A Case Study on Geospatial Data Technology and Application Trend Analysis

被引:6
|
作者
Cheng, Quanying [1 ,2 ]
Zhu, Yunqiang [1 ,3 ]
Song, Jia [1 ,3 ]
Zeng, Hongyun [4 ]
Wang, Shu [1 ]
Sun, Kai [1 ]
Zhang, Jinqu [5 ]
机构
[1] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
[3] Jiangsu Ctr Collaborat Innovat Geog Informat Reso, Nanjing 210023, Peoples R China
[4] Yunnan Univ, Sch Earth Sci, Kunming 650500, Yunnan, Peoples R China
[5] South China Normal Univ, Sch Comp Sci, Guangzhou 510000, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 24期
基金
中国国家自然科学基金;
关键词
trend analysis; topic modeling; Bert; geospatial data technology and application; BIBLIOMETRIC ANALYSIS;
D O I
10.3390/app112411897
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Geospatial data is an indispensable data resource for research and applications in many fields. The technologies and applications related to geospatial data are constantly advancing and updating, so identifying the technologies and applications among them will help foster and fund further innovation. Through topic analysis, new research hotspots can be discovered by understanding the whole development process of a topic. At present, the main methods to determine topics are peer review and bibliometrics, however they just review relevant literature or perform simple frequency analysis. This paper proposes a new topic discovery method, which combines a word embedding method, based on a pre-trained model, Bert, and a spherical k-means clustering algorithm, and applies the similarity between literature and topics to assign literature to different topics. The proposed method was applied to 266 pieces of literature related to geospatial data over the past five years. First, according to the number of publications, the trend analysis of technologies and applications related to geospatial data in several leading countries was conducted. Then, the consistency of the proposed method and the existing method PLSA (Probabilistic Latent Semantic Analysis) was evaluated by using two similar consistency evaluation indicators (i.e., U-Mass and NMPI). The results show that the method proposed in this paper can well reveal text content, determine development trends, and produce more coherent topics, and that the overall performance of Bert-LSA is better than PLSA using NPMI and U-Mass. This method is not limited to trend analysis using the data in this paper; it can also be used for the topic analysis of other types of texts.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] An Effective BERT-Based Pipeline for Twitter Sentiment Analysis: A Case Study in Italian
    Pota, Marco
    Ventura, Mirko
    Catelli, Rosario
    Esposito, Massimo
    SENSORS, 2021, 21 (01) : 1 - 21
  • [2] BERT-based Conformal Predictor for Sentiment Analysis
    Maltoudoglou, Lysimachos
    Paisios, Andreas
    Papadopoulos, Harris
    CONFORMAL AND PROBABILISTIC PREDICTION AND APPLICATIONS, VOL 128, 2020, 128 : 269 - 284
  • [3] BERT-Based Stock Market Sentiment Analysis
    Lee, Chien-Cheng
    Gao, Zhongjian
    Tsai, Chun-Li
    2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [4] Bert-based graph unlinked embedding for sentiment analysis
    Jin, Youkai
    Zhao, Anping
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 2627 - 2638
  • [5] Bert-based graph unlinked embedding for sentiment analysis
    Youkai Jin
    Anping Zhao
    Complex & Intelligent Systems, 2024, 10 : 2627 - 2638
  • [6] BERT-Based Sentiment Analysis: A Software Engineering Perspective
    Batra, Himanshu
    Punn, Narinder Singh
    Sonbhadra, Sanjay Kumar
    Agarwal, Sonali
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT I, 2021, 12923 : 138 - 148
  • [7] BERT-Based Sentiment Analysis for Low-Resourced Languages: A Case Study of Urdu Language
    Ashraf, Muhammad Rehan
    Jana, Yasmeen
    Umer, Qasim
    Jaffar, M. Arfan
    Chung, Sungwook
    Ramay, Waheed Yousuf
    IEEE ACCESS, 2023, 11 : 110245 - 110259
  • [8] Semantic and Sentiment Analysis of Selected Bhagavad Gita Translations Using BERT-Based Language Framework
    Chandra, Rohitash
    Kulkarni, Venkatesh
    IEEE ACCESS, 2022, 10 : 21291 - 21315
  • [9] A BERT-based Hierarchical Model for Vietnamese Aspect Based Sentiment Analysis
    Oanh Thi Tran
    Viet The Bui
    2020 12TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (IEEE KSE 2020), 2020, : 269 - 274
  • [10] Adaptive Thresholding for Sentiment Analysis Across Online Reviews Based on BERT Model BERT-based Adaptive Thresholding for Sentiment Analysis
    Lu, Zijie
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MODELING, NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING, CMNM 2024, 2024, : 70 - 75