Bert-Based Latent Semantic Analysis (Bert-LSA): A Case Study on Geospatial Data Technology and Application Trend Analysis

被引:6
|
作者
Cheng, Quanying [1 ,2 ]
Zhu, Yunqiang [1 ,3 ]
Song, Jia [1 ,3 ]
Zeng, Hongyun [4 ]
Wang, Shu [1 ]
Sun, Kai [1 ]
Zhang, Jinqu [5 ]
机构
[1] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
[3] Jiangsu Ctr Collaborat Innovat Geog Informat Reso, Nanjing 210023, Peoples R China
[4] Yunnan Univ, Sch Earth Sci, Kunming 650500, Yunnan, Peoples R China
[5] South China Normal Univ, Sch Comp Sci, Guangzhou 510000, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 24期
基金
中国国家自然科学基金;
关键词
trend analysis; topic modeling; Bert; geospatial data technology and application; BIBLIOMETRIC ANALYSIS;
D O I
10.3390/app112411897
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Geospatial data is an indispensable data resource for research and applications in many fields. The technologies and applications related to geospatial data are constantly advancing and updating, so identifying the technologies and applications among them will help foster and fund further innovation. Through topic analysis, new research hotspots can be discovered by understanding the whole development process of a topic. At present, the main methods to determine topics are peer review and bibliometrics, however they just review relevant literature or perform simple frequency analysis. This paper proposes a new topic discovery method, which combines a word embedding method, based on a pre-trained model, Bert, and a spherical k-means clustering algorithm, and applies the similarity between literature and topics to assign literature to different topics. The proposed method was applied to 266 pieces of literature related to geospatial data over the past five years. First, according to the number of publications, the trend analysis of technologies and applications related to geospatial data in several leading countries was conducted. Then, the consistency of the proposed method and the existing method PLSA (Probabilistic Latent Semantic Analysis) was evaluated by using two similar consistency evaluation indicators (i.e., U-Mass and NMPI). The results show that the method proposed in this paper can well reveal text content, determine development trends, and produce more coherent topics, and that the overall performance of Bert-LSA is better than PLSA using NPMI and U-Mass. This method is not limited to trend analysis using the data in this paper; it can also be used for the topic analysis of other types of texts.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A Study of BERT-Based Classification Performance of Text-Based Health Counseling Data
    Sung, Yeol Woo
    Park, Dae Seung
    Kim, Cheong Ghil
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 135 (01): : 795 - 808
  • [22] BERT-based combination of convolutional and recurrent neural network for indonesian sentiment analysis
    Murfi, Hendri
    Syamsyuriani
    Gowandi, Theresia
    Ardaneswari, Gianinna
    Nurrohmah, Siti
    APPLIED SOFT COMPUTING, 2024, 151
  • [23] BERT-based Classifiers for Fake News Detection on Short and Long Texts with Noisy Data: A Comparative Analysis
    Shushkevich, Elena
    Alexandrov, Mikhail
    Cardiff, John
    TEXT, SPEECH, AND DIALOGUE (TSD 2022), 2022, 13502 : 263 - 274
  • [24] BERT-based NLP techniques for classification and severity modeling in basic warranty data study
    Xu, Shuzhe
    Zhang, Chuanlong
    Hong, Don
    INSURANCE MATHEMATICS & ECONOMICS, 2022, 107 : 57 - 67
  • [25] Span-Level Emotion Cause Analysis by BERT-based Graph Attention Network
    Li, Xiangju
    Gao, Wei
    Feng, Shi
    Wang, Daling
    Joty, Shafiq
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3221 - 3226
  • [26] Advancements in Text Subjectivity Analysis: From Simple Approaches to BERT-Based Models and Generalization Assessments
    Antal, Margit
    Buza, Krisztian
    Nemes, Szilard
    ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2024, PART I, 2024, 2165 : 245 - 255
  • [27] An Efficient Long Chinese Text Sentiment Analysis Method Using BERT-Based Models with BiGRU
    Sheng, Deming
    Yuan, Jingling
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 192 - 197
  • [28] A Novel Framework for Agricultural Futures Price Prediction With BERT-Based Topic Identification and Sentiment Analysis
    Wang, Wensheng
    Liu, Yuxi
    JOURNAL OF FORECASTING, 2025,
  • [29] Temporal Convolutional Networks and BERT-Based Multi-Label Emotion Analysis for Financial Forecasting
    Liapis, Charalampos M.
    Kotsiantis, Sotiris
    INFORMATION, 2023, 14 (11)
  • [30] Enhancing Sentiment Analysis for Chinese Texts Using a BERT-Based Model with a Custom Attention Mechanism
    Ding, Linlin
    Han, Yiming
    Li, Mo
    Li, Dong
    WEB INFORMATION SYSTEMS AND APPLICATIONS, WISA 2024, 2024, 14883 : 172 - 179