A Case Study and Qualitative Analysis of Simple Cross-lingual Opinion Mining

Cited by: 0
Authors
Hagerer, Gerhard [1]
Leung, Wing Sheung [1]
Liu, Qiaoxi [1]
Danner, Hannah [2]
Groh, Georg [1]
Affiliations
[1] Tech Univ Munich, Dept Informat, Social Comp Res Grp, Munich, Germany
[2] Tech Univ Munich, TUM Sch Management, Chair Mkt & Consumer Res, Munich, Germany
Source
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KDIR), VOL 1, 2021
Keywords
Opinion Mining; Topic Modeling; Sentiment Analysis; Cross-lingual; Multi-lingual; Market Research
DOI
10.5220/0010649500003064
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
User-generated content from social media is produced in many languages, making it technically challenging to compare the discussed themes from one domain across different cultures and regions. This is relevant for domains in a globalized world, such as market research, where people from two nations and markets might have different requirements for a product. We propose a simple, modern, and effective method for building a single topic model with sentiment analysis capable of covering multiple languages simultaneously, based on a pre-trained state-of-the-art deep neural network for natural language understanding. To demonstrate its feasibility, we apply the model to newspaper articles and user comments of a specific domain, i.e., organic food products and related consumption behavior. The themes match across languages. Additionally, we obtain a high proportion of stable and domain-relevant topics, a meaningful relation between topics and their respective textual contents, and an interpretable representation for social media documents. Marketing can potentially benefit from our method, since it provides an easy-to-use means of addressing specific customer interests from different market regions around the globe. For reproducibility, we provide the code, data, and results of our study.
Pages: 17-26
Page count: 10