A Case Study and Qualitative Analysis of Simple Cross-lingual Opinion Mining

Authors
Hagerer, Gerhard [1 ]
Leung, Wing Sheung [1 ]
Liu, Qiaoxi [1 ]
Danner, Hannah [2 ]
Groh, Georg [1 ]
Affiliations
[1] Tech Univ Munich, Dept Informat, Social Comp Res Grp, Munich, Germany
[2] Tech Univ Munich, TUM Sch Management, Chair Mkt & Consumer Res, Munich, Germany
Source
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KDIR), VOL 1, 2021
Keywords
Opinion Mining; Topic Modeling; Sentiment Analysis; Cross-lingual; Multi-lingual; Market Research
DOI
10.5220/0010649500003064
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
User-generated content from social media is produced in many languages, making it technically challenging to compare the discussed themes from one domain across different cultures and regions. This is relevant for domains in a globalized world, such as market research, where people from two nations and markets might have different requirements for a product. We propose a simple, modern, and effective method for building a single topic model with sentiment analysis capable of covering multiple languages simultaneously, based on a pre-trained state-of-the-art deep neural network for natural language understanding. To demonstrate its feasibility, we apply the model to newspaper articles and user comments of a specific domain, i.e., organic food products and related consumption behavior. The themes match across languages. Additionally, we obtain a high proportion of stable and domain-relevant topics, a meaningful relation between topics and their respective textual contents, and an interpretable representation for social media documents. Marketing can potentially benefit from our method, since it provides an easy-to-use means of addressing specific customer interests from different market regions around the globe. For reproducibility, we provide the code, data, and results of our study.
Pages: 17-26
Number of pages: 10