Gender classification of product reviewers in China: a data-driven approach

Authors
Wang, Jing [1]
Yan, Xiangbin [2]
Zhu, Bin [3]
Affiliations
[1] Commun Univ China, Sch Econ & Management, Dept Management Sci & Engn, 1 Dingfuzhuang East St, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Donlinks Sch Econ & Management, Dept Management Sci & Engn, 30 Xueyuan Rd, Beijing, Peoples R China
[3] Oregon State Univ, Coll Business, Dept Business Informat Syst, 2751 SW Jefferson Way, Corvallis, OR USA
Funding
National Natural Science Foundation of China
Keywords
Text mining; Gender classification; Chinese gender lexicon; Naïve Bayesian; BP neural network; Support vector machines; ONLINE; DISCOURSE; EMOTION; AUTHOR
DOI
10.1007/s10799-024-00443-0
Chinese Library Classification (CLC) number
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501
Abstract
Online product discussion forums have become essential resources for marketers seeking to understand market dynamics and consumer preferences. Identifying the gender of forum participants can further enhance the effectiveness and efficiency of marketing efforts. However, the relationship between linguistic features and gender often varies with contextual factors such as genre, social network, and social class. Recognizing that the discriminatory power of gender markers changes with context, this study proposes and validates a framework to guide the adoption of existing gender classification systems specifically for online product discussions. We demonstrate that, beyond optimizing the classification methods themselves, performance can be improved by strategically applying these methods to archived discussion data. Our findings reveal that, for a given classification method and discussion forum, the size of the input data significantly influences performance, and that an optimal data size exists at which performance is best.
Pages: 12