Gender classification of product reviewers in China: a data-driven approach

被引:0
|
作者
Wang, Jing [1 ]
Yan, Xiangbin [2 ]
Zhu, Bin [3 ]
机构
[1] Commun Univ China, Sch Econ & Management, Dept Management Sci & Engn, 1 Dingfuzhuang East St, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Donlinks Sch Econ & Management, Dept Management Sci & Engn, 30 Xueyuan Rd, Beijing, Peoples R China
[3] Oregon State Univ, Coll Business, Dept Business Informat Syst, 2751 SW Jefferson Way, Corvallis, OR USA
基金
中国国家自然科学基金;
关键词
Text mining; Gender classification; Chinese gender lexicon; Na & iuml; ve Bayesian; BP neural network; Support vector machines; ONLINE; DISCOURSE; EMOTION; AUTHOR;
D O I
10.1007/s10799-024-00443-0
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Online product discussion forums have become essential resources for marketers seeking to understand market dynamics and consumer preferences. Identifying the gender of forum participants can further enhance the effectiveness and efficiency of marketing efforts. However, the relationship between linguistic features and gender classification often varies due to contextual factors such as genres, social networks, and social classes. Recognizing that the discriminatory power of gender markers changes with context, this study proposes and validates a framework to guide the adoption of existing gender classification systems specifically for online product discussions. We demonstrate that beyond optimizing the classification methods themselves, performance can be improved by strategically applying these methods to archived discussion data. Our findings reveal that, for a given classification method and discussion forum, the size of the input data significantly influences performance, with an optimal data size existing to achieve the best results.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Who Are the Intended Users of CSR Reports? Insights from a Data-Driven Approach
    Lindgren, Charlie
    Huq, Asif M.
    Carling, Kenneth
    SUSTAINABILITY, 2021, 13 (03) : 1 - 20
  • [42] Data-driven budget allocation of retail media by ad product, funnel metric, and brand size
    Qin, Vivian
    Pauwels, Koen
    Zhou, Bobby
    JOURNAL OF MARKETING ANALYTICS, 2024, 12 (02) : 235 - 249
  • [43] A Data-Driven Approach for qu Prediction of Laboratory Soil-Cement Mixtures
    Tinoco, Joaquim
    Alberto, Antonio
    da Venda, Paulo
    Correia, Antonio Gomes
    Lemos, Luis
    ADVANCES IN TRANSPORTATION GEOTECHNICS III, 2016, 143 : 566 - 573
  • [44] An Integrated Data-Driven Procedure for Product Specification Recommendation Optimization with LDA-LightGBM and QFD
    Wang, Tzu-Chien
    Guo, Ruey-Shan
    Chen, Chialin
    SUSTAINABILITY, 2023, 15 (18)
  • [45] EXPLAINING DATA-DRIVEN DOCUMENT CLASSIFICATIONS
    Martens, David
    Provost, Foster
    MIS QUARTERLY, 2014, 38 (01) : 73 - +
  • [46] Data-driven classification of narrative speech characteristics in stroke aphasia distinguishes neurological and strategic contributions
    Ding, Junhua
    Mirman, Daniel
    CORTEX, 2025, 186 : 61 - 73
  • [47] A new intelligent and data-driven product quality control system of industrial valve manufacturing process in CPS
    Pang, Jihong
    Zhang, Nan
    Xiao, Quan
    Qi, Faqun
    Xue, Xiaobo
    COMPUTER COMMUNICATIONS, 2021, 175 : 25 - 34
  • [48] A Novel Weighted Data-Driven Robust Optimization Approach for Creating Adjustable Uncertainty Sets
    Ghiasvand, Mohsen Roytvand
    Rahmani, Donya
    COMPUTERS & CHEMICAL ENGINEERING, 2023, 178
  • [49] Modelling perceptions of criminality and remorse from faces using a data-driven computational approach
    Funk, Friederike
    Walker, Mirella
    Todorov, Alexander
    COGNITION & EMOTION, 2017, 31 (07) : 1431 - 1443
  • [50] Optimal processing of surface facial EMG to identify emotional expressions: A data-driven approach
    Rutkowska, J. M.
    Ghilardi, T.
    Vacaru, S. V.
    van Schaik, J. E.
    Meyer, M.
    Hunnius, S.
    Oostenveld, R.
    BEHAVIOR RESEARCH METHODS, 2024, 56 (07) : 7331 - 7344