Analysis of Job Offers to Measure Gender Barriers through Natural Language Processing and Soft Computing Techniques

被引:0
作者
Puente, Cristina [1 ]
Sanchez-Perez, Ivan [1 ,2 ]
Kolomiyets-Ludwig, Evhenia [3 ]
Palacios-Castrillo, Clara [4 ,5 ]
Wang, Patrick S. P. [6 ]
Palacios, Rafael [5 ,7 ]
机构
[1] Comillas Pontifical Univ, ICAI Sch Engn, Dept Comp Sci, Alberto Aguilera 23, Madrid 28015, Spain
[2] Univ Carlos III Madrid, Dept Phys & Engn, Av Univ 30, Madrid 28911, Spain
[3] Comillas Pontifical Univ, ICADE Business Sch, Alberto Aguilera 23, Madrid 28015, Spain
[4] Carnegie Mellon Univ, Dept Elect & Comp Engn, 5032 Forbes Ave, Pittsburgh, PA 15289 USA
[5] Comillas Pontifical Univ, Inst Res Technol IIT, ICAI Sch Engn, Alberto Aguilera 23, Madrid 28015, Spain
[6] Northeastern Univ, 360 Huntington Ave, Boston, MA 02115 USA
[7] MIT, Cybersecur Sloan CAMS, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
Gender-biased job advertisement; natural language processing; artificial intelligence; equal opportunities; text classification techniques; machine learning; STEREOTYPES; INFORMATION;
D O I
10.1142/S021800142551005X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gender-biased language is still traced in job advertisements. Legal requirements to avoid direct gender-biased adjectives, and the usage of special software to detect and substitute gender-based words, scale up the issue more than solve it. The veil of discrimination on gender in job advertisements becomes more sophisticated with each succeeding level of its official and technical (including AI) prevention. This paper is mainly focused on the application of natural language processing (NLP) to detect gender-biased and discrimination of candidates by analyzing job offers posted online. NLP is an Artificial Intelligence tool that was applied in combination with Term Frequency-Inverse Document Frequency (TF-IDF) and Latent Dirichlet Allocation (LDA) to analyze the type of language used in job advertisements, detect the most relevant words used in the ads, and ultimately detect gender-bias. The main objective of this work is to provide equal access to employment opportunities from the very initial stage of the recruitment process. In addition, clustering techniques were applied to create groups based on the target public and the type of language used, providing evidence of gender-biased practices. The system was tested using a database of 2000 job ads in four different sectors: nursery, secretarial, managerial, and engineering.
引用
收藏
页数:24
相关论文
共 75 条
[1]   An information-theoretic perspective of tf-idf measures [J].
Aizawa, A .
INFORMATION PROCESSING & MANAGEMENT, 2003, 39 (01) :45-65
[2]   Analysis of Harassment Complaints to Detect Witness Intervention by Machine Learning and Soft Computing Techniques [J].
Alonso-Parra, Marina ;
Puente, Cristina ;
Laguna, Ana ;
Palacios, Rafael .
APPLIED SCIENCES-BASEL, 2021, 11 (17)
[3]  
[Anonymous], 2023, The STEM Gap: Women and Girls in Science, Technology, Engineering and Mathematics-AAUW: Empowering Women Since 1881
[4]  
[Anonymous], 2022, The challenge of gender bias: Experiences of women pursuing careers in STEM
[5]   Fuzzy AHP approach for legal judgement summarization [J].
Bansal, Neha ;
Sharma, Arun ;
Singh, R. K. .
JOURNAL OF MANAGEMENT ANALYTICS, 2019, 6 (03) :323-340
[6]  
Basta Christine, 2023, Computational Linguistics and Intelligent Text Processing: 20th International Conference, CICLing 2019, Revised Selected Papers. Lecture Notes in Computer Science (13451), P342, DOI 10.1007/978-3-031-24337-0_25
[7]  
Basta C, 2019, GENDER BIAS IN NATURAL LANGUAGE PROCESSING (GEBNLP 2019), P33
[8]  
Bhaskaran J, 2019, GENDER BIAS IN NATURAL LANGUAGE PROCESSING (GEBNLP 2019), P62
[9]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[10]  
Bogen Miranda, 2018, Help wanted: An examination of hiring algorithms, equity, and bias