Discovering research data management trends from job advertisements using a text-mining approach

被引:3
作者
Sheriff, Naseema [1 ,2 ]
Sevukan, R. [1 ]
机构
[1] Pondicherry Univ, Dept Lib & Informat Sci, Pondicherry, India
[2] Pondicherry Univ, Dept Lib & Informat Sci, Pondicherry 605014, India
关键词
Data mining; LDA; natural language processing; recruitment; topic modelling; web scraping; RESEARCH-LIBRARIES; SERVICES; SUPPORT;
D O I
10.1177/01655515231193845
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In today's data-driven culture, research data management (RDM) is essential for the research community. The demand for reusing research datasets is a challenging and diverse process for the scientific community. Despite this, it is essential in RDM to discover trends and themes using text mining, which is scarce. The purpose of this study is to employ text mining to discover insights from job advertisements associated with RDM profiles, which collected 810 advertisements. We found RDM-related patterns using latent Dirichlet allocation (LDA) and identified three key contexts. The first is 'research services in libraries', with the topics of research services, research information, research universities, collection processes and library services. The second context is 'research data', which includes RDM, business data, university data, research data, health research, science research, social science research, data centres, data services, statistical software, digital scholarship and digital preservation. The third context is 'workplace environment', and the topics are leadership, work development and scientific position. Job title normalisation reveals names such as 'data librarian', 'librarian', 'director', 'data curator', 'data manager', 'research data librarian', 'data specialist' and 'data officer' are frequently employed. Focusing on titles with a single or double occurrence is new and interesting for developing nations. Reputable institutions such as Harvard, Stanford and the Massachusetts Institute of Technology, as well as countries such as the United States, the United Kingdom, Canada and Germany, are the major participants in RDM practises and services. This discovery will assist higher education institutions, RDM stakeholders, which aid in the formulation of curriculum, and job seekers to familiarise themselves with the themes.
引用
收藏
页数:17
相关论文
共 82 条
[1]  
Aggarwal C.-C., 2012, Mining text data, P163, DOI [10.1007/978-1-4614-3223-4, DOI 10.1007/978-1-4614-3223-4]
[2]  
Aggarwal Charu C., 2018, Machine Learning for Text
[3]  
Ahmed E., 2023, FAC RES PUBL
[4]   A comparison of research data management platforms: architecture, flexible metadata and interoperability [J].
Amorim, Ricardo Carvalho ;
Castro, Joao Aguiar ;
da Silva, Joao Rocha ;
Ribeiro, Cristina .
UNIVERSAL ACCESS IN THE INFORMATION SOCIETY, 2017, 16 (04) :851-862
[5]   Using text mining to glean insights from COVID-19 literature [J].
Anderson, Billie S. .
JOURNAL OF INFORMATION SCIENCE, 2023, 49 (02) :373-381
[6]  
Anilkumar N., 2018, EPJ WEB C EDP SCI
[7]  
[Anonymous], 2022, METHODOLOGY OVERALL
[8]   Gender stereotypes in job advertisements: What do they imply for the gender salary gap? [J].
Arceo-Gomez, Eva O. ;
Campos-Vazquez, Raymundo M. ;
Badillo, Raquel Y. ;
Lopez-Araiza, Sergio .
JOURNAL OF LABOR RESEARCH, 2022, 43 (01) :65-102
[9]   e!DAL - a framework to store, share and publish research data [J].
Arend, Daniel ;
Lange, Matthias ;
Chen, Jinbo ;
Colmsee, Christian ;
Flemming, Steffen ;
Hecht, Denny ;
Scholz, Uwe .
BMC BIOINFORMATICS, 2014, 15
[10]   A systematic literature review on research data management practices and services [J].
Ashiq, Murtaza ;
Usmani, Muhammad Haroon ;
Naeem, Muhammad .
GLOBAL KNOWLEDGE MEMORY AND COMMUNICATION, 2022, 71 (8/9) :649-671