PyBibX - a Python library for bibliometric and scientometric analysis powered with artificial intelligence tools

Cited by: 4
Authors
Pereira, Valdecy [1 ]
Basilio, Marcio Pereira [2 ]
Santos, Carlos Henrique Tarjano [2 ]
Affiliations
[1] Fed Fluminense Univ, Dept Prod Engn, Niteroi, Brazil
[2] Fed Fluminense Univ, Niteroi, Brazil
Keywords
Bibliometrics; Scientometrics; Network analysis; Artificial intelligence; chatGPT; CRITERIA DECISION-ANALYSIS; INCONSISTENCY REDUCTION; CITATION; SCIENCE; SOFTWARE; INFORMATION; SELECTION; TRENDS; SCOPUS; BIAS;
DOI
10.1108/DTA-08-2023-0461
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Purpose: This paper presents pyBibX, a Python library devised to conduct comprehensive bibliometric and scientometric analyses on raw data files sourced from Scopus, Web of Science and PubMed, seamlessly integrating state-of-the-art artificial intelligence (AI) capabilities into its core functionality.
Design/methodology/approach: The library executes a comprehensive exploratory data analysis (EDA), presenting outcomes via visually appealing graphical illustrations. Network capabilities have been deftly integrated, encompassing citation, collaboration and similarity analysis. Furthermore, the library incorporates AI capabilities, including embedding vectors, topic modeling, text summarization and other general natural language processing tasks, employing models such as sentence-BERT, BerTopic, BERT, chatGPT and PEGASUS.
Findings: As a demonstration, we have analyzed 184 documents associated with "multiple-criteria decision analysis" published between 1984 and 2023. The EDA emphasized a growing fascination with decision-making and fuzzy logic methodologies. Next, network analysis further accentuated the significance of central authors and intra-continental collaboration, identifying Canada and China as crucial collaboration hubs. Finally, AI analysis distinguished two primary topics and chatGPT's preeminence in text summarization. It also proved to be an indispensable instrument for interpreting results, as our library enables researchers to pose inquiries to chatGPT regarding bibliometric outcomes. Even so, data homogeneity remains a daunting challenge due to database inconsistencies.
Originality/value: PyBibX is the first application integrating cutting-edge AI capabilities for analyzing scientific publications, enabling researchers to examine and interpret these outcomes more effectively. pyBibX is freely available at https://bit.ly/442wD5z.
Pages: 302-337
Page count: 36