On the agreement between bibliometrics and peer review: Evidence from the Italian research assessment exercises

Cited by: 11
Authors
Baccini, Alberto [1]
Barabesi, Lucio [1]
De Nicolao, Giuseppe [2]
Affiliations
[1] Univ Siena, Dept Econ & Stat, Siena, Italy
[2] Univ Pavia, Dept Elect Comp & Biomed Engn, Pavia, Italy
Keywords
EVALUATING SCIENTIFIC-RESEARCH; KAPPA
DOI
10.1371/journal.pone.0242520
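Chinese Library Classification (CLC) codes
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract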
This paper analyzes the concordance between bibliometrics and peer review. It draws evidence from the data of two experiments conducted by the Italian governmental agency for research evaluation. The agency performed the experiments to validate the adoption, in the Italian research assessment exercises, of a dual system of evaluation in which some outputs were evaluated by bibliometrics and others by peer review. The two experiments were based on stratified random samples of journal articles. Each article was scored by bibliometrics and by peer review. The degree of concordance between the two evaluations is then computed. The appropriate setting for the experiments is defined by developing the design-based estimation of Cohen's kappa coefficient and some testing procedures for assessing the homogeneity of missing proportions between strata. The results of both experiments show that, for each research area of science, technology, engineering and mathematics, the degree of agreement between bibliometrics and peer review is, at most, weak at the individual article level. Thus, the outcome of the experiments does not validate the use of the dual system of evaluation in the Italian research assessments. More generally, the very weak concordance indicates that metrics should not replace peer review at the level of the individual article. Hence, the use of the dual system in a research assessment might worsen the quality of information compared to the adoption of peer review only or bibliometrics only.
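For readers unfamiliar with the agreement measure named in the abstract, the following is a minimal sketch of the plain (unweighted, sample-based) Cohen's kappa for two sets of categorical scores. It is not the design-based estimator developed in the paper, which accounts for the stratified sampling design; the function name `cohens_kappa` and the example scores are hypothetical and serve only as illustration.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two equal-length sequences of class labels.

    This ignores any sampling design (no strata, no weights); it is an
    assumption-laden illustration, not the estimator used in the paper.
    """
    if len(ratings_a) != len(ratings_b) or len(ratings_a) == 0:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(ratings_a)

    # Observed agreement: share of items placed in the same class by both methods.
    p_o = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Chance agreement: sum over classes of the product of marginal proportions.
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    p_e = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)

    if p_e == 1.0:
        raise ValueError("kappa is undefined when both methods use one identical class")
    return (p_o - p_e) / (1 - p_e)


# Hypothetical scores for eight articles (not data from the paper).
biblio = ["A", "A", "B", "B", "C", "C", "A", "B"]
peer = ["A", "B", "B", "C", "C", "A", "A", "B"]
kappa = cohens_kappa(biblio, peer)
```

A value of 1 indicates perfect agreement and 0 indicates agreement no better than chance; the abstract's conclusion of "at most weak" agreement refers to values toward the lower end of this scale.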
Pages: 28