Evaluation of Inter-Observer Reliability of Animal Welfare Indicators: Which Is the Best Index to Use?

被引:12
作者
Giammarino, Mauro [1 ]
Mattiello, Silvana [2 ]
Battini, Monica [2 ]
Quatto, Piero [3 ]
Battaglini, Luca Maria [4 ]
Vieira, Ana C. L. [5 ]
Stilwell, George [6 ]
Renna, Manuela [7 ]
机构
[1] Asl TO3, Vet Serv, Area Anim San, Dept Prevent, I-10045 Piossasco, Italy
[2] Univ Milan, Dept Agr & Environm Sci Prod, Landscape, Agroenergy, Milan, Italy
[3] Univ Milano Bicocca, Dept Econ Management & Stat, I-20126 Milan, Italy
[4] Univ Turin, Dept Agr Forest & Food Sci, I-10095 Grugliasco, Italy
[5] Univ Lisbon, Inst Super Tecn CEG IST, Ctr Management Studies, P-1049001 Lisbon, Portugal
[6] Univ Lisbon, Dept Vet Med, P-1300477 Lisbon, Portugal
[7] Univ Turin, Dept Vet Sci, I-10095 Grugliasco, Italy
来源
ANIMALS | 2021年 / 11卷 / 05期
关键词
agreement index; animal-based measure; dichotomous categorical indicator; inter-rater reliability; ASSESSMENT PROTOCOL; INTERRATER RELIABILITY; OBSERVER AGREEMENT; KAPPA; PREVALENCE; INTERVAL;
D O I
10.3390/ani11051445
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
This study focuses on the problem of assessing inter-observer reliability (IOR) in the case of dichotomous categorical animal-based welfare indicators and the presence of two observers. Based on observations obtained from Animal Welfare Indicators (AWIN) project surveys conducted on nine dairy goat farms, and using udder asymmetry as an indicator, we compared the performance of the most popular agreement indexes available in the literature: Scott's pi, Cohen's k, k(PABAK), Holsti's H, Krippendorff's alpha, Hubert's Gamma, Janson and Vegelius' J, Bangdiwala's B, Andres and Marzo's Delta, and Gwet's gamma(AC(1)). Confidence intervals were calculated using closed formulas of variance estimates for pi, k, k(PABAK), H, alpha, Gamma, J, Delta, and gamma(AC(1)), while the bootstrap and exact bootstrap methods were used for all the indexes. All the indexes and closed formulas of variance estimates were calculated using Microsoft Excel. The bootstrap method was performed with R software, while the exact bootstrap method was performed with SAS software. k, pi, and alpha exhibited a paradoxical behavior, showing unacceptably low values even in the presence of very high concordance rates. B and gamma(AC(1)) showed values very close to the concordance rate, independently of its value. Both bootstrap and exact bootstrap methods turned out to be simpler compared to the implementation of closed variance formulas and provided effective confidence intervals for all the considered indexes. The best approach for measuring IOR in these cases is the use of B or gamma(AC(1)), with bootstrap or exact bootstrap methods for confidence interval calculation.
引用
收藏
页数:16
相关论文
共 62 条
[1]  
Altman DG, 2000, STAT MED, V19, P3275, DOI 10.1002/1097-0258(20001215)19:23<3275::AID-SIM626>3.0.CO
[2]  
2-M
[3]   Delta:: A new measure of agreement between two raters [J].
Andrés, AM ;
Marzo, PF .
BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2004, 57 :1-19
[4]  
Ato M, 2011, PSICOLOGICA, V32, P385
[5]  
AWIN (Animal Welfare Indicators), 2015, AWIN WELF ASS PROT G
[6]   Beyond kappa: A review of interrater agreement measures [J].
Banerjee, M .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 1999, 27 (01) :3-23
[7]  
Bangdiwala S.I., 1985, Proceedings of the 45th International Statistical Institute Meeting, V1, P307
[8]   Invited review: Animal-based indicators for on-farm welfare assessment for dairy goats [J].
Battini, M. ;
Vieira, A. ;
Barbieri, S. ;
Ajuda, I. ;
Stilwell, G. ;
Mattiello, S. .
JOURNAL OF DAIRY SCIENCE, 2014, 97 (11) :6625-6648
[9]   On-Farm Welfare Assessment Protocol for Adult Dairy Goats in Intensive Production Systems [J].
Battini, Monica ;
Stilwell, George ;
Vieira, Ana ;
Barbieri, Sara ;
Canali, Elisabetta ;
Mattiello, Silvana .
ANIMALS, 2015, 5 (04) :934-950
[10]  
Bennett FM, 1954, PUBLIC OPIN QUART, V18, P303