Combining artificial neural network classification with fully continuous probabilistic genotyping to remove the need for an analytical threshold and electropherogram reading

被引:5
作者
Taylor, Duncan [1 ,2 ]
Buckleton, John [3 ,4 ]
机构
[1] Forens Sci SA, GPO Box 2790, Adelaide, SA 5001, Australia
[2] Flinders Univ S Australia, Sch Biol Sci, GPO Box 2100, Adelaide, SA 5001, Australia
[3] Inst Environm Sci & Res Ltd, Private Bag 92021, Auckland 1142, New Zealand
[4] Univ Auckland, Dept Stat, Auckland, New Zealand
关键词
Artificial neural network; Continuous DNA interpretation; STRmix; Modelling artefacts; DNA mixtures; DNA; CONTRIBUTORS; SYSTEM; NUMBER;
D O I
10.1016/j.fsigen.2022.102787
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Standard processing of electrophoretic data within a forensic DNA laboratory is for one (or two) analysts to designate peaks as either artefactual or non-artefactual in a process commonly referred to as profile 'reading'. Recently, FaSTRTM DNA has been developed to use artificial neural networks to automatically classify fluorescence within an electropherogram as baseline, allele, stutter or pull-up. These classifications are based on probabilities assigned to each timepoint (scan) within the electropherogram. Instead of using the probabilities to assign fluorescence into a category they can be used directly in the profile analysis. This has a number of advantages; increased objectivity in DNA profile processing, the removal for the need for analysts to read profiles, the removal for the need of an analytical threshold. Models within STRmixTM were extended to incorporate the peak label probabilities assigned by FaSTRTM DNA. The performance of the model extensions was tested on a DNA mixture dataset, comprising 2-4 person samples. This dataset was processed in a 'standard' manner using an analytical threshold of 50rfu, analyst peak designations and STRmixTM V2.9 models. The same dataset was then processed in an automated manner using no analytical threshold, no analysts reading the profile and using the STRmixTM models extended to incorporate peak label probabilities. Both datasets were compared to the known DNA donors and a set of non-donors. The result between the two processes was a very close performance, but with a large efficiency gain in the 0rfu process. Utilising peak label probabilities opens up the possibility for a range of workflow process efficiency gains, but beyond this allows full use of all data within an electropherogram.
引用
收藏
页数:22
相关论文
共 24 条
[1]   TAWSEEM: A Deep-Learning-Based Tool for Estimating the Number of Unknown Contributors in DNA Profiling [J].
Alotaibi, Hamdah ;
Alsolami, Fawaz ;
Abozinadah, Ehab ;
Mehmood, Rashid .
ELECTRONICS, 2022, 11 (04)
[2]   Utilising allelic dropout probabilities estimated by logistic regression in casework [J].
Buckleton, John ;
Kelly, Hannah ;
Bright, Jo-Anne ;
Taylor, Duncan ;
Tvedebrink, Torben ;
Curran, James M. .
FORENSIC SCIENCE INTERNATIONAL-GENETICS, 2014, 9 :9-11
[3]   Analysis of forensic DNA mixtures with artefacts [J].
Cowell, R. G. ;
Graversen, T. ;
Lauritzen, S. L. ;
Mortera, J. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2015, 64 (01) :1-48
[4]   Ratios and distances of pull-up peaks observed in GlobalFiler kit data [J].
Fujii, Koji ;
Fukagawa, Takashi ;
Watahiki, Haruhiko ;
Mita, Yusuke ;
Kitayama, Tetsushi ;
Mizuno, Natsuko .
LEGAL MEDICINE, 2018, 34 :58-63
[5]   DNA commission of the International Society of Forensic Genetics: Recommendations on the evaluation of STR typing results that may include drop-out and/or drop-in using probabilistic methods [J].
Gill, P. ;
Gusmao, L. ;
Haned, H. ;
Mayr, W. R. ;
Morling, N. ;
Parson, W. ;
Prieto, L. ;
Prinz, M. ;
Schneider, H. ;
Schneider, P. M. ;
Weir, B. S. .
FORENSIC SCIENCE INTERNATIONAL-GENETICS, 2012, 6 (06) :679-688
[6]   Characterization of degradation and heterozygote balance by simulation of the forensic DNA analysis process [J].
Hansson, Oskar ;
Egeland, Thore ;
Gill, Peter .
INTERNATIONAL JOURNAL OF LEGAL MEDICINE, 2017, 131 (02) :303-317
[7]   Estimating the number of contributors to a DNA profile using decision trees [J].
Kruijver, Maarten ;
Kelly, Hannah ;
Cheng, Kevin ;
Lin, Meng-Han ;
Morawitz, Judi ;
Russell, Laura ;
Buckleton, John ;
Bright, Jo-Anne .
FORENSIC SCIENCE INTERNATIONAL-GENETICS, 2021, 50
[8]   Exploring the probative value of mixed DNA profiles [J].
Kruijver, Maarten ;
Bright, Jo-Anne ;
Kelly, Hannah ;
Buckleton, John .
FORENSIC SCIENCE INTERNATIONAL-GENETICS, 2019, 41 :1-10
[9]  
Lin MH, 2021, Forensic Science International Reports, V3, P100217, DOI [10.1016/j.fsir.2021.100217, 10.1016/j.fsir.2021.100217, DOI 10.1016/J.FSIR.2021.100217]
[10]   PACE: Probabilistic Assessment for Contributor Estimation-A machine learning-based assessment of the number of contributors in DNA mixtures [J].
Marciano, Michael A. ;
Adelman, Jonathan D. .
FORENSIC SCIENCE INTERNATIONAL-GENETICS, 2017, 27 :82-91