Statistical evaluation of bacterial source tracking data obtained by rep-PCR DNA fingerprinting of Escherichia coli

Cited by: 18
Authors
Albert, JM
Munakata-Marr, J [1]
Tenorio, L
Siegrist, RL
Affiliations
[1] Colorado Sch Mines, Environm Sci & Engn Div, Golden, CO 80401 USA
[2] Colorado Sch Mines, Dept Math & Comp Sci, Golden, CO 80401 USA
DOI
10.1021/es034211q
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Pattern recognition has been applied to environmental systems for identification of numerous pollution sources including aerosolized lead and petroleum hydrocarbons. In recent years, DNA fingerprinting has gained widespread application as a means to characterize genetic variations for such purposes as microbial source tracking. This approach, however, is strongly dependent on the statistical and image analyses applied. Several statistical analyses of rep-PCR DNA fingerprints were assessed as a means to differentiate between potential sources of fecal contamination. GelCompar II and methods based on penalized discriminant analysis (PDA) and k-nearest neighbors (KNN) classification procedures were used to differentiate between 10 source groups within a library containing DNA fingerprints of 548 Escherichia coli isolates from known human and nonhuman sources. KNN performed significantly better than PDA in a jackknife analysis, though the library was not large enough to detect significant differences between GelCompar II and the other two methods. GelCompar II and KNN both attained ≥90% correct classification in a holdout procedure. In addition, interpoint distance analyses indicate coherency within source groups, while library randomization demonstrated that KNN does not create artificial groupings. This investigation stresses the need to understand limitations of statistical analyses used in pattern recognition of DNA fingerprints.
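The jackknife and library-randomization checks named in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the authors' method: fingerprints are modelled as band presence/absence vectors, the distance is Jaccard, k is 3, and the library is synthetic with hypothetical source labels. The abstract does not state the distance measure, the value of k, or the preprocessing actually used.

```python
import random
from collections import Counter


def jaccard_distance(a, b):
    """Distance between two presence/absence fingerprints (assumed representation)."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return 0.0 if either == 0 else 1.0 - both / either


def knn_predict(train, query, k=3):
    """Majority vote among the k labelled fingerprints nearest to the query."""
    neighbors = sorted(train, key=lambda item: jaccard_distance(item[0], query))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


def jackknife_accuracy(library, k=3):
    """Leave each isolate out in turn, classify it from the rest, report the hit rate."""
    correct = 0
    for i, (fingerprint, label) in enumerate(library):
        rest = library[:i] + library[i + 1:]
        if knn_predict(rest, fingerprint, k) == label:
            correct += 1
    return correct / len(library)


def make_synthetic_library(n_per_source=20, n_bands=30, noise=0.05, seed=0):
    """Hypothetical library: one random band pattern per source, copied with bit flips."""
    rng = random.Random(seed)
    library = []
    for source in ("human", "cow", "goose"):  # illustrative labels only
        prototype = [rng.randint(0, 1) for _ in range(n_bands)]
        for _ in range(n_per_source):
            fingerprint = tuple(b ^ (rng.random() < noise) for b in prototype)
            library.append((fingerprint, source))
    return library


def randomized_library(library, seed=1):
    """Shuffle source labels across isolates, destroying any real grouping."""
    rng = random.Random(seed)
    labels = [label for _, label in library]
    rng.shuffle(labels)
    return [(fingerprint, label) for (fingerprint, _), label in zip(library, labels)]


if __name__ == "__main__":
    library = make_synthetic_library()
    print("jackknife accuracy:", jackknife_accuracy(library))
    print("after label randomization:", jackknife_accuracy(randomized_library(library)))
```

If the classifier were inventing groupings, accuracy would stay high after the labels are shuffled; a drop toward chance level is the behaviour the randomization check looks for.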
Pages: 4554-4560
Page count: 7