Evaluation of the application of sequence data to the identification of outbreaks of disease using anomaly detection methods

Authors
Diaz-Cao, Jose Manuel [1, 2]
Liu, Xin [3 ]
Kim, Jeonghoon [3 ]
Clavijo, Maria Jose [4 ]
Martinez-Lopez, Beatriz [1 ]
Affiliations
[1] Univ Calif Davis, Ctr Anim Dis Modeling & Surveillance CADMS, Sch Vet Med, Dept Med & Epidemiol, Davis, CA 95616 USA
[2] Univ Santiago de Compostela, Dept Patoloxia Anim, Fac Vet Lugo, Lugo, Spain
[3] Univ Calif Davis, Dept Comp Sci, Davis, CA USA
[4] Iowa State Univ, Coll Vet Med, Dept Vet Diagnost & Prod Anim Med, Ames, IA USA
Keywords
Outbreak detection; machine learning; regression; control chart; surveillance; epidemics; production system; animal health; RESPIRATORY SYNDROME VIRUS; CHANGE-POINT ANALYSIS; SYNDROMIC SURVEILLANCE; ABERRATION DETECTION; TIME-SERIES; DETECTION ALGORITHMS; INFECTIOUS-DISEASES; MODEL; TRANSMISSION; INITIATIVES;
DOI
10.1186/s13567-023-01197-3
Chinese Library Classification number
S85 [Animal medicine (veterinary medicine)]
Discipline classification code
0906
Abstract
Anomaly detection methods have great potential to assist in detecting diseases in animal production systems. We used sequence data of Porcine Reproductive and Respiratory Syndrome (PRRS) to define the emergence of new strains at the farm level. We evaluated the performance of 24 anomaly detection methods based on machine learning, regression, time series techniques and control charts to identify outbreaks in time series of new strains, and compared the best methods using different time series: PCR positives, PCR requests and laboratory requests. We introduced synthetic outbreaks of different sizes and calculated the probability of detection of outbreaks (POD), sensitivity (Se), probability of detection of outbreaks in the first week of appearance (POD1w) and background alarm rate (BAR). Time series of new strains from sequence data outperformed the other types of data, but POD, Se and POD1w were high only when outbreaks were large. The methods based on Long Short-Term Memory (LSTM) and Bayesian approaches performed best. Using anomaly detection methods with sequence data may help to identify the emergence of cases in multiple farms, but more work is required to improve detection in time series with high variability. Our results suggest a promising application of sequence data for early detection of diseases at the production system level. This may provide a simple way to extract additional value from routine laboratory analysis. Next steps should include validation of this approach in different settings and with different diseases.
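The evaluation design summarized in the abstract (inject synthetic outbreaks, run a detector, score the alarms) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the abstract does not give formulas for POD, Se, POD1w or BAR, so the definitions in `evaluate` are one common reading of those terms, and the detector is a simple moving-window control-chart rule, not one of the 24 methods evaluated in the study. The function names, the window length, the threshold multiplier `k` and the standard-deviation floor are all hypothetical.

```python
import statistics


def inject_outbreaks(baseline, starts, duration, extra_per_week):
    """Add a fixed number of extra cases per week during each outbreak.

    Returns the modified series and the outbreak windows as (start, end)
    index pairs, with `end` exclusive. Flat outbreaks are an assumption.
    """
    series = list(baseline)
    windows = []
    for start in starts:
        end = min(start + duration, len(series))
        for week in range(start, end):
            series[week] += extra_per_week
        windows.append((start, end))
    return series, windows


def control_chart_alarms(series, window=8, k=3.0, sd_floor=0.5):
    """Flag a week when its count exceeds mean + k * sd of the previous weeks.

    The first `window` weeks can never raise an alarm. `sd_floor` is an
    assumed guard against a zero standard deviation in flat baselines.
    """
    alarms = [False] * len(series)
    for week in range(window, len(series)):
        history = series[week - window:week]
        mean = statistics.mean(history)
        sd = max(statistics.pstdev(history), sd_floor)
        alarms[week] = series[week] > mean + k * sd
    return alarms


def evaluate(alarms, windows):
    """Score alarms against known outbreak windows (assumed definitions).

    POD:   share of outbreaks with at least one alarm during the outbreak.
    POD1w: share of outbreaks with an alarm in their first week.
    Se:    share of outbreak weeks that raised an alarm.
    BAR:   share of non-outbreak weeks that raised an alarm.
    """
    outbreak_weeks = set()
    for start, end in windows:
        outbreak_weeks.update(range(start, end))
    background = [w for w in range(len(alarms)) if w not in outbreak_weeks]
    n_outbreaks = len(windows)
    return {
        "POD": sum(any(alarms[s:e]) for s, e in windows) / n_outbreaks,
        "POD1w": sum(alarms[s] for s, _ in windows) / n_outbreaks,
        "Se": sum(alarms[w] for w in outbreak_weeks) / len(outbreak_weeks),
        "BAR": sum(alarms[w] for w in background) / len(background),
    }


# Toy weekly counts of new strains: a flat baseline with two large outbreaks.
baseline = [2] * 40
series, windows = inject_outbreaks(baseline, starts=[10, 25], duration=3,
                                   extra_per_week=10)
metrics = evaluate(control_chart_alarms(series), windows)
print(metrics)
```

Varying `extra_per_week` in this sketch is one way to explore how detection depends on outbreak size. With a rolling baseline such as this one, the outbreak weeks themselves inflate the threshold for the weeks that follow, which is one reason Se can be much lower than POD.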
Pages: 15
Journal
Veterinary Research, 54