Investigation of acoustic and visual features for pig cough classification

被引:18
作者
Ji, Nan [1 ]
Shen, Weizheng [1 ]
Yin, Yanling [1 ]
Bao, Jun [2 ]
Dai, Baisheng [1 ]
Hou, Handan [3 ]
Kou, Shengli [4 ]
Zhao, Yize [5 ]
机构
[1] Northeast Agr Univ, Sch Elect Engn & Informat, Harbin 150030, Peoples R China
[2] Northeast Agr Univ, Sch Anim Sci & Technol, Harbin 150030, Peoples R China
[3] Harbin Finance Univ, Sch Comp Sci, Harbin 150030, Peoples R China
[4] Northeast Agr Univ, Sci & Technol Off, Harbin 150030, Peoples R China
[5] Univ Calif Irvine, Donald Bren Sch Informat & Comp Sci, Dept Comp Sci, Irvine, CA USA
基金
中国国家自然科学基金;
关键词
Pig cough; Acoustic features; Visual features; Support vector machine; TEXTURE FEATURES; RECOGNITION; LBP; HISTOGRAM; GABOR;
D O I
10.1016/j.biosystemseng.2022.05.010
中图分类号
S2 [农业工程];
学科分类号
0828 ;
摘要
The precise detection of pig cough is a crucial step for establishing an early warning system for pig respiratory diseases. With regard to high precision pig cough recognition, feature extraction and selection are of importance. However, few studies have investigated both acoustic and visual features of pig vocalisations as input features. In this paper, we proposed a novel feature fusion method which fusing acoustic and visual features to achieve an enhanced pig cough recognition rate. We firstly extracted acoustic features from audio signals, including root-mean-square energy (RMS), mel-frequency cepstral coefficients (MFCCs), zero-crossing rates (ZCRs), spectral centroid, spectral roll-off, spectral flatness, spectral bandwidth and chroma. Then, constant-Q transform (CQT) spectrograms were employed to extract visual features involving local binary pattern (LBP) and histogram of gradient (HOG). Subsequently, a hybrid feature set was created by combining acoustic and visual features. In this stage, Pearson correlation coefficient (PCC), recursive feature elimination based on random forest (RF-RFE) and principal component analysis (PCA) were exploited for dimensionality reduction. Finally, support vector machine (SVM), random forest (RF) and k-nearest neighbours (KNN) classifiers were used to conduct a performance evaluation. It is shown that the fused acoustic features (Acoustic) combined with LBP and HOG (A-LH) achieved 96.45% pig cough accuracy. The results reveal that the fusion feature set outperforms acoustic and visual features alone. (C) 2022 IAgrE. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:281 / 293
页数:13
相关论文
共 43 条
[11]   Combined application of Power Spectrum Centroid and Support Vector Machines for measurement improvement in Optical Scanning Systems [J].
Flores-Fuentes, Wendy ;
Rivas-Lopez, Moises ;
Sergiyenko, Oleg ;
Gonzalez-Navarro, Felix F. ;
Rivera-Castillo, Javier ;
Hernandez-Balbuena, Daniel ;
Rodriguez-QuitioneZ, Julio C. .
SIGNAL PROCESSING, 2014, 98 :37-51
[12]   A Survey of Audio-Based Music Classification and Annotation [J].
Fu, Zhouyu ;
Lu, Guojun ;
Ting, Kai Ming ;
Zhang, Dengsheng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (02) :303-319
[13]   Intelligent feature extraction and classification of anuran vocalizations [J].
Huang, Chenn-Jung ;
Chen, You-Jia ;
Chen, Heng-Ming ;
Jian, Jui-Jiun ;
Tseng, Sheng-Chieh ;
Yang, Yi-Ju ;
Hsu, Po-An .
APPLIED SOFT COMPUTING, 2014, 19 :1-7
[14]   A Permutation Importance-Based Feature Selection Method for Short-Term Electricity Load Forecasting Using Random Forest [J].
Huang, Nantian ;
Lu, Guobo ;
Xu, Dianguo .
ENERGIES, 2016, 9 (10)
[15]   Identification of group-housed pigs based on Gabor and Local Binary Pattern features [J].
Huang, Weijia ;
Zhu, Weixing ;
Ma, Changhua ;
Guo, Yizheng ;
Chen, Chen .
BIOSYSTEMS ENGINEERING, 2018, 166 :90-100
[16]   Hybrid-Recursive Feature Elimination for Efficient Feature Selection [J].
Jeon, Hyelynn ;
Oh, Sejong .
APPLIED SCIENCES-BASEL, 2020, 10 (09)
[17]   An Audio Data Representation for Traffic Acoustic Scene Recognition [J].
Jiang, Dazhi ;
Huang, Dongmin ;
Song, Youyi ;
Wu, Kaichao ;
Lu, Huakang ;
Liu, Quanquan ;
Zhou, Teng .
IEEE ACCESS, 2020, 8 :177863-177873
[18]   SPECTRAL-ANALYSIS AND DISCRIMINATION BY ZERO-CROSSINGS [J].
KEDEM, B .
PROCEEDINGS OF THE IEEE, 1986, 74 (11) :1477-1493
[19]   Pre-processing spectrogram parameters improve the accuracy of bioacoustic classification using convolutional neural networks [J].
Knight, Elly C. ;
Hernandez, Sergio Poo ;
Bayne, Erin M. ;
Bulitko, Vadim ;
Tucker, Benjamin, V .
BIOACOUSTICS-THE INTERNATIONAL JOURNAL OF ANIMAL SOUND AND ITS RECORDING, 2020, 29 (03) :337-355
[20]   Recommendations for acoustic recognizer performance assessment with application to five common automated signal recognition programs [J].
Knight, Elly C. ;
Hannah, Kevin C. ;
Foley, Gabriel J. ;
Scott, Chris D. ;
Brigham, R. Mark ;
Bayne, Erin .
AVIAN CONSERVATION AND ECOLOGY, 2017, 12 (02)