Fusion of acoustic and deep features for pig cough sound recognition

Cited by: 38
Authors
Shen, Weizheng [1]
Ji, Nan [1]
Yin, Yanling [1]
Dai, Baisheng [1]
Tu, Ding [2]
Sun, Baihui [1, 3]
Hou, Handan [4]
Kou, Shengli [5]
Zhao, Yize [6]
Affiliations
[1] Northeast Agr Univ, Sch Elect Engn & Informat, Harbin 150030, Peoples R China
[2] Guangxi Univ Sci & Technol, Tus Coll Digit, Liuzhou 545000, Peoples R China
[3] Heilongjiang Acad Agr Machinery Sci, Mudanjiang Branch, Mudanjiang 157000, Peoples R China
[4] Harbin Finance Univ, Sch Comp Sci, Harbin 150030, Peoples R China
[5] Northeast Agr Univ, Sch Elect Engn & Informat, Harbin, Peoples R China
[6] Univ Calif Irvine, Donald Bren Sch Informat & Comp Sci, Dept Comp Sci, Irvine, CA USA
Funding
National Natural Science Foundation of China
Keywords
Pig cough; Feature fusion; Time-frequency representations; Convolutional neural networks; CLASSIFICATION; ENHANCEMENT;
DOI
10.1016/j.compag.2022.106994
CLC number
S [Agricultural Sciences]
Discipline classification code
09
Abstract
Recognizing pig cough sounds is a prerequisite for early warning of respiratory diseases in pig houses, which is essential for assessing animal welfare and predicting productivity. A crucial step in pig cough recognition is building a representative set of pig sound characteristics. To this end, this paper proposes a feature fusion method that combines acoustic and deep features from audio segments. First, a set of acoustic features from different domains was extracted from the sound signals, and recursive feature elimination based on random forest (RF-RFE) was used for feature selection. Second, time-frequency representations (TFRs) computed with the constant-Q transform (CQT) and the short-time Fourier transform (STFT) were passed through a fine-tuned convolutional neural network (CNN) to extract visual features. Finally, the two kinds of features were combined by early fusion and fed into a support vector machine (SVM) to identify pig cough sounds. The proposed fusion of acoustic and deep features achieved 97.35% accuracy for pig cough recognition. The results provide further evidence that combining acoustic and deep spectrum features yields a robust feature representation for pig cough recognition.
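The early fusion step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that early fusion means concatenating each sample's selected acoustic feature vector with its CNN-derived feature vector before classification, and it adds a per-column standardization step that the abstract does not mention. The function names and the toy numbers are hypothetical.

```python
from statistics import mean, pstdev


def zscore_columns(rows):
    """Standardize each column to zero mean and unit variance.

    Assumption: the abstract does not describe any scaling step; it is
    included here only so that both feature blocks share a common scale.
    """
    columns = list(zip(*rows))
    means = [mean(col) for col in columns]
    # A constant column has zero deviation; fall back to 1.0 to avoid
    # dividing by zero.
    stds = [pstdev(col) or 1.0 for col in columns]
    return [
        [(value - m) / s for value, m, s in zip(row, means, stds)]
        for row in rows
    ]


def early_fusion(acoustic_rows, deep_rows):
    """Concatenate acoustic and deep feature vectors sample by sample."""
    if len(acoustic_rows) != len(deep_rows):
        raise ValueError("both feature sets must describe the same samples")
    acoustic = zscore_columns(acoustic_rows)
    deep = zscore_columns(deep_rows)
    return [a + d for a, d in zip(acoustic, deep)]


# Hypothetical toy data: 3 audio segments, 2 selected acoustic features
# and 3 deep features per segment.
acoustic_features = [[0.2, 110.0], [0.4, 130.0], [0.6, 150.0]]
deep_features = [[1.0, 0.0, 3.0], [2.0, 0.0, 1.0], [3.0, 0.0, 2.0]]

fused = early_fusion(acoustic_features, deep_features)
print(len(fused), len(fused[0]))
```

Each fused row holds one segment's combined representation. In the pipeline the abstract describes, vectors of this kind would be the input to the SVM classifier.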
Pages: 7