Analysis of LFCC Feature Extraction in Baby Crying Classification using KNN

被引:0
作者
Dewi, Sita Purnama [1 ]
Prasasti, Anggunmeka Luhur [1 ]
Irawan, Budhi [1 ]
机构
[1] Telkom Univ Bandung, Fac Elect Engn, Bandung, Indonesia
来源
2019 IEEE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND INTELLIGENCE SYSTEM (IOTAIS) | 2019年
关键词
Speech Recognition; Audio Processing; Baby Crying; LFCC; KNN;
D O I
10.1109/iotais47347.2019.8980389
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cry is a form of communication for children to express their feeling. Baby's cry can be characterized according to its natural periodic tone and the change of voice. It has a base frequency (pitch) in range 250Hz to 600Hz. Through their baby's cries detection, parents can monitor their baby remotely only in important condition. This study of sound recognition has two main processes, the first process is feature extraction and the second process is classification or determining the sound pattern. In the Linear Frequency Cepstral Coefficient (LFCC) method, the analysis of changes in pre-emphasis, numbers of filter bank and numbers of cepstral are conducted. The selection of the filter bank value which applied must be greater than the cepstral value which applied. Cepstral values is adjusted to get the better accuracy. The highest percentage of accuracy is 90% when this system uses 8 as the cepstral value and 3 as the nearest neighbor value, and all rules are considered the best value based on the test results. The use of LFCC as feature extraction method and K-Nearest Neighbor (K-NN) classification can be implemented to detect the baby is crying or not so that it can be applied as a solution for parents to monitor their children remotely only in certain condition.
引用
收藏
页码:86 / 91
页数:6
相关论文
共 19 条
  • [1] Aarts R. M., 2006, J ACOUST SOC AM, V120, P3445
  • [2] Tandem Features for Text-dependent Speaker Verification on the RedDots Corpus
    Alam, Md Jahangir
    Kenny, Patrick
    Gupta, Vishwa
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 420 - 424
  • [3] Balandong R.P., 2013, ACOUSTIC ANAL BABY C
  • [4] Bhagatpatil M.V., 2014, Int J Sci Eng Res, V5, P1379
  • [5] LORD: LOw-complexity, Rate-controlled, Distributed video coding system
    Cohen, Rami
    Malah, David
    [J]. 8TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS 2012), 2012, : 21 - 28
  • [6] Dave N., 2013, Int J Adv Res Eng Technol, V1, P1
  • [7] Dewi SP, 2019, 2019 IEEE INTERNATIONAL CONFERENCE ON SIGNALS AND SYSTEMS (ICSIGSYS), P18, DOI [10.1109/icsigsys.2019.8811070, 10.1109/ICSIGSYS.2019.8811070]
  • [8] Djamal E. C., 2017, INT C EL ENG COMP SC, V4, P250
  • [9] Evolutionary-Neural System to Classify Infant Cry Units for Pathologies Identification in Recently Born Babies
    Fausto Reyes-Galaviz, Orion
    Daniel Cano-Ortiz, Sergio
    Alberto Reyes-Garcia, Carlos
    [J]. PROCEEDINGS OF THE SPECIAL SESSION OF THE SEVENTH MEXICAN INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE - MICAI 2008, 2008, : 330 - +
  • [10] Franti E, 2018, 2018 41ST INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), P424, DOI 10.1109/TSP.2018.8441412