Speech Signal Classification Based on Convolutional Neural Networks

被引：0

作者：

Zhang, Xiaomeng ^{[1
]}

Sun, Hao ^{[1
]}

Wang, Shuopeng ^{[1
]}

Xu, Jing ^{[1
]}

机构：

[1] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin 300130, Peoples R China

来源：

COGNITIVE SYSTEMS AND SIGNAL PROCESSING, PT II | 2019年 / 1006卷

关键词：

Speech signal classification; Spectrogram; Convolutional neural networks; IMPLEMENTATION;

D O I：

10.1007/978-981-13-7986-4_25

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In the field of intelligent human-computer interaction, speech signal is the hotspot research field, and has been widely used. For the traditional classification algorithm, the computational complexity is high and the classification accuracy is low. This paper proposes a convolutional neural network based on convolutional neural network. The speech signal classification method converts the speech signal into a form of a spectrogram and inputs it into a convolutional neural network to realize classification of the speech signal. Finally, the training and testing of convolutional neural networks are completed by using the framework of tensorflow. Compared with the traditional classification algorithm, the accuracy of the classification algorithm proposed in this paper reaches about 98%. The results show the feasibility and effectiveness of the experimental method.

引用

页码：281 / 287

页数：7

共 12 条

[1] Prediction of human-Bacillus anthracis protein-protein interactions using multi-layer neural network [J].

Ahmed, Ibrahim ;

Witbooi, Peter ;

Christoffels, Alan .

BIOINFORMATICS, 2018, 34 (24) :4159-4164

[2] Classification of lung sounds using convolutional neural networks [J].

Aykanat, Murat ;

Kilic, Ozkan ;

Kurt, Bahar ;

Saryal, Sevgi .

EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2017,

[3]

Badshah AM, 2017, 2017 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON), P125

[4]

Huang C., 2014, Mathematical Problems in Engineering, V2014, P1, DOI [DOI 10.1155/2014/545723, DOI 10.1155/2014/749604]

[5] Content-based audio classification and retrieval using the nearest feature line method [J].

Li, SZ .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (05) :619-625

[6] Efficient Implementation of an SVM-Based Speech/Music Classifier by Enhancing Temporal Locality in Support Vector References [J].

Lim, Chungsoo ;

Lee, Seong-Ro ;

Chang, Joon-Hyuk .

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (03) :898-904

[7] Classification of Audible Signals by Characteristics of the Human Vocal Apparatus [J].

Llasat, V. .

IEEE LATIN AMERICA TRANSACTIONS, 2013, 11 (01) :77-80

[8] Advances in phase-aware signal processing in speech communication [J].

Mowlaee, Pejman ;

Saeidi, Rahim ;

Stylianou, Yannis .

SPEECH COMMUNICATION, 2016, 81 :1-29

[9] Speaker verification using adapted Gaussian mixture models [J].

Reynolds, DA ;

Quatieri, TF ;

Dunn, RB .

DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) :19-41

[10] Program Implementation of an Algorithm for Recognition of Speech Signals in the Labview Graphics Programming Environment [J].

Tychkov, A. Yu. ;

Alimuradov, A. K. ;

Frantsuzov, M. V. ;

Churakov, P. P. .

MEASUREMENT TECHNIQUES, 2015, 58 (09) :965-969

← 1 2 →