共 37 条
[1]
Desai S(2010)Spectral mapping using artificial neural networks for voice conversion IEEE Trans Audio Speech Lang Process 18 954-964
[2]
Black A(2011)Exemplar-based sparse representations for noise robust automatic speech recognition IEEE Trans Audio Speech Lang Process 19 2067-2080
[3]
Yegnanarayana B(2012)Voice conversion using dynamic kernel partial least squares regression IEEE Trans Audio Speech Lang Process 20 806-817
[4]
Prahallad K(2010)Voice conversion using partial least squares regression IEEE Trans Audio Speech Lang Process 18 912-921
[5]
Gemmeke J(1999)Restructuring speech representations using a pitch-adaptive time–frequency smoothing and an instantaneous-frequency-based f0 extraction: possible role of a repetitive structure in sounds Speech Commun 27 187-207
[6]
Virtanen T(2010)An overview of text-independent speaker recognition: from features to supervectors Speech Commun 52 12-40
[7]
Hurmalainen A(2014)How do we recognise who is speaking? Front Biosci (Scholar edition) 6 92-146
[8]
Helander E(2012)Speaking-aid systems using gmm-based voice conversion for electrolaryngeal speech Speech Commun 54 134-562
[9]
Silén H(2001)Algorithms for non-negative matrix factorization Adv Neural Inf Process Syst 13 556-142
[10]
Virtanen T(1998)Continuous probabilistic transform for voice conversion IEEE Trans Speech Audio Process 6 131-2235