Development of a femininity estimator using speaker recognition techniques for voice therapy of gender identity disorder clients
被引:0
作者:
Minematsu, Nobuaki
论文数: 0引用数: 0
h-index: 0
机构:
Univ Tokyo, Tokyo, JapanUniv Tokyo, Tokyo, Japan
Minematsu, Nobuaki
[1
]
Maruyama, Kazutaka
论文数: 0引用数: 0
h-index: 0
机构:
Univ Tokyo, Tokyo, JapanUniv Tokyo, Tokyo, Japan
Maruyama, Kazutaka
[1
]
Sakuraba, Kyoko
论文数: 0引用数: 0
h-index: 0
机构:
Kiyose Shi Welfare Ctr Handicapped, Tokyo, JapanUniv Tokyo, Tokyo, Japan
Sakuraba, Kyoko
[2
]
Hirose, Keikichi
论文数: 0引用数: 0
h-index: 0
机构:
Univ Tokyo, Tokyo, JapanUniv Tokyo, Tokyo, Japan
Hirose, Keikichi
[1
]
Tayama, Niro
论文数: 0引用数: 0
h-index: 0
机构:
Int Med Ctr, Tokyo, JapanUniv Tokyo, Tokyo, Japan
Tayama, Niro
[3
]
Imaizumi, Satoshi
论文数: 0引用数: 0
h-index: 0
机构:
Prefectural Univ Hiroshima, Hiroshima, JapanUniv Tokyo, Tokyo, Japan
Imaizumi, Satoshi
[4
]
Yamauchi, Toshio
论文数: 0引用数: 0
h-index: 0
机构:
Saitama Med Univ, Moroyama, Saitama, JapanUniv Tokyo, Tokyo, Japan
Yamauchi, Toshio
[5
]
机构:
[1] Univ Tokyo, Tokyo, Japan
[2] Kiyose Shi Welfare Ctr Handicapped, Tokyo, Japan
[3] Int Med Ctr, Tokyo, Japan
[4] Prefectural Univ Hiroshima, Hiroshima, Japan
[5] Saitama Med Univ, Moroyama, Saitama, Japan
来源:
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3
|
2007年
关键词:
femininity;
GID;
speaker recognition;
GMM;
D O I:
暂无
中图分类号:
O42 [声学];
学科分类号:
070206 ;
082403 ;
摘要:
This paper describes the development of an estimator of perceptual femininity (PF) of an input utterance using speaker recognition techniques. The estimator was designed for its clinical use and the target speakers are Gender Identity Disorder (GID) clients, especially MtF (Male to Female) transsexuals. The voice therapy for MtFs is composed of three kinds of training; 1) raising the baseline F-0 range, 2) changing the baseline voice quality, and 3) enhancing F-0 dynamics to produce an exaggerated intonation pattern. The first two focus on static acoustic properties of speech and the voice quality is mainly controlled by size and shape of the articulators, which can be acoustically characterized by the spectral envelope. Gaussian Mixture Models (GMM) of F-0 values and spectrums were built separately for biologically male speakers and female ones. Using the four models, PF was estimated automatically for each of 142 utterances of 111 MtFs. The estimated values were compared with the PF values obtained through listening tests. Results showed very high correlation (R=0.86), which is comparable to the intra-rater correlation.