A DISCRIMINATIVE APPROACH FOR SPEAKER SELECTION IN SPEAKER DE-IDENTIFICATION SYSTEMS

被引:0
作者
Abou-Zleikha, Mohamed [1 ,2 ]
Tan, Zheng-Hua [2 ]
Christensen, Mads Graesboll [1 ]
Jensen, Soren Holdt [2 ]
机构
[1] Aalborg Univ, AD MT, Audio Anal Lab, Aalborg, Denmark
[2] Aalborg Univ, Dept Elect Syst, Aalborg, Denmark
来源
2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2015年
关键词
speaker de-identification; speaker identification; speaker transformation;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speaker de-identification is an interesting and newly investigated task in speech processing. In the current implementations, this task is based on transforming one speaker speech to another speaker in order to hide the speaker identity. In this paper we present a discriminative approach for human speaker selection for speaker de-identification. We used two modules, a speaker identification system and a speaker transformation one, to select the most appropriate speaker to transform the source speaker speech from a set of speakers, In order to select the target speaker, we minimize the identification confidence of the transformed speech as the source speaker and maximize the confusion about the transformed speech membership to the rest of the speaker models and the identification confidence of the re-transformed speech using the source speaker model. These three factors are combined to achieve overall optimization performance in order to select the best target speaker to transform the source,
引用
收藏
页码:2102 / 2106
页数:5
相关论文
共 50 条
[41]   Limited data speaker identification [J].
Jayanna, H. S. ;
Prasanna, S. R. Mahadeva .
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2010, 35 (05) :525-546
[42]   Speech Enhancement for Speaker Identification [J].
Mahesh, R. .
2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
[43]   FORENSIC APPLICATION OF SPEAKER IDENTIFICATION [J].
Draghicescu, Dragos .
UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2015, 77 (03) :107-122
[44]   SUPERVISED SPEAKER EMBEDDING DE-MIXING IN TWO-SPEAKER ENVIRONMENT [J].
Shi, Yanpei ;
Hain, Thomas .
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, :758-765
[45]   A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR [J].
Morrone, Giovanni ;
Zovato, Enrico ;
Brugnara, Fabio ;
Sartori, Enrico ;
Badino, Leonardo .
INTERSPEECH 2024, 2024, :3652-3653
[46]   GROUP NONNEGATIVE MATRIX FACTORISATION WITH SPEAKER AND SESSION VARIABILITY COMPENSATION FOR SPEAKER IDENTIFICATION [J].
Serizel, Romain ;
Essid, Slim ;
Richard, Gael .
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, :5470-5474
[47]   Automatic Speaker Localization based on Speaker Identification -A Smart Room Application- [J].
Ouamour, Siham ;
Sayoud, Halim .
2013 FOURTH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY AND ACCESSIBILITY (ICTA), 2013,
[48]   Speaker-Specific Utterance Ensemble based Transfer Attack on Speaker Identification [J].
Zuo, Chu-Xiao ;
Leng, Jia-Yi ;
Li, Wu-Jun .
INTERSPEECH 2022, 2022, :3203-3207
[49]   Probabilistic approach for speaker transformation [J].
Gao Yin-Qiu ;
Yang Zhen .
2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, :2845-2848
[50]   Discriminative Deep Audio Feature Embedding for Speaker Recognition in the Wild [J].
Bianco, Simone ;
Cereda, Elia ;
Napoletano, Paolo .
2018 IEEE 8TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - BERLIN (ICCE-BERLIN), 2018,