SpeakerSense: Energy Efficient Unobtrusive Speaker Identification on Mobile Phones

Authors
Lu, Hong [1]
Brush, A. J. Bernheim [1]
Priyantha, Bodhi [1]
Karlson, Amy K. [1]
Liu, Jie [1]
Affiliations
[1] Microsoft Research, Redmond, WA 98052, USA
Source
PERVASIVE COMPUTING | 2011 / Vol. 6696
Keywords
Continuous audio sensing; mobile phones; speaker identification; energy efficiency; heterogeneous multi-processor hardware
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Automatically identifying the person you are talking with using continuous audio sensing has the potential to enable many pervasive computing applications from memory assistance to annotating life logging data. However, a number of challenges, including energy efficiency and training data acquisition, must be addressed before unobtrusive audio sensing is practical on mobile devices. We built SpeakerSense, a speaker identification prototype that uses a heterogeneous multi-processor hardware architecture that splits computation between a low power processor and the phone's application processor to enable continuous background sensing with minimal power requirements. Using SpeakerSense, we benchmarked several system parameters (sampling rate, GMM complexity, smoothing window size, and amount of training data needed) to identify thresholds that balance computation cost with performance. We also investigated channel compensation methods that make it feasible to acquire training data from phone calls and an automatic segmentation method for training speaker models based on one-to-one conversations.
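The abstract names GMM complexity and smoothing window size as tunable parameters but gives no implementation details. The following is a minimal sketch, not the authors' method, of how per-frame GMM scoring and label smoothing could fit together. It assumes diagonal-covariance GMMs, a trailing majority-vote window, and toy one-dimensional features; the function names, the speaker names, and all model values are hypothetical.

```python
import math
from collections import Counter


def diag_gmm_loglik(x, weights, means, variances):
    """Log-likelihood of one feature vector x under a diagonal-covariance GMM."""
    log_terms = []
    for w, mu, var in zip(weights, means, variances):
        # Log density of one Gaussian component with diagonal covariance.
        ll = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            ll += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
        log_terms.append(ll)
    # Log-sum-exp over components for numerical stability.
    m = max(log_terms)
    return m + math.log(sum(math.exp(t - m) for t in log_terms))


def identify_frames(frames, speaker_models):
    """Pick the best-scoring speaker model for each frame."""
    return [
        max(speaker_models, key=lambda name: diag_gmm_loglik(f, *speaker_models[name]))
        for f in frames
    ]


def smooth_labels(labels, window):
    """Replace each label with the majority label in a trailing window."""
    smoothed = []
    for i in range(len(labels)):
        recent = labels[max(0, i - window + 1): i + 1]
        smoothed.append(Counter(recent).most_common(1)[0][0])
    return smoothed


# Toy models (hypothetical): (weights, means, variances), one component each.
models = {
    "alice": ([1.0], [[0.0]], [[1.0]]),
    "bob": ([1.0], [[5.0]], [[1.0]]),
}
frames = [[0.1], [0.3], [4.0], [-0.2], [0.0]]
raw = identify_frames(frames, models)
smoothed = smooth_labels(raw, window=3)
print(raw)
print(smoothed)
```

In this toy run the third frame is closer to the second model, and the three-frame window outvotes that isolated decision. A larger window suppresses more spurious frame-level errors at the cost of slower reaction to a real change of speaker, which is one reason window size is worth benchmarking.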
Pages: 188-205
Page count: 18