SpeakerSense: Energy Efficient Unobtrusive Speaker Identification on Mobile Phones

被引:0
|
作者
Lu, Hong [1 ]
Brush, A. J. Bernheim [1 ]
Priyantha, Bodhi [1 ]
Karlson, Amy K. [1 ]
Liu, Jie [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
来源
PERVASIVE COMPUTING | 2011年 / 6696卷
关键词
Continuous audio sensing; mobile phones; speaker identification; energy efficiency; heterogeneous multi-processor hardware;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatically identifying the person you are talking with using continuous audio sensing has the potential to enable many pervasive computing applications from memory assistance to annotating life logging data. However, a number of challenges, including energy efficiency and training data acquisition, must be addressed before unobtrusive audio sensing is practical on mobile devices. We built SpeakerSense, a speaker identification prototype that uses a heterogeneous multi-processor hardware architecture that splits computation between a low power processor and the phone's application processor to enable continuous background sensing with minimal power requirements. Using SpeakerSense, we benchmarked several system parameters (sampling rate, GMM complexity, smoothing window size, and amount of training data needed) to identify thresholds that balance computation cost with performance. We also investigated channel compensation methods that make it feasible to acquire training data from phone calls and an automatic segmentation method for training speaker models based on one-to-one conversations.
引用
收藏
页码:188 / 205
页数:18
相关论文
共 50 条
  • [21] An Improved GMM-based Clustering Algorithm for Efficient Speaker Identification
    Lin, Wenyong
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1490 - 1493
  • [22] Efficient speaker identification from speech transmitted over Bluetooth networks
    Khalil, Ali
    Elnaby, Mustafa
    Saad, Elsayed
    Al-Nahari, Azzam
    Al-Zubi, Nayel
    El-Bendary, Mohsen
    El-Samie, Fathi
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (04) : 409 - 416
  • [23] Study of Energy Absorption Field Emitted by Generic Mobile Phones at 900 MHz Frequency
    Eduard, Jeler Grigore
    Moraru, Calalin
    Mihai, Radu
    2012 9TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2012, : 267 - 270
  • [24] Deliberation for Intuition: A Framework for Energy-Efficient Trip Detection on Cellular Phones
    Jiang, Yifei
    Li, Du
    Yang, Guang
    Lv, Qin
    Liu, Zhigang
    UBICOMP'11: PROCEEDINGS OF THE 2011 ACM INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING, 2011, : 315 - 324
  • [25] Statistical Analysis of Energy Consumption of Mobile Phones for Web-Based Applications in Mauritius
    Fowdur, T. P.
    Hurbungs, V.
    Beeharry, Y.
    2016 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2016,
  • [26] A Survey on Energy Efficient Cellular Mobile Communication
    Karmakar, Purnendu
    Rajakumar, R., V
    Roy, Rajarshi
    WIRELESS PERSONAL COMMUNICATIONS, 2021, 120 (02) : 1475 - 1500
  • [27] Efficient cancelable speaker identification system based on a hybrid structure of DWT and SVD
    Abdelwahab, Khaled M.
    Abd El-atty, Saied
    Brisha, Ayman M.
    Abd El-Samie, Fathi E.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 279 - 288
  • [28] Two-stage decision for short utterance speaker identification in mobile telecommunication environment
    Zheng, HS
    Yang, YC
    Wu, ZH
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 547 - 551
  • [29] Energy Efficient Multipath TCP for Mobile Devices
    Peng, Qiuyu
    Chen, Minghua
    Walid, Anwar
    Low, Steven
    MOBIHOC'14: PROCEEDINGS OF THE 15TH ACM INTERNATIONAL SYMPOSIUM ON MOBILE AD HOC NETWORKING AND COMPUTING, 2014, : 257 - 266
  • [30] COMMUNICATE GREEN Energy Efficient Mobile Communication
    Uzun, Abdulbaki
    Kuepper, Axel
    Einsiedler, Hans J.
    PECCS 2011: PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON PERVASIVE AND EMBEDDED COMPUTING AND COMMUNICATION SYSTEMS, 2011, : 302 - 305