Disentangled Prototypical Convolutional Network for Few-Shot Learning in In-Vehicle Noise Classification

被引:3
作者
Kee, Robin Inho [1 ]
Nam, Dahyun [2 ]
Buu, Seok-Jun [3 ]
Cho, Sung-Bae [4 ]
机构
[1] Univ Michigan, Dept Mech Engn, Ann Arbor, MI 48109 USA
[2] Yonsei Univ, Dept Mech Engn, Seoul 03722, South Korea
[3] Gyeongsang Natl Univ, Dept Comp Sci, Jinju 52828, Gyeongsangnam, South Korea
[4] Yonsei Univ, Dept Comp Sci, Seoul 03722, South Korea
关键词
Noise measurement; Data models; Task analysis; Prototypes; Predictive models; Adaptation models; Feature extraction; Acoustics; Classification algorithms; Representation learning; Few-shot learning; Acoustic classification; representation learning; few-shot learning (FSL); in-vehicle noise; prototypical network; triplet loss;
D O I
10.1109/ACCESS.2024.3397842
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study addresses the persistent challenge of in-vehicle noise, a significant factor affecting customer satisfaction and safety in the automotive industry. Despite advancements in understanding various noise sources and mitigation strategies, vehicle noise continues to contribute to driver and passenger discomfort, impacting stress levels, fatigue, and overall quality of life. Recent research has made significant strides in classifying in-vehicle noise, yet the complexity of obtaining comprehensive and diverse datasets remains a major hurdle, given the variability and transient nature of these noises. To overcome these challenges, our research introduces an innovative approach using Few-shot Learning (FSL). We propose a unique FSL model that integrates a Triplet-trained Prototypical Network for the classification of in-vehicle noises. This model is particularly adept at learning robust feature representations from limited data. The application of triplet sampling and loss significantly enhances the model's ability to distinguish between various types of in-vehicle noises. Our methodology was rigorously tested using a specially curated dataset of in-vehicle noises, reflecting real-world diversity. The experimental results, obtained through 10-fold cross-validation, demonstrate an exceptional average accuracy of 96.81% on a 9-way 1-shot task. This level of accuracy, achieved with a limited amount of training data, not only attests to the effectiveness of our model but also marks a significant advancement in the field of acoustic classification. Our study's findings highlight the potential of FSL in addressing complex challenges in the automotive industry, paving the way for more effective noise reduction strategies and improved vehicle design.
引用
收藏
页码:66801 / 66808
页数:8
相关论文
共 30 条
[1]  
Alt N. W., 2001, Tech. Paper 01-1539
[2]  
Bu SJ, 2020, INT CONF ACOUST SPEE, P3057, DOI [10.1109/icassp40776.2020.9053916, 10.1109/ICASSP40776.2020.9053916]
[3]  
Bu SJ, 2019, IEEE INT CONF BIG DA, P3545, DOI 10.1109/BigData47090.2019.9005960
[4]   Frequency Guidance Matters in Few-Shot Learning [J].
Cheng, Hao ;
Yang, Siyuan ;
Zhou, Joey Tianyi ;
Guo, Lanqing ;
Wen, Bihan .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :11780-11790
[5]   A novel approach to optimizing and stabilizing interior noise quality in vehicles [J].
Cornish, R .
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2000, 214 (D7) :685-692
[6]  
Finn C, 2017, PR MACH LEARN RES, V70
[7]  
Garcia V., 2018, 6 INT C LEARN REPR I, P1
[8]  
Graves A, 2014, PR MACH LEARN RES, V32, P1764
[9]  
Graves A, 2013, INT CONF ACOUST SPEE, P6645, DOI 10.1109/ICASSP.2013.6638947
[10]   Virtual prompt pre-training for prototype-based few-shot relation extraction [J].
He, Kai ;
Huang, Yucheng ;
Mao, Rui ;
Gong, Tieliang ;
Li, Chen ;
Cambria, Erik .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213