Leveraging transfer learning techniques for classifying infant vocalizations

被引:5
|
作者
Gujral, Aditya [1 ]
Feng, Kexin [1 ]
Mandhyan, Gulshan [1 ]
Snehil, Nfn [1 ]
Chaspari, Theodora [1 ]
机构
[1] Texas A&M Univ, Comp Sci & Engn, College Stn, TX 77845 USA
来源
2019 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL & HEALTH INFORMATICS (BHI) | 2019年
关键词
Infant vocalization; transfer learning; neural network fine-tuning; Google AudioSet; OxVoc Sounds;
D O I
10.1109/bhi.2019.8834666
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Infant vocalizations serve various communicative functions and are related to several developmental factors. Different types of vocalizations depict distinct spectro-temporal patterns, which can be recovered and learned using emerging end-to-end machine learning systems. A common problem in such systems is the limited availability of labelled data preventing reliable training. Transfer learning can be used to mitigate this problem by taking advantage of additional data resources relevant to the problem of interest. We propose a transfer learning framework which relies on neural network fine-tuning, and explore various types of architectures, such as a convolutional neural network (CNN) and long-term-short-memory (LSTM) recurrent neural networks with and without an attention mechanism. Our target data come from the Cry Recognition In Early Development (CRIED), while the source data come from three publicly available resources: the Oxford Vocal (OxVoc) Sounds database, the Google AudioSet, and the Freesound repository. Our results indicate that the neural network architectures trained with the proposed transfer learning approach outperform the corresponding networks solely trained on the target data, as well as neural networks pre-trained on large-scale image datasets and adapted to the target data (e.g., VGG16). These suggest the effectiveness of adaptation techniques combined with appropriate publicly available datasets for mitigating the limited availability of labelled data in human-related applications.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Leveraging Transfer Learning in Multiple Human Activity Recognition Using WiFi Signal
    Arshad, Sheheryar
    Feng, Chunhai
    Yu, Ruiyun
    Liu, Yonghe
    2019 IEEE 20TH INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (WOWMOM), 2019,
  • [42] CrossCount: Efficient Device-Free Crowd Counting by Leveraging Transfer Learning
    Khan, Danista
    Ho, Ivan Wang-Hei
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (05) : 4049 - 4058
  • [43] Leveraging Transfer Learning for Hate Speech Detection in Portuguese Social Media Posts
    Ramos, Gil
    Batista, Fernando
    Ribeiro, Ricardo
    Fialho, Pedro
    Moro, Sergio
    Fonseca, Antonio
    Guerra, Rita
    Carvalho, Paula
    Marques, Catarina
    Silva, Claudia
    IEEE ACCESS, 2024, 12 : 101374 - 101389
  • [44] Leveraging the Feature Distribution in Transfer-Based Few-Shot Learning
    Hu, Yuqing
    Gripon, Vincent
    Pateux, Stephane
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 487 - 499
  • [45] Leveraging anatomical information to improve transfer learning in brain-computer interfaces
    Wronkiewicz, Mark
    Larson, Eric
    Lee, Adrian K. C.
    JOURNAL OF NEURAL ENGINEERING, 2015, 12 (04)
  • [46] Classifying histopathological images of oral squamous cell carcinoma using deep transfer learning
    Panigrahi, Santisudha
    Nanda, Bhabani Sankar
    Bhuyan, Ruchi
    Kumar, Kundan
    Ghosh, Susmita
    Swarnkar, Tripti
    HELIYON, 2023, 9 (03)
  • [47] Emotion-based Autism Spectrum Disorder Detection by Leveraging Transfer Learning and Machine Learning Algorithms
    Sarwani, I. Srilalita
    Bhaskari, D. Lalitha
    Bhamidipati, Sangeeta
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (05) : 566 - 574
  • [48] Detection of driver drowsiness using transfer learning techniques
    Prajwal Mate
    Ninad Apte
    Manish Parate
    Sanjeev Sharma
    Multimedia Tools and Applications, 2024, 83 : 35553 - 35582
  • [49] Detection of cerebral ischaemia using transfer learning techniques
    Anton-Munarriz, Cristina
    Pastor-Vargas, Rafael
    Haut, Juan M.
    Robles-Gomez, Antonio
    Paoletti, Mercedes E.
    Benitez-Andrades, Jose Alberto
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 589 - 594
  • [50] Transfer learning techniques for medical image analysis: A review
    Kora, Padmavathi
    Ooi, Chui Ping
    Faust, Oliver
    Raghavendra, U.
    Gudigar, Anjan
    Chan, Wai Yee
    Meenakshi, K.
    Swaraja, K.
    Plawiak, Pawel
    Acharya, U. Rajendra
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2022, 42 (01) : 79 - 107