Acoustic-based LEGO recognition using attention-based convolutional neural networks

被引:0
|
作者
Van-Thuan Tran
Chia-Yang Wu
Wei-Ho Tsai
机构
[1] National Taipei University of Technology,Department of Electronic Engineering
来源
Artificial Intelligence Review | 2024年 / 57卷
关键词
LEGO recognition; Acoustic-based object detection; Attention mechanism; Audio classification; Audio features; Convolutional neural networks; Time-distributed layers;
D O I
暂无
中图分类号
学科分类号
摘要
This work investigates the classification of LEGO types using deep learning-based audio classification approaches. The motivation for this investigation is based on the following assumption. If objects of the same shape fall freely from a certain height and hit a fixed plane, the impact sounds will be very similar, so we can distinguish the same types of objects from the others. Applying this idea to LEGO recognition, we collect impact sounds of 200 LEGO objects that fall from a height of about 30cm from a designated plane, and design a CNN-based recognition system that processes the impact sounds to determine the type of LEGO it belongs to. Recognizing that the fall of LEGO results in the main impact sound (i.e., only the sound at the moment of impact) and several subsequent sounds, we examine whether considering only the first impact sound or all sounds brings about better classification accuracies. We propose a compact two-dimensional CNN model, namely LegoNet, which is designed with a frame-level attention module at the input spectrogram and time-distributed fully-connected layers. Our experiments show that free-fall impact sounds can be used efficiently for accurate object recognition, and the proposed LegoNet, with a much smaller size, achieves better accuracy and robustness compared to baseline models. Also, using the whole sequence of impact sounds is more informative for LEGO classification than only considering the first impact sound. Moreover, it is found that utilizing data of specific object postures can help to improve the classifier’s performance in the case of small training data. The proposed approach can be employed as an extra module to build intelligent agents or object classification systems that require a rich understanding of the surrounding physical world.
引用
收藏
相关论文
共 50 条
  • [21] A Lightweight Attention-Based Convolutional Neural Networks for Fresh-Cut Flower Classification
    Fei, Yeqi
    Li, Zhenye
    Zhu, Tingting
    Ni, Chao
    IEEE ACCESS, 2023, 11 : 17283 - 17293
  • [22] Hybrid data augmentation and deep attention-based dilated convolutional-recurrent neural networks for speech emotion recognition
    Pham, Nhat Truong
    Dang, Duc Ngoc Minh
    Nguyen, Ngoc Duy
    Nguyen, Thanh Thi
    Nguyen, Hai
    Manavalan, Balachandran
    Lim, Chee Peng
    Nguyen, Sy Dzung
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 230
  • [23] Thank you for attention: A survey on attention-based artificial neural networks for automatic speech recognition
    Karmakar, Priyabrata
    Teng, Shyh Wei
    Lu, Guojun
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 23
  • [24] Early Lane Change Prediction for Automated Driving Systems Using Multi-Task Attention-Based Convolutional Neural Networks
    Mozaffari, Sajjad
    Arnold, Eduardo
    Dianati, Mehrdad
    Fallah, Saber
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 7 (03): : 758 - 770
  • [25] Chinese Dialects Identification Using Attention-Based Deep Neural Networks
    Qiu, Yuanhang
    Ma, Yong
    Jin, Yun
    Li, Shidang
    Gu, Mingliang
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463 : 2051 - 2058
  • [26] Attention-Based Convolutional Neural Network and Bidirectional Gated Recurrent Unit for Human Activity Recognition
    Tao, Shuai
    Zhao, Zhiqiang
    Qin, Jing
    Ji, Changqing
    Wang, Zumin
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1128 - 1134
  • [27] Interpreting sarcasm on social media using attention-based neural networks
    Keivanlou-Shahrestanaki, Zahra
    Kahani, Mohsen
    Zarrinkalam, Fattane
    KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [28] A global attention-based convolutional neural network for process prediction
    Chen, Yunfan
    Xing, Mali
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7373 - 7377
  • [29] Hybrid convolutional neural networks for articulatory and acoustic information based speech recognition
    Mitra, Vikramjit
    Sivaraman, Ganesh
    Nam, Hosung
    Espy-Wilson, Carol
    Saltzman, Elliot
    Tiede, Mark
    SPEECH COMMUNICATION, 2017, 89 : 103 - 112
  • [30] ABCNN-IDS: Attention-Based Convolutional Neural Network for Intrusion Detection in IoT Networks
    Momand, Asadullah
    Jan, Sana Ullah
    Ramzan, Naeem
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 136 (04) : 1981 - 2003