Generative adversarial network based data augmentation and quantum based convolution neural network for the classification of Indian classical dance forms

被引:0
作者
Challapalli, Jhansi Rani [1 ]
Devarakonda, Nagaraju [2 ]
机构
[1] Koneru Lakshmaiah Educ Fdn, Dept Comp Sci & Engn, Vaddeswaram 522502, Andhra Pradesh, India
[2] VIT AP Univ, Sch Comp Sci & Engn, Amaravati, Andhra Pradesh, India
关键词
Quantum convolution neural network; data augmentation; generative adversarial network; Indian classical dance; transfer learning;
D O I
10.3233/JIFS-231183
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The study addresses the challenges of human action recognition and analysis in computer vision, with a focus on classifying Indian dance forms. The complexity of these dance styles, including variations in body postures and hand gestures, makes classification difficult. Deep learning models require large datasets for good performance, so standard data augmentation techniques are used to increase model generalizability. The study proposes the Indian Classical Dance Generative Adversarial Network (ICD-GAN) for augmentation and the quantum-based Convolutional Neural Network (QCNN) for classification. The research consists of three phases: traditional augmentation, GAN-based augmentation, and a combination of both. The proposed QCNN is introduced to reduce computational time. Different GAN variants DC-GAN, CGAN, MFCGAN are employed for augmentation, while transfer learning-based CNN models VGG-16, VGG-19, MobileNet-v2, ResNet-50, and new QCNN are implemented for classification. The study demonstrates that GAN-based augmentation outperforms traditional methods, and QCNN reduces computational complexity while improving prediction accuracy. The proposed method achieves a precision rate of 98.7% as validated through qualitative and quantitative analysis. It provides a more effective and efficient approach compared to existing methods for Indian dance form classification.
引用
收藏
页码:6107 / 6125
页数:19
相关论文
共 55 条
  • [1] Yeh RA, 2017, Arxiv, DOI arXiv:1607.07539
  • [2] MFC-GAN: Class-imbalanced dataset classification using Multiple Fake Class Generative Adversarial Network
    Ali-Gombe, Adamu
    Elyan, Eyad
    [J]. NEUROCOMPUTING, 2019, 361 : 212 - 221
  • [3] Antoniou A, 2018, Arxiv, DOI arXiv:1711.04340
  • [4] Arjovsky M, 2017, PR MACH LEARN RES, V70
  • [5] Traffic Signs Detection for Real-World Application of an Advanced Driving Assisting System Using Deep Learning
    Ayachi, Riadh
    Afif, Mouna
    Said, Yahia
    Atri, Mohamed
    [J]. NEURAL PROCESSING LETTERS, 2020, 51 (01) : 837 - 851
  • [6] Indian dance form recognition from videos
    Bisht, Ankita
    Bora, Riya
    Saini, Goutam
    Shukla, Pushkar
    Raman, Balasubrmanian
    [J]. 2017 13TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY AND INTERNET-BASED SYSTEMS (SITIS), 2017, : 123 - 128
  • [7] Brock A, 2019, Arxiv, DOI [arXiv:1809.11096, DOI 10.48550/ARXIV.1809.11096]
  • [8] A novel approach for optimization of convolution neural network with hybrid particle swarm and grey wolf algorithm for classification of Indian classical dances
    Challapalli, Jhansi Rani
    Devarakonda, Nagaraju
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (09) : 2411 - 2434
  • [9] Reprise: A Design Tool for Specifying, Generating, and Customizing 3D Printable Adaptations on Everyday Objects
    Chen, Xiang 'Anthony'
    Kim, Jeeeun
    Mankofr, Jennifer
    Grossman, Tovi
    Coros, Stelian
    Hudson, Scott E.
    [J]. UIST 2016: PROCEEDINGS OF THE 29TH ANNUAL SYMPOSIUM ON USER INTERFACE SOFTWARE AND TECHNOLOGY, 2016, : 29 - 39
  • [10] Fine-grained attention mechanism for neural machine translation
    Choi, Heeyoul
    Cho, Kyunghyun
    Bengio, Yoshua
    [J]. NEUROCOMPUTING, 2018, 284 : 171 - 176