RAIF: A deep learning-based architecture for multi-modal aesthetic biometric system

被引:2
作者
Iffath, Fariha [1 ]
Gavrilova, Marina [1 ]
机构
[1] Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
aesthetics; deep learning; intermediate fusion; machine learning; multi-modal biometrics; residual network; virtual humans; IMAGE;
D O I
10.1002/cav.2163
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Human aesthetics play a significant role in video game development, emotional-aware robot design, online recommender systems, digital human, and other domains of research focusing on human-computer interactions. Social network user recognition based on aesthetic preferences is an emerging research domain. In this paper, a novel deep learning architecture is proposed for multi-modal audio-visual person identification that combines audio and visual aesthetic features. A pre-trained ResNet architecture is utilized to extract high-level features from a set of user-preferred audio and image samples. A novel deep learning-based fusion technique called residual-aided intermediate fusion (RAIF) is introduced in order to effectively merge the audio and visual features. The proposed RAIF method achieved an accuracy of 98% and a loss of 0.01 on a proprietary multi-modal dataset, indicating its effectiveness in fusing audio and visual information.
引用
收藏
页数:11
相关论文
共 24 条
  • [1] Almohammad MS., 2013, INT J COMPUT SCI TEL, P19
  • [2] Aristidou A, 2021, Arxiv, DOI arXiv:2111.12159
  • [3] Azam S., 2017, ADV ARTIFICIAL INTEL
  • [4] AestheticNet: deep convolutional neural network for person identification from visual aesthetic
    Bari, A. S. M. Hossain
    Sieu, Brandon
    Gavrilova, Marina L.
    [J]. VISUAL COMPUTER, 2020, 36 (10-12) : 2395 - 2405
  • [5] A Brief Survey of Color Image Preprocessing and Segmentation Techniques
    Bhattacharyya, Siddhartha
    [J]. JOURNAL OF PATTERN RECOGNITION RESEARCH, 2011, 6 (01): : 120 - 129
  • [6] Chengfang Zhang, 2020, Advances in 3D Image and Graphics Representation, Analysis, Computing and Information Technology. Algorithms and Applications. Proceedings of IC3DIT 2019. Smart Innovation, Systems and Technologies (SIST 180), P159, DOI 10.1007/978-981-15-3867-4_19
  • [7] Ortega JDS, 2019, Arxiv, DOI arXiv:1907.03196
  • [8] Defferrard M, 2017, Arxiv, DOI [arXiv:1612.01840, DOI 10.48550/ARXIV.1612.01840]
  • [9] Gavrilova ML, 2017, STUD COMPUT INTELL, V691, P229, DOI 10.1007/978-3-319-44257-0_10
  • [10] Multimodal data fusion for systems improvement: A review
    Gaw, Nathan
    Yousefi, Safoora
    Gahrooei, Mostafa Reisi
    [J]. IISE TRANSACTIONS, 2022, 54 (11) : 1098 - 1116