Unsupervised Canine Emotion Recognition Using Momentum Contrast

Cited by: 2
Authors
Bhave, Aarya [1]
Hafner, Alina [2]
Bhave, Anushka [1]
Gloor, Peter A. [1]
Affiliations
[1] MIT, MIT Syst Design & Management, 77 Massachusetts Ave, Cambridge, MA 02142 USA
[2] Tech Univ Munich, TUM Sch Computat Informat & Technol, Arcisstr 21, D-80333 Munich, Germany
Keywords
contrastive learning; momentum contrast; Panksepp seven emotions; canine emotions; unsupervised learning; representation; dogs
DOI
10.3390/s24227324
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
We describe a system for identifying dog emotions based on dogs' facial expressions and body posture. Towards that goal, we built a dataset of 2184 images of ten popular dog breeds, grouped into seven similarly sized primal mammalian emotion categories defined by neuroscientist and psychobiologist Jaak Panksepp: 'Exploring', 'Sadness', 'Playing', 'Rage', 'Fear', 'Affectionate' and 'Lust'. We modified the contrastive learning framework MoCo (Momentum Contrast for Unsupervised Visual Representation Learning) to train it on our original dataset and achieved an accuracy of 43.2%, against a baseline of 14%. Trained on a second, publicly available dataset, the model reached an accuracy of 48.46%, against a baseline of 25%. We compared our unsupervised approach with a supervised model based on a ResNet50 architecture. This supervised model, when tested on our dataset with the seven Panksepp labels, reached an accuracy of 74.32%.
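As a rough illustration of the Momentum Contrast idea named in the abstract, the sketch below shows the two core ingredients of generic MoCo: the momentum update of the key encoder and the InfoNCE contrastive loss for a single query. It is not the authors' modified version. The function names, the flat parameter lists, and the default values `m=0.999` and `temperature=0.07` are assumptions chosen for illustration and are not stated in this record.

```python
import math


def momentum_update(query_params, key_params, m=0.999):
    """Return key-encoder parameters after one momentum step.

    Each key parameter moves slowly towards the matching query parameter:
    k <- m * k + (1 - m) * q. Parameters are plain floats here (assumption).
    """
    return [m * k + (1.0 - m) * q for q, k in zip(query_params, key_params)]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def info_nce_loss(query, positive_key, negative_keys, temperature=0.07):
    """InfoNCE loss for one query: cross-entropy with the positive at index 0."""
    logits = [dot(query, positive_key) / temperature]
    logits += [dot(query, k) / temperature for k in negative_keys]
    max_logit = max(logits)  # subtract the max for numerical stability
    log_sum_exp = max_logit + math.log(
        sum(math.exp(value - max_logit) for value in logits)
    )
    return log_sum_exp - logits[0]
```

The loss is small when the query is closer to its positive key than to the negatives held in the queue, and large otherwise. The slow momentum update keeps the keys in that queue consistent across training steps.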
Pages: 22