Fine-grained recognition of maritime vessels and land vehicles by deep feature embedding

被引:12
作者
Solmaz, Berkan [1 ]
Gundogdu, Erhan [1 ,2 ]
Yucesoy, Veysel [1 ]
Koc, Aykut [1 ]
Alatan, Abdullah Aydin [3 ,4 ]
机构
[1] Aselsan Res Ctr, Ankara, Turkey
[2] Ecole Polytech Fed Lausanne, Comp Vis Lab, Lausanne, Switzerland
[3] Middle East Tech Univ, Dept Elect & Elect Engn, Ankara, Turkey
[4] Middle East Tech Univ, Ctr Image Anal OGAM, Ankara, Turkey
关键词
marine vehicles; image classification; object recognition; learning (artificial intelligence); statistical analysis; traffic engineering computing; video retrieval; fine-grained maritime vessel recognition; fine-grained land vehicle recognition; deep feature embedding; large-scale image analysis; large-scale video analysis; visual surveillance systems; deep learning-based approaches; computer vision problems; fine-grained object recognition; maritime vessel classification; maritime vessel identification; land vehicle classification; land vehicle identification; visual recognition; coarse-grained classification task; fine-grained classification task; coarse-grained retrieval task; fine-grained retrieval task; verification task; multitask learning framework; loss function; global statistics; hierarchical individual sample label; data pairs; MARVEL data set; Stanford Cars data set; IMAGE SIMILARITY;
D O I
10.1049/iet-cvi.2018.5187
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in large-scale image and video analysis have empowered the potential capabilities of visual surveillance systems. In particular, deep learning-based approaches bring in substantial benefits in solving certain computer vision problems such as fine-grained object recognition. Here, the authors mainly concentrate on classification and identification of maritime vessels and land vehicles, which are the key constituents of visual surveillance systems. Employing publicly available data sets for maritime vessels and land vehicles, the authors aim to improve visual recognition. Specifically, the authors focus on five tasks regarding visual recognition; coarse-grained classification, fine-grained classification, coarse-grained retrieval, fine-grained retrieval, and verification. To increase the performance in these tasks, the authors utilise a multi-task learning framework and present a novel loss function which simultaneously considers deep feature learning and classification by exploiting the available hierarchical labels of individual samples and the global statistics of distances between the data pairs. The authors observe that the proposed multi-task learning model improves the fine-grained recognition performance on MARVEL and Stanford Cars data sets, compared to training of a model targeting a single recognition task.
引用
收藏
页码:1121 / 1132
页数:12
相关论文
共 56 条
[31]   3D Object Representations for Fine-Grained Categorization [J].
Krause, Jonathan ;
Stark, Michael ;
Deng, Jia ;
Li Fei-Fei .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, :554-561
[32]   Rotational Invariant Dimensionality Reduction Algorithms [J].
Lai, Zhihui ;
Xu, Yong ;
Yang, Jian ;
Shen, Linlin ;
Zhang, David .
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (11) :3733-3746
[33]   Approximate Orthogonal Sparse Embedding for Dimensionality Reduction [J].
Lai, Zhihui ;
Wong, Wai Keung ;
Xu, Yong ;
Yang, Jian ;
Zhang, David .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (04) :723-735
[34]   Local sparse representation projections for face recognition [J].
Lai, Zhihui ;
Li, Yajing ;
Wan, Minghua ;
Jin, Zhong .
NEURAL COMPUTING & APPLICATIONS, 2013, 23 (7-8) :2231-2239
[35]   Locality preserving embedding for face and handwriting digital recognition [J].
Lai, Zhihui ;
Wan, MingHua ;
Jin, Zhong .
NEURAL COMPUTING & APPLICATIONS, 2011, 20 (04) :565-573
[36]  
Liu JX, 2012, LECT NOTES COMPUT SC, V7572, P172, DOI 10.1007/978-3-642-33718-5_13
[37]   Distinctive image features from scale-invariant keypoints [J].
Lowe, DG .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110
[38]   A pool of multiple person re-identification experts [J].
Martinel, Niki ;
Micheloni, Christian ;
Foresti, Gian Luca .
PATTERN RECOGNITION LETTERS, 2016, 71 :23-30
[39]  
Nowak E, 2006, LECT NOTES COMPUT SC, V3954, P490
[40]   On lines and planes of closest fit to systems of points in space. [J].
Pearson, Karl .
PHILOSOPHICAL MAGAZINE, 1901, 2 (7-12) :559-572