Fine-grained recognition of maritime vessels and land vehicles by deep feature embedding

被引:12
作者
Solmaz, Berkan [1 ]
Gundogdu, Erhan [1 ,2 ]
Yucesoy, Veysel [1 ]
Koc, Aykut [1 ]
Alatan, Abdullah Aydin [3 ,4 ]
机构
[1] Aselsan Res Ctr, Ankara, Turkey
[2] Ecole Polytech Fed Lausanne, Comp Vis Lab, Lausanne, Switzerland
[3] Middle East Tech Univ, Dept Elect & Elect Engn, Ankara, Turkey
[4] Middle East Tech Univ, Ctr Image Anal OGAM, Ankara, Turkey
关键词
marine vehicles; image classification; object recognition; learning (artificial intelligence); statistical analysis; traffic engineering computing; video retrieval; fine-grained maritime vessel recognition; fine-grained land vehicle recognition; deep feature embedding; large-scale image analysis; large-scale video analysis; visual surveillance systems; deep learning-based approaches; computer vision problems; fine-grained object recognition; maritime vessel classification; maritime vessel identification; land vehicle classification; land vehicle identification; visual recognition; coarse-grained classification task; fine-grained classification task; coarse-grained retrieval task; fine-grained retrieval task; verification task; multitask learning framework; loss function; global statistics; hierarchical individual sample label; data pairs; MARVEL data set; Stanford Cars data set; IMAGE SIMILARITY;
D O I
10.1049/iet-cvi.2018.5187
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in large-scale image and video analysis have empowered the potential capabilities of visual surveillance systems. In particular, deep learning-based approaches bring in substantial benefits in solving certain computer vision problems such as fine-grained object recognition. Here, the authors mainly concentrate on classification and identification of maritime vessels and land vehicles, which are the key constituents of visual surveillance systems. Employing publicly available data sets for maritime vessels and land vehicles, the authors aim to improve visual recognition. Specifically, the authors focus on five tasks regarding visual recognition; coarse-grained classification, fine-grained classification, coarse-grained retrieval, fine-grained retrieval, and verification. To increase the performance in these tasks, the authors utilise a multi-task learning framework and present a novel loss function which simultaneously considers deep feature learning and classification by exploiting the available hierarchical labels of individual samples and the global statistics of distances between the data pairs. The authors observe that the proposed multi-task learning model improves the fine-grained recognition performance on MARVEL and Stanford Cars data sets, compared to training of a model targeting a single recognition task.
引用
收藏
页码:1121 / 1132
页数:12
相关论文
共 56 条
[1]   Efficient object detection and segmentation for fine-grained recognition [J].
Angelova, Anelia ;
Zhu, Shenghuo .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :811-818
[2]  
[Anonymous], P 11 INT C DISTR SMA
[3]  
[Anonymous], 2017, IPSJ Trans. Comput. Vis. Appl
[4]  
[Anonymous], COMPUT VIS PATTERN R
[5]  
[Anonymous], MATCONVNET CONVOLUT
[6]  
[Anonymous], 2012, ADV NEURAL INFORM PR
[7]  
[Anonymous], MARVEL LARGE SCALE I
[8]  
[Anonymous], 2017, ARXIV170408345
[9]  
[Anonymous], 2014, BRIT C MACH VIS
[10]  
[Anonymous], 2015, P IEEE C COMP VIS PA