Visual Recognition Based on Deep Learning for Navigation Mark Classification

Cited by: 49
Authors
Pan, Mingyang [1]
Liu, Yisai [1]
Cao, Jiayi [1]
Li, Yu [2]
Li, Chao [1]
Chen, Chi-Hua [3]
Affiliations
[1] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
[2] Changjiang Nanjing Waterway Bur, Nanjing 210011, Peoples R China
[3] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350108, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Navigation; Marine vehicles; Image recognition; Image classification; Visualization; Machine learning; Convolutional neural networks; Deep learning; image classification; multi-scale attention; navigation marks; ResNet; NEURAL-NETWORKS;
DOI
10.1109/ACCESS.2020.2973856
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recognizing objects from camera images is an important research area for smart ships and intelligent navigation. This paper focuses on navigation marks, which indicate features of the navigational environment in sea transportation (e.g., channels, special areas, wrecks). A fine-grained deep learning classification model named RMA (ResNet-Multiscale-Attention) is proposed to analyse the subtle, local differences among navigation mark types. In the RMA model, an attention mechanism based on the fusion of feature maps at three scales locates attention regions and captures the discriminative characteristics needed to distinguish similar navigation marks. Experimental results on a dataset of 10260 navigation mark images showed that the RMA achieves an accuracy of about 96% in classifying 42 types of navigation marks, outperforming the ResNet-50 model, whose accuracy is about 94%. Visualization analyses showed that the RMA model can extract the attention regions and characteristics of navigation marks.
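The abstract gives no implementation details, so the following is only a toy sketch of the general idea of fusing feature maps at three scales into an attention mask. It is not the RMA model. The function names, the nearest-neighbour resizing, the mean-then-sigmoid fusion, and the sample values are all assumptions made for illustration.

```python
import math

def upsample_nearest(fmap, size):
    """Nearest-neighbour resize of a square 2-D list to size x size (assumed scheme)."""
    n = len(fmap)
    return [[fmap[i * n // size][j * n // size] for j in range(size)]
            for i in range(size)]

def fuse_attention(fmaps):
    """Fuse square feature maps of different scales into one mask with values in (0, 1)."""
    size = max(len(f) for f in fmaps)
    resized = [upsample_nearest(f, size) for f in fmaps]
    # Element-wise mean across scales, then a sigmoid squashes it into (0, 1).
    return [[1.0 / (1.0 + math.exp(-sum(r[i][j] for r in resized) / len(resized)))
             for j in range(size)]
            for i in range(size)]

def apply_attention(fmap, mask):
    """Re-weight a feature map element-wise by the attention mask."""
    return [[v * m for v, m in zip(row, mask_row)]
            for row, mask_row in zip(fmap, mask)]

# Hypothetical single-channel maps at three scales (4x4, 2x2, 1x1); values are made up.
fine = [[0.0, 0.0, 0.0, 0.0],
        [0.0, 4.0, 4.0, 0.0],
        [0.0, 4.0, 4.0, 0.0],
        [0.0, 0.0, 0.0, 0.0]]
mid = [[1.0, 1.0],
       [1.0, 1.0]]
coarse = [[1.0]]

mask = fuse_attention([fine, mid, coarse])
attended = apply_attention(fine, mask)
```

In this toy setup the mask is larger at the centre of the map, where the fine-scale response is strong, than at the corners, which is the sense in which a fused mask can highlight an attention region.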
Pages: 32767-32775
Number of pages: 9
Related papers
50 in total
  • [21] Comparative Analysis of Classical Machine Learning and Deep Learning Methods for Fruit Image Recognition and Classification
    Salim, Nareen O. M.
    Mohammed, Ahmed Khorsheed
    TRAITEMENT DU SIGNAL, 2024, 41 (03) : 1331 - 1343
  • [22] An Ensemble Algorithm Based on Deep Learning for Tuberculosis Classification
    Hernandez, Alfonso
    Panizo, Angel
    Camacho, David
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2019, PT I, 2019, 11871 : 145 - 154
  • [23] A Classification Model for the Prostate Cancer Based on Deep Learning
    Liu, Ying
    An, Xiaomei
    2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,
  • [24] Deep Learning based Automated Food Image Classification
    Sumanth, Mukkamala
    Reddy, Anumula Hemanth
    Abhishek, Dandu
    Balaji, Sanaka Venkata
    Amarendra, K.
    Srinivas, P. V. V. S.
    2024 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS, ICICI 2024, 2024, : 103 - 107
  • [25] Android Smartphone based Visual Object Recognition for Visually Impaired using Deep Learning
    Parikh, Neel
    Shah, Ishita
    Vahora, Safvan
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 420 - 425
  • [26] Texture Recognition and Classification Based on Deep Learning
    Zhu, Gaoming
    Li, Bingchan
    Hong, Shuai
    Mao, Bo
    2018 SIXTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2018, : 344 - 348
  • [27] A Deep Learning Model Based on Multi-Objective Particle Swarm Optimization for Scene Classification in Unmanned Aerial Vehicles
    Rajagopal, Aghila
    Joshi, Gyanendra Prasad
    Ramachandran, A.
    Subhalakshmi, R. T.
    Khari, Manju
    Jha, Sudan
    Shankar, K.
    You, Jinsang
    IEEE ACCESS, 2020, 8 (08) : 135383 - 135393
  • [28] Driver Drowsiness Detection System using Deep Learning based on Visual Facial Features
    Mahmoud, Mahamad Salah
    Jarndal, Anwar
    Alzghoul, Ahmad
    Almahasneh, Hossam
    Alsyouf, Imad
    Hamid, Abdul Kadir
    2021 14TH INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE), 2021, : 453 - 458
  • [29] Machine Learning based Classification for Fire and Smoke Images Recognition
    Jabnouni, Hedi
    Arfaoui, Imen
    Cherni, Mohamed Ali
    Bouchouicha, Moez
    Sayadi, Mounir
    2022 8TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'22), 2022, : 425 - 430
  • [30] Deep learning based on LSTM model for enhanced visual odometry navigation system
    Deraz, Ashraf A.
    Badawy, Osama
    Elhosseini, Mostafa A.
    Mostafa, Mostafa
    Ali, Hesham A.
    El-Desouky, Ali I.
    AIN SHAMS ENGINEERING JOURNAL, 2023, 14 (08)