Development of VGG-16 transfer learning framework for geographical landmark recognition

被引:2
作者
Bansal, Kanishk [1 ]
Singh, Amar [1 ]
机构
[1] Lovely Profess Univ, Dept Comp Applicat, Phagwara, Punjab, India
来源
INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS | 2023年 / 17卷 / 03期
关键词
Geographical landmark recognition; VGG-16; transfer learning; data augmentation; computer vision; IDENTIFICATION; BEHAVIOR;
D O I
10.3233/IDT-230048
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Computer vision mandates the development of landmark recognition paradigms for efficient Image Recognition. In this article, the concept of Visual Geometry Group Network (VGG-16) transfer learning is used to develop an AI model over a geographical landmarks' dataset. The dataset is a small part of Google Landmarks dataset V2 which originally consists of over 4M images. A VGG-16 model trained on ImageNet dataset is used to transfer knowledge. A positive transfer of knowledge is seen and it was observed that the trained model was a highly generalized model for our dataset. Not only a training accuracy of more than 0.85 is observed but equivalent validation accuracy suggests that the developed model is highly robust with minimal overfitting. A comparison of our proposed approach was made with classical machine learning techniques like KNN (K Nearest Neighbor), Decision Trees, Random Forest, SVM (Support Vector Machines) and a 3 Layered CNN. The results clearly depict that the proposed approach outperforms all other machine learning classifiers in consideration.
引用
收藏
页码:799 / 810
页数:12
相关论文
共 41 条
[1]  
Abadi M, 2016, ACM SIGPLAN NOTICES, V51, P1, DOI [10.1145/2951913.2976746, 10.1145/3022670.2976746]
[2]  
Bansal K, 2022, AIP C P, P50007
[3]  
Bansal K., 2021, MACHINE LEARNING DAT, P64
[4]   Automated evolution of CNN with 3PGA for geographical landmark recognition [J].
Bansal, Kanishk ;
Singh, Amar .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (01) :745-756
[5]   Evolving CNN with Paddy Field Algorithm for Geographical Landmark Recognition [J].
Bansal, Kanishk ;
Singh, Amar ;
Verma, Sahil ;
Kavita ;
Jhanjhi, Noor Zaman ;
Shorfuzzaman, Mohammad ;
Masud, Mehedi .
ELECTRONICS, 2022, 11 (07)
[6]  
Chen DM, 2011, PROC CVPR IEEE, P737, DOI 10.1109/CVPR.2011.5995610
[7]  
Ghosh S, 2021, MACHINE LEARNING DAT, P75
[8]   Array programming with NumPy [J].
Harris, Charles R. ;
Millman, K. Jarrod ;
van der Walt, Stefan J. ;
Gommers, Ralf ;
Virtanen, Pauli ;
Cournapeau, David ;
Wieser, Eric ;
Taylor, Julian ;
Berg, Sebastian ;
Smith, Nathaniel J. ;
Kern, Robert ;
Picus, Matti ;
Hoyer, Stephan ;
van Kerkwijk, Marten H. ;
Brett, Matthew ;
Haldane, Allan ;
del Rio, Jaime Fernandez ;
Wiebe, Mark ;
Peterson, Pearu ;
Gerard-Marchant, Pierre ;
Sheppard, Kevin ;
Reddy, Tyler ;
Weckesser, Warren ;
Abbasi, Hameer ;
Gohlke, Christoph ;
Oliphant, Travis E. .
NATURE, 2020, 585 (7825) :357-362
[9]   Distributed graph coloring: an approach based on the calling behavior of Japanese tree frogs [J].
Hernandez, Hugo ;
Blum, Christian .
SWARM INTELLIGENCE, 2012, 6 (02) :117-150
[10]   Matplotlib: A 2D graphics environment [J].
Hunter, John D. .
COMPUTING IN SCIENCE & ENGINEERING, 2007, 9 (03) :90-95