Pre-trained VGGNet Architecture for Remote-Sensing Image Scene Classification
被引:0
作者:
Muhammad, Usman
论文数: 0引用数: 0
h-index: 0
机构:
Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing 100049, Peoples R ChinaUniv Chinese Acad Sci, Sch Comp & Control Engn, Beijing 100049, Peoples R China
Muhammad, Usman
[1
]
Wang, Weiqiang
论文数: 0引用数: 0
h-index: 0
机构:
Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing 100049, Peoples R ChinaUniv Chinese Acad Sci, Sch Comp & Control Engn, Beijing 100049, Peoples R China
Wang, Weiqiang
[1
]
Chattha, Shahbaz Pervaiz
论文数: 0引用数: 0
h-index: 0
机构:
Yanbu Univ Coll, Yanbu, Saudi ArabiaUniv Chinese Acad Sci, Sch Comp & Control Engn, Beijing 100049, Peoples R China
Chattha, Shahbaz Pervaiz
[2
]
Ali, Sajid
论文数: 0引用数: 0
h-index: 0
机构:
Univ Educ, Lahore Campus, Dg Khan, PakistanUniv Chinese Acad Sci, Sch Comp & Control Engn, Beijing 100049, Peoples R China
Ali, Sajid
[3
]
机构:
[1] Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing 100049, Peoples R China
[2] Yanbu Univ Coll, Yanbu, Saudi Arabia
[3] Univ Educ, Lahore Campus, Dg Khan, Pakistan
来源:
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR)
|
2018年
基金:
国家重点研发计划;
关键词:
FUSION;
D O I:
暂无
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
The visual geometry group network (VGGNet) is used widely for image classification and has proven to be very effective method. Most existing approaches use features of just one type, and traditional fusion methods generally use multiple manually created features. However, to get the benefits of multi-layer features remain a significant challenge in the remote-sensing domain. To address this challenge, we present a simple yet powerful framework based on canonical correlation analysis and 4-layer SVM classifier. Specifically, the pretrained VGGNet is employed as a deep feature extractor to extract mid-level and deep features for remote-sensing scene images. We then choose two convolutional (mid-level) and two fully-connected layers produced by VGGNet in which each layer is treated as a separated feature descriptor. Next, canonical correlation analysis (CCA) is used as a feature fusion strategy to refine the extracted features, and to fuse them with more discriminative power. Finally, the support vector machine (SVM) classifier is used to construct the 4-layer representation of the scenes images. Experimenting on a UC Merced and WHU-RS datasets, demonstrate that the proposed approach, even without data augmentation, fine tuning or coding strategy, has a superior performance than state-of-the-art methods used now.