PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation

被引:89
作者
Li, Xiangtai [1 ]
He, Hao [2 ,3 ]
Li, Xia [4 ]
Li, Duo [5 ]
Cheng, Guangliang [6 ,8 ]
Shi, Jianping [6 ,7 ]
Weng, Lubin [2 ]
Tong, Yunhai [1 ]
Lin, Zhouchen [1 ]
机构
[1] Peking Univ, Key Lab Machine Percept MOE, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Automat, NLPR, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[4] Swiss Fed Inst Technol, Zurich, Switzerland
[5] HKUST, Hong Kong, Peoples R China
[6] SenseTime Res, Hong Kong, Peoples R China
[7] SJTU, Qing Yuan Res Inst, Shanghai, Peoples R China
[8] Shanghai AI Lab, Shanghai, Peoples R China
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
D O I
10.1109/CVPR46437.2021.00420
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aerial Image Segmentation is a particular semantic segmentation problem and has several challenging characteristics that general semantic segmentation does not have. There are two critical issues: The one is an extremely foreground-background imbalanced distribution, and the other is multiple small objects along with the complex background. Such problems make the recent dense affinity context modeling perform poorly even compared with baselines due to over-introduced background context. To handle these problems, we propose a point-wise affinity propagation module based on the Feature Pyramid Network (FPN) framework, named PointFlow. Rather than dense affinity learning, a sparse affinity map is generated upon selected points between the adjacent features, which reduces the noise introduced by the background while keeping efficiency. In particular, we design a dual point matcher to select points from the salient area and object boundaries, respectively. Experimental results on three different aerial segmentation datasets suggest that the proposed method is more effective and efficient than state-of-the-art general semantic segmentation methods. Especially, our methods achieve the best speed and accuracy trade-off on three aerial benchmarks. Further experiments on three general semantic segmentation datasets prove the generality of our method. Code and models are made available (https://github. com/lxtGH/PFSegNets).
引用
收藏
页码:4215 / 4224
页数:10
相关论文
共 68 条
[1]  
Adelson E., 1984, RCA Eng., V29, P33
[2]  
[Anonymous], 2017, CVPR
[3]  
[Anonymous], on computer vision and pattern recognition CVPR
[4]  
[Anonymous], 2018, COMP VIS ECCV 2018 W, DOI DOI 10.1163/9789004385580002
[5]  
[Anonymous], 2018, CVPR, DOI DOI 10.1109/CVPR.2018.00132
[6]  
[Anonymous], 2018, NeurIPS
[7]  
[Anonymous], 2018, CVPR, DOI DOI 10.1109/CVPR.2018.00690
[8]  
[Anonymous], 2018, P EUROPEAN C COMPUTE
[9]  
Badrinarayanan V., 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence, DOI DOI 10.1109/TPAMI.2016.2644615
[10]   RoadTracer: Automatic Extraction of Road Networks from Aerial Images [J].
Bastani, Favyen ;
He, Songtao ;
Abbar, Sofiane ;
Alizadeh, Mohammad ;
Balakrishnan, Hari ;
Chawla, Sanjay ;
Madden, Sam ;
DeWitt, David .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4720-4728