Joint Structured Bipartite Graph and Row-Sparse Projection for Large-Scale Feature Selection

被引:5
作者
Dong, Xia [1 ]
Nie, Feiping [2 ,3 ]
Wu, Danyang [4 ,5 ]
Wang, Rong [3 ,6 ]
Li, Xuelong [3 ,6 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[2] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect IOPEN, Sch Comp Sci, Xian 710072, Peoples R China
[3] Northwestern Polytech Univ, Key Lab Intelligent Interact & Applicat, Minist Ind & Informat Technol, Xian 710072, Peoples R China
[4] Northwest A&F Univ, Coll Informat Engn, Yangling 712100, Peoples R China
[5] Northwest A&F Univ, Shaanxi Engn Res Ctr Intelligent Percept & Anal Ag, Yangling 712100, Peoples R China
[6] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect IOPEN, Xian 710072, Peoples R China
关键词
Feature extraction; Data models; Bipartite graph; Adaptation models; Data structures; Sparse matrices; Optimization; Large-scale feature selection; multiple subcluster centers; row-sparse projection; structured bipartite graph; unsupervised learning; UNSUPERVISED FEATURE-SELECTION; SUPERVISED FEATURE-SELECTION; DIMENSIONALITY REDUCTION;
D O I
10.1109/TNNLS.2024.3389029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection plays an important role in data analysis, yet traditional graph-based methods often produce suboptimal results. These methods typically follow a two-stage process: constructing a graph with data-to-data affinities or a bipartite graph with data-to-anchor affinities and independently selecting features based on their scores. In this article, a large-scale feature selection approach based on structured bipartite graph and row-sparse projection (RSBLFS)-B-2) is proposed to overcome this limitation. (RSBLFS)-B-2 integrates the construction of a structured bipartite graph consisting of c connected components into row-sparse projection learning with k nonzero rows. This integration allows for the joint selection of an optimal feature subset in an unsupervised manner. Notably, the c connected components of the structured bipartite graph correspond to c clusters, each with multiple subcluster centers. This feature makes RS (BLFS)-B-2 particularly effective for feature selection and clustering on nonspherical large-scale data. An algorithm with theoretical analysis is developed to solve the optimization problem involved in (RSBLFS)-B-2. Experimental results on synthetic and real-world datasets confirm its effectiveness in feature selection tasks.
引用
收藏
页码:6911 / 6924
页数:14
相关论文
共 70 条
[1]  
Arai H, 2016, AAAI CONF ARTIF INTE, P666
[2]  
Boyd Stephen., 2004, Convex Optimization, V1st, P727
[3]   Convex Sparse PCA for Unsupervised Feature Learning [J].
Chang, Xiaojun ;
Nie, Feiping ;
Yang, Yi ;
Zhang, Chengqi ;
Huang, Heng .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2016, 11 (01)
[4]   Compound Rank-k Projections for Bilinear Analysis [J].
Chang, Xiaojun ;
Nie, Feiping ;
Wang, Sen ;
Yang, Yi ;
Zhou, Xiaofang ;
Zhang, Chengqi .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (07) :1502-1513
[5]  
Chung F.R. K., 1997, Regional Conference Series in Mathematics, V92
[6]  
Daum F, 2003, AEROSP CONF PROC, P1979
[7]   Unsupervised Feature Selection with Adaptive Structure Learning [J].
Du, Liang ;
Shen, Yi-Dong .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :209-218
[8]   Exploiting Combination Effect for Unsupervised Feature Selection by l2,0 Norm [J].
Du, Xingzhong ;
Nie, Feiping ;
Wang, Weiqing ;
Yang, Yi ;
Zhou, Xiaofang .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (01) :201-214
[10]   A wrapper-based approach to image segmentation and classification [J].
Farmer, ME ;
Jain, AK .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2005, 14 (12) :2060-2072