A data-level fusion model for unsupervised attribute selection in multi-source homogeneous data

被引:88
作者
Zhang, Pengfei [1 ,2 ,3 ]
Li, Tianrui [1 ,2 ,3 ]
Yuan, Zhong [1 ,2 ,3 ]
Luo, Chuan [4 ]
Wang, Guoqiang [1 ,2 ,3 ]
Liu, Jia [1 ,2 ,3 ]
Du, Shengdong [1 ,2 ,3 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
[2] Southwest Jiaotong Univ, Inst Artificial Intelligence, Chengdu 611756, Peoples R China
[3] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data App, Chengdu 611756, Peoples R China
[4] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
国家重点研发计划;
关键词
Information fusion; Granular computing; Rough sets; Uncertainty measures; Unsupervised attribute selection; ROUGH SET-THEORY; INFORMATION FUSION; 3-WAY DECISION; ENTROPY; SAFETY;
D O I
10.1016/j.inffus.2021.10.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information fusion refers to derive an overall precise description of data by using certain fusion technique for utilizing the complementary information from multiple sources of data, which can facilitate effective decision-making, prediction and classification, etc. Multi-source homogeneous data, characterizing the data type of variables in the sample in form of one type (i.e., numerical or categorical) in different information sources, which widely exists in many practical applications. This paper concentrates on efficient fusion of multi-source homogeneous data with a data-level fusion model which involves the consolidation of multiple information sources and unsupervised attribute selection of the fused data. A unified description and modeling method of a multi-source homogeneous information system is introduced. The neighborhood rough sets model is used to construct the neighborhood granular structure, which uses the idea of granular computing to build methods of uncertainty measures. Given the uncertainty of fusing multiple information sources, Sup-Inf fusion functions are developed based on the proposed uncertainty measures, which can fuse the multi-source homogeneous information system into a single-source information system. Finally, an unsupervised attribute selection approach is employed to eliminate redundant attribute of the single-source information system. Theoretical analysis and comprehensive experiments on several datasets demonstrate the feasibility and superiority of our method.
引用
收藏
页码:87 / 103
页数:17
相关论文
共 53 条
[1]   Cluster ensembles: A survey of approaches with recent extensions and applications [J].
Boongoen, Tossapon ;
Iam-On, Natthakan .
COMPUTER SCIENCE REVIEW, 2018, 28 :1-25
[2]   Feature-level fusion approaches based on multimodal EEG data for depression recognition [J].
Cai, Hanshu ;
Qu, Zhidiao ;
Li, Zhe ;
Zhang, Yi ;
Hu, Xiping ;
Hu, Bin .
INFORMATION FUSION, 2020, 59 (59) :127-138
[3]   Information fusion and numerical characterization of a multi-source information system [J].
Che, Xiaoya ;
Mi, Jusheng ;
Chen, Degang .
KNOWLEDGE-BASED SYSTEMS, 2018, 145 :121-133
[4]   Granule structures, distances and measures in neighborhood systems [J].
Chen, Yumin ;
Qin, Nan ;
Li, Wei ;
Xu, Feifei .
KNOWLEDGE-BASED SYSTEMS, 2019, 165 :268-281
[5]   Robust Human Activity Recognition Using Multimodel Feature-Level Fusion [J].
Ehatisham-Ul-Haq, Muhammad ;
Javed, Ali ;
Azam, Muhammad Awais ;
Malik, Hafiz M. A. ;
Irtaza, Aun ;
Lee, Ik Hyun ;
Mahmood, Muhammad Tariq .
IEEE ACCESS, 2019, 7 :60736-60751
[6]  
Han C, 2010, MULTISOURE INFORM FU, VSecond
[7]  
Han J, 2012, MOR KAUF D, P1
[8]  
Herrera F, 2004, INTELLIGENT SENSORY EVALUATION: METHODOLOGIES AND APPLICATIONS, P71
[9]   Neighborhood rough set based heterogeneous feature subset selection [J].
Hu, Qinghua ;
Yu, Daren ;
Liu, Jinfu ;
Wu, Congxin .
INFORMATION SCIENCES, 2008, 178 (18) :3577-3594
[10]   An efficient gene selection technique for cancer recognition based on neighborhood mutual information [J].
Hu, Qinghua ;
Pan, Wei ;
An, Shuang ;
Ma, Peijun ;
Wei, Jinmao .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2010, 1 (1-4) :63-74