Semi-supervised attribute reduction based on label distribution and label irrelevance

Cited by: 19

Authors:

Dai, Jianhua [1]

Huang, Weiyi

Wang, Weisi

Zhang, Chucai

Affiliations:

[1] Hunan Normal Univ, Hunan Prov Key Lab Intelligent Comp & Language Inf, Changsha 410081, Peoples R China

Source:

INFORMATION FUSION | 2023 / Vol. 100

Keywords:

Attribute reduction; Fuzzy similarity relation; Semi-supervised; Label distribution; Label irrelevance; ROUGH SET-THEORY; FEATURE-SELECTION; KNOWLEDGE GRANULATION; CONDITIONAL-ENTROPY;

DOI:

10.1016/j.inffus.2023.101951

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Attribute reduction in partially labeled data, also called semi-supervised attribute reduction, has attracted considerable research attention in recent years. However, most existing semi-supervised attribute reduction methods do not adequately handle the information loss caused by missing labels. Moreover, these methods generally consider only the relevance between attributes and labels when measuring attribute correlations, which ignores the information in the attributes that is irrelevant to the labels. This paper therefore proposes a novel semi-supervised attribute reduction algorithm that considers attribute relevance, redundancy, and label irrelevance from the perspective of label distribution. First, the membership degree of unlabeled objects with respect to labels is defined via a fuzzy similarity relation, which restores information and converts partially labeled data into label distribution data. Second, several fuzzy uncertainty measures for label distribution are defined and their properties are investigated. In addition, because irrelevant information carried by attributes may lead to over-fitting, a label irrelevance criterion based on the fuzzy uncertainty measures is constructed. Third, a novel semi-supervised attribute reduction algorithm based on maximum relevance, minimum redundancy, and minimum irrelevance is proposed. Finally, experiments comparing the proposed algorithm with representative semi-supervised attribute reduction algorithms and a supervised attribute reduction algorithm verify its effectiveness.
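The first step in the abstract, turning partially labeled data into label distribution data through a fuzzy similarity relation, can be sketched roughly as follows. This record does not give the paper's definitions, so everything specific here is an assumption made for illustration: the similarity (minimum over attributes of 1 - |difference|, with attributes scaled to [0, 1]), the aggregation rule (mean similarity to the labeled objects of each class, normalized per object), the `-1` marker for missing labels, and the function names `fuzzy_similarity` and `to_label_distribution`.

```python
import numpy as np

def fuzzy_similarity(X):
    """Pairwise fuzzy similarity: min over attributes of 1 - |x_ia - x_ja|.

    Assumed relation for illustration; requires attributes scaled to [0, 1].
    """
    diff = np.abs(X[:, None, :] - X[None, :, :])  # shape (n, n, d)
    return (1.0 - diff).min(axis=2)               # shape (n, n)

def to_label_distribution(X, y, unlabeled=-1):
    """Return (classes, D), where D[i, k] is the membership of object i in classes[k]."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    classes = np.unique(y[y != unlabeled])
    S = fuzzy_similarity(X)

    D = np.zeros((len(y), len(classes)))
    for k, c in enumerate(classes):
        # Hypothetical rule: mean similarity to the labeled objects of class c.
        D[:, k] = S[:, y == c].mean(axis=1)

    # Normalize each row to sum to 1; fall back to uniform if a row is all zeros.
    row_sums = D.sum(axis=1, keepdims=True)
    uniform = np.full_like(D, 1.0 / len(classes))
    D = np.divide(D, row_sums, out=uniform, where=row_sums > 0)

    # Labeled objects keep their known label as a one-hot distribution.
    labeled = y != unlabeled
    D[labeled] = (y[labeled][:, None] == classes[None, :]).astype(float)
    return classes, D

if __name__ == "__main__":
    X = [[0.0], [1.0], [0.25]]
    y = [0, 1, -1]  # the third object is unlabeled
    classes, D = to_label_distribution(X, y)
    print(classes)
    print(D)
```

In this sketch the unlabeled object receives a soft label weighted toward the class whose labeled members it most resembles, which is the general idea of restoring label information before computing uncertainty measures; the paper's actual membership definition may differ.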

References

Pages: 16

45 entries in total

[1] A semi-supervised feature ranking method with ensemble learning [J].