Uncertainty measurement for heterogeneous data: an application in attribute reduction

被引:7
作者
Song, Yan [1 ]
Zhang, Gangqiang [2 ]
He, Jiali [1 ]
Liao, Shimin [3 ]
Xie, Ningxin [2 ]
机构
[1] Yulin Normal Univ, Sch Math & Stat, Yulin 537000, Guangxi, Peoples R China
[2] Guangxi Univ Nationalities, Sch Artificial Intelligence, Nanning 530006, Guangxi, Peoples R China
[3] Guangxi Univ Nationalities, Sch Math & Phys, Nanning 530006, Guangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
HIS; Uncertainty; Measurement; Effectiveness; Attribute reduction;
D O I
10.1007/s10462-021-09978-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the era of big data, multimedia, hyper-media and social networks are emerging, and the amount of information is growing rapidly. When people participate in the process of massive data processing, they will encounter data with different structures, so data has heterogeneity. How to acquire hidden and valuable knowledge from heterogeneous data and measure its uncertainty is an important problem in artificial intelligence. This paper investigates uncertainty measurement for heterogeneous data and gives its application in attribute reduction. The concept of a heterogeneous information system (HIS) is first proposed. Then, an equivalence relation on the object set is constructed. Next, uncertainty measurement for a HIS is investigated, a numerical experiment is given, and dispersion analysis, correlation analysis, and Friedman test and Bonferroni-Dunn test in statistics are conducted. Finally, as an application of the proposed measures, attribute reduction in a HIS is studied, and the corresponding algorithms and their analysis are proposed.
引用
收藏
页码:991 / 1027
页数:37
相关论文
共 31 条
[1]   Information-theoretic measures of uncertainty for rough sets and rough relational databases [J].
Beaubouef, T ;
Petry, FE ;
Arora, G .
INFORMATION SCIENCES, 1998, 109 (1-4) :185-195
[2]   An entropy-based uncertainty measurement approach in neighborhood systems [J].
Chen, Yumin ;
Wu, Keshou ;
Chen, Xuhui ;
Tang, Chaohui ;
Zhu, Qingxin .
INFORMATION SCIENCES, 2014, 279 :239-250
[3]   Uncertainty measurement for interval-valued decision systems based on extended conditional entropy [J].
Dai, Jianhua ;
Wang, Wentao ;
Xu, Qing ;
Tian, Haowei .
KNOWLEDGE-BASED SYSTEMS, 2012, 27 :443-450
[4]   MULTIPLE COMPARISONS AMONG MEANS [J].
DUNN, OJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) :52-&
[5]   Uncertainty measures of rough set prediction [J].
Düntsch, I ;
Gediga, G .
ARTIFICIAL INTELLIGENCE, 1998, 106 (01) :109-137
[6]   A comparison of alternative tests of significance for the problem of m rankings [J].
Friedman, M .
ANNALS OF MATHEMATICAL STATISTICS, 1940, 11 :86-92
[7]   Information-preserving hybrid data reduction based on fuzzy-rough techniques [J].
Hu, QH ;
Yu, DR ;
Xie, ZX .
PATTERN RECOGNITION LETTERS, 2006, 27 (05) :414-423
[8]   Neighborhood rough set based heterogeneous feature subset selection [J].
Hu, Qinghua ;
Yu, Daren ;
Liu, Jinfu ;
Wu, Congxin .
INFORMATION SCIENCES, 2008, 178 (18) :3577-3594
[9]   Measures of uncertainty based on Gaussian kernel for a fully fuzzy information system [J].
Li, Zhaowen ;
Liu, Xiaofeng ;
Dai, Jianhua ;
Chen, Jiaolong ;
Fujita, Hamido .
KNOWLEDGE-BASED SYSTEMS, 2020, 196
[10]   Measures of uncertainty for knowledge bases [J].
Li, Zhaowen ;
Zhang, Gangqiang ;
Wu, Wei-Zhi ;
Xie, Ningxin .
KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (02) :611-637