RSFD: A rough set-based feature discretization method for meteorological data

被引:4
作者
Zeng, Lirong [1 ]
Chen, Qiong [2 ]
Huang, Mengxing [1 ]
机构
[1] Hainan Univ, Sch Informat & Commun Engn, State Key Lab Marine Resource Utilizat South China, Haikou, Peoples R China
[2] Tsinghua Univ, Inst Global Change Studies, Dept Earth Syst Sci, Minist Educ,Key Lab Earth Syst Modeling, Beijing, Peoples R China
基金
中国博士后科学基金; 海南省自然科学基金;
关键词
meteorological data; feature discretization; information gain; rough set; classification accuracy; SYSTEM;
D O I
10.3389/fenvs.2022.1013811
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Meteorological data mining aims to discover hidden patterns in a large number of available meteorological data. As one of the most relevant big data preprocessing technologies, feature discretization can transform continuous features into discrete ones to improve the efficiency of meteorological data mining algorithms. Aiming at the problems of high interaction of multiple attributes, noise interference, and difficulty in obtaining prior knowledge in meteorological data, we propose a rough set-based feature discretization method for meteorological data (RSFD). First, we calculate the information gain of each candidate breakpoint in the meteorological attribute to split the intervals. Then, we use chi-square test to merge these discrete intervals. Finally, we take the variation of indiscernibility relation in rough set as the evaluation criterion for the discretization scheme. We scan each attribute in turn by using the strategy of splitting first and then merging, thus obtaining the optimal discrete feature set. We compare RSFD with the state-of-the-art discretization methods on meteorological data. Experiments show that our method achieves better results in the classification accuracy of meteorological data, and obtains a smaller number of discrete intervals while ensuring data consistency.
引用
收藏
页数:8
相关论文
共 26 条
[1]   Spatiotemporal Change of Air-Quality Patterns in Hubei Province-A Pre- to Post-COVID-19 Analysis Using Path Analysis and Regression [J].
Aamir, Muhammad ;
Li, Zeyun ;
Bazai, Sibghatullah ;
Wagan, Raja Asif ;
Bhatti, Uzair Aslam ;
Nizamani, Mir Muhammad ;
Akram, Shakeel .
ATMOSPHERE, 2021, 12 (10)
[2]   Climate change threatens Pakistan's snow leopards [J].
Bhatti, Uzair Aslam ;
Nizamani, Mir Muhammad ;
Huang Mengxing .
SCIENCE, 2022, 377 (6606) :585-586
[3]   Local Similarity-Based Spatial-Spectral Fusion Hyperspectral Image Classification With Deep CNN and Gabor Filtering [J].
Bhatti, Uzair Aslam ;
Yu, Zhaoyuan ;
Chanussot, Jocelyn ;
Zeeshan, Zeeshan ;
Yuan, Linwang ;
Luo, Wen ;
Nawaz, Saqib Ali ;
Bhatti, Mughair Aslam ;
ul Ain, Qurat ;
Mehmood, Anum .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[4]   Time Series Analysis and Forecasting of Air Pollution Particulate Matter (PM2.5): An SARIMA and Factor Analysis Approach [J].
Bhatti, Uzair Aslam ;
Yan, Yuhuan ;
Zhou, Mingquan ;
Ali, Sajid ;
Hussain, Aamir ;
Huo, Qingsong ;
Yu, Zhaoyuan ;
Yuan, Linwang .
IEEE ACCESS, 2021, 9 :41019-41031
[5]  
Chen Q., 2018, 2018 OCEANS-MTS/IEEE Kobe Techno-Oceans (OTO), P1
[6]   Generalized Interval Type-II Fuzzy Rough Model-Based Feature Discretization for Mixed Pixels [J].
Chen, Qiong ;
Ding, Weiping ;
Huang, Xiaomeng ;
Wang, Hao .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2023, 31 (03) :845-859
[7]   A Feature Discretization Method for Classification of High-Resolution Remote Sensing Images in Coastal Areas [J].
Chen, Qiong ;
Huang, Mengxing ;
Wang, Hao .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (10) :8584-8598
[8]   A Feature Discretization Method Based on Fuzzy Rough Sets for High-Resolution Remote Sensing Big Data Under Linear Spectral Model [J].
Chen, Qiong ;
Huang, Mengxing ;
Wang, Hao ;
Xu, Guangquan .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (05) :1328-1342
[9]   Rough fuzzy model based feature discretization in intelligent data preprocess [J].
Chen, Qiong ;
Huang, Mengxing .
JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2021, 10 (01)
[10]   Reinforcement Learning-Based Genetic Algorithm in Optimizing Multidimensional Data Discretization Scheme [J].
Chen, Qiong ;
Huang, Mengxing ;
Xu, Qiannan ;
Wang, Hao ;
Wang, Jinghui .
MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020