Interval-valued fuzzy predicates from labeled data: An approach to data classification and knowledge discovery

被引:0
作者
Comas, Diego S. [1 ,2 ]
Meschino, Gustavo J. [3 ]
Ballarin, Virginia L. [2 ]
机构
[1] Consejo Nacl Invest Cient & Tecn CONICET, Buenos Aires, Argentina
[2] Univ Nacl Mar del Plata, Fac Ingn, Image Proc Lab, Inst Invest Cient & Tecnol Elect ICyTE, Juan B Justo 4302,B7608FDQ, Mar Del Plata, Argentina
[3] Univ Nacl Mar del Plata, CONICET, Bioengn Lab, Inst Invest Cient & Tecnol Elect ICyTE,Fac Ingn, Juan B Justo 4302,B7608FDQ, Mar Del Plata, Argentina
关键词
Interval-valued fuzzy logic; Data classification; Knowledge discovery; Membership functions; GENERATION; SUPPORT;
D O I
10.1016/j.ins.2025.122033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Interpretable data classifiers play a significant role in providing transparency in the decisionmaking process by ensuring accountability and auditability, enhancing model understanding, and extracting new information that expands the field of knowledge in a discipline while effectively handling large datasets. This paper introduces the Type-2 Label-based Fuzzy Predicate Classification (T2-LFPC) method, in which interval-valued fuzzy predicates are used for interpretable data classification. The proposed approach begins by clustering the data within each class, associating clusters with collections of common attributes, and identifying class prototypes. Interval-valued membership functions and predicates are then derived from these prototypes, leading to the creation of an interpretable classifier. Empirical evaluations on 14 datasets, both public and synthetic, are presented to demonstrate the superior performance of T2-LFPC based on the accuracy and Jaccard index. The proposed method enables linguistic descriptions of classes, insight into attribute semantics, class property definitions, and an understanding of data space partitioning. This innovative approach enhances knowledge discovery by addressing the challenges posed by the complexity and size of modern datasets.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] Knowledge Discovery from Large Amounts of Social Media Data
    Belcastro, Loris
    Cantini, Riccardo
    Marozzo, Fabrizio
    APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [32] Random Forests in a Glassworks: Knowledge Discovery from Industrial Data
    Setlak, Galina
    Pasko, Lukasz
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, ISAT 2019, PT II, 2020, 1051 : 179 - 188
  • [33] Knowledge Discovery: Methods from data mining and machine learning
    Shu, Xiaoling
    Ye, Yiwan
    SOCIAL SCIENCE RESEARCH, 2023, 110
  • [34] Visual Knowledge Discovery from Public Transit Performance Data
    Leung, Carson K.
    Munshi, Mohammadafaz V.
    Patel, Vrushil Kiritkumar
    Nhu Minh Ngoc Pham
    Wu, Yixi
    2023 27TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION, IV, 2023, : 323 - 328
  • [35] Visualization and Visual Knowledge Discovery from Big Uncertain Data
    Leung, Carson K.
    Madill, Evan W. R.
    Pazdor, Adam
    2022 26TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV), 2022, : 330 - 335
  • [36] Knowledge Discovery from Honeypot Data for Monitoring Malicious Attacks
    Jin, Huidong
    de Vel, Olivier
    Zhang, Ke
    Liu, Nianjun
    AI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5360 : 470 - +
  • [37] Evolutionary intelligent data warehousing approach to knowledge discovery systems: Dynamic cubing
    Kaur H.
    Singh K.
    Kaur T.
    Recent Advances in Computer Science and Communications, 2021, 14 (06) : 1869 - 1882
  • [38] A Novel Approach for Knowledge Discovery from AIS Data: An Application for Transit Marine Traffic in the Sea of Marmara
    Dogan, Yunus
    Kart, Ozge
    Kundakci, Burak
    Nas, Selcuk
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2021, 21 (03) : 73 - 80
  • [39] Knowledge Discovery from Unstructured Data in Financial Services (KDF) Workshop
    Shah, Sameena
    Zhu, Xiandan
    Chen, Wenhu
    Li, Manling
    Nourbakhsh, Armineh
    Liu, Xiaomo
    Ma, Zhiqiang
    Smiley, Charese
    Pei, Yulong
    Gupta, Akshat
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3464 - 3467
  • [40] A Review of the Enabling Methodologies for Knowledge Discovery from Smart Grids Data
    De Caro, Fabrizio
    Andreotti, Amedeo
    Araneo, Rodolfo
    Panella, Massimo
    Rosato, Antonello
    Vaccaro, Alfredo
    Villacci, Domenico
    ENERGIES, 2020, 13 (24)