Novel Regularization for Learning the Fuzzy Choquet Integral With Limited Training Data

被引:2
作者
Kakula, Siva Krishna [1 ]
Pinar, Anthony J. [2 ]
Islam, Muhammad Aminul [3 ]
Anderson, Derek T. [3 ]
Havens, Timothy C. [4 ]
机构
[1] Michigan Technol Univ, Dept Comp Sci, Houghton, MI 49931 USA
[2] Michigan Technol Univ, Dept Elect & Comp Engn, Houghton, MI 49931 USA
[3] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65211 USA
[4] Michigan Technol Univ, Coll Comp, Houghton, MI 49931 USA
关键词
Frequency modulation; Lattices; Training data; Kernel; Data visualization; Density measurement; Quadratic programming; Choquet fuzzy integral (FI); fuzzy measure (FM); quadratic programming (QP); regularization; support vector machine (SVM); AGGREGATION FUNCTIONS; PRE-AGGREGATION; CLASSIFICATION; FUSION; CONSTRUCTION; PERFORMANCE; REGRESSION; OPERATORS;
D O I
10.1109/TFUZZ.2020.3009722
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fuzzy integrals (FIs) are powerful aggregation operators that fuse information from multiple sources. The aggregation is parameterized using a fuzzy measure (FM), which encodes the worths of all subsets of sources. Since the FI is defined with respect to an FM, much consideration must be given to defining the FM. However, in practice this is a difficult task-the number of values in an FM scales as 2(n), where n is the number of input sources, thus manually specifying an FM quickly becomes tedious. In this article, we review an automatic, data-supported method of learning the FM by minimizing a sum-of-squared error objective function in the context of decision-level fusion of classifiers using the Choquet FI. While this solves the specification problem, we illuminate an issue encountered with many real-world data sets; i.e., if the training data do not contain a significant number of all possible sort orders, many of the FM values are not supported by the data. We propose various regularization strategies to alleviate this issue by pushing the learned FM toward a predefined structure; these regularizers allow the user to encode knowledge of the underlying FM to the learning problem. Furthermore, we propose another regularization strategy that constrains the learned FM's structure to be a linear order statistic. Finally, we perform several experiments using synthetic and real-world data sets and show that our proposed extensions can improve the learned FM behavior and classification accuracy. A previously proposed visualization technique is employed to simultaneously quantitatively illustrate the FM as well as the FI.
引用
收藏
页码:2890 / 2901
页数:12
相关论文
共 77 条
  • [1] Extension of the Fuzzy Integral for General Fuzzy Set-Valued Information
    Anderson, Derek T.
    Havens, Timothy C.
    Wagner, Christian
    Keller, James M.
    Anderson, Melissa F.
    Wescott, Daniel J.
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2014, 22 (06) : 1625 - 1639
  • [2] Anderson DT, 2014, IEEE INT FUZZY SYST, P2519, DOI 10.1109/FUZZ-IEEE.2014.6891630
  • [3] Estimation of Adult Skeletal Age-at-Death Using the Sugeno Fuzzy Integral
    Anderson, Melissa F.
    Anderson, Derek T.
    Wescott, Daniel J.
    [J]. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY, 2010, 142 (01) : 30 - 41
  • [4] Assessing non-additive utility for multicriteria decision aid
    Angilella, S
    Greco, S
    Lamantia, F
    Matarazzo, B
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2004, 158 (03) : 734 - 744
  • [5] Non-additive robust ordinal regression: A multiple criteria decision model based on the Choquet integral
    Angilella, Silvia
    Greco, Salvatore
    Matarazzo, Benedetto
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 201 (01) : 277 - 288
  • [6] [Anonymous], 1963, DISTRIBUTION FREE MU
  • [7] [Anonymous], 2019, MATLAB STAT MACH LEA
  • [8] [Anonymous], 2000, Fuzzy measures and integrals: theory and applications
  • [9] Auephanwiriyakul S., 2002, Information Fusion, V3, P69, DOI 10.1016/S1566-2535(01)00054-9
  • [10] Diamond Sampling for Approximate Maximum All-pairs Dot-product (MAD) Search
    Ballard, Grey
    Kolda, Tamara G.
    Pinar, Ali
    Seshadhri, C.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 11 - 20