Enabling and Optimizing Non-linear Feature Interactions in Factorized Linear Algebra

被引:20
作者
Li, Side [1 ]
Chen, Lingjiao [2 ]
Kumar, Arun [1 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] Univ Wisconsin, Madison, WI USA
来源
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2019年
关键词
D O I
10.1145/3299869.3319878
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accelerating machine learning (ML) over relational data is a key focus of the database community. While many real-world datasets are multi-table, most ML tools expect single-table inputs, forcing users to materialize joins before ML, leading to data redundancy and runtime waste. Recent works on "factorized ML" address such issues by pushing ML through joins. However, they have hitherto been restricted to ML models linear in the feature space, rendering them less effective when users construct non-linear feature interactions such as pairwise products to boost ML accuracy. In this work, we take a first step towards closing this gap by introducing a new abstraction to enable pairwise feature interactions in multi-table data and present an extensive framework of algebraic rewrite rules for factorized LA operators over feature interactions. Our rewrite rules carefully exploit the interplay of the redundancy caused by both joins and interactions. We prototype our framework in Python to build a tool we call MorpheusFI. An extensive empirical evaluation with both synthetic and real datasets shows that MorpheusFI yields up to 5x speedups over materialized execution for a popular second-order gradient method and even an order of magnitude speedups over a popular stochastic gradient method.
引用
收藏
页码:1571 / 1588
页数:18
相关论文
共 50 条
  • [21] Radial projections for non-linear feature extraction
    Perez-Jimenez, AJ
    Perez-Cortes, JC
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 444 - 447
  • [22] THE EXPLICIT STRUCTURE OF THE NON-LINEAR SCHRODINGER PROLONGATION ALGEBRA
    VANECK, HN
    GRAGERT, PKH
    MARTINI, R
    PROCEEDINGS OF THE KONINKLIJKE NEDERLANDSE AKADEMIE VAN WETENSCHAPPEN SERIES A-MATHEMATICAL SCIENCES, 1983, 86 (02): : 165 - 172
  • [23] Covariant formulation of the tensor algebra of non-linear elasticity
    Federico, Salvatore
    INTERNATIONAL JOURNAL OF NON-LINEAR MECHANICS, 2012, 47 (02) : 273 - 284
  • [24] CERTAIN FEATURE OF NON-LINEAR GALVANOMAGNETIC CHARACTERISTICS
    KACHLISHVILI, ZS
    PHYSICA STATUS SOLIDI A-APPLICATIONS AND MATERIALS SCIENCE, 1978, 49 (02): : K135 - K137
  • [25] Non-Linear Feature Extraction by Linear PCA Using Local Kernel
    Hotta, Kazuhiro
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2064 - 2067
  • [26] An accelerated linear method for optimizing non-linear wavefunctions in variational Monte Carlo
    Sabzevari, Iliya
    Mahajan, Ankit
    Sharma, Sandeep
    JOURNAL OF CHEMICAL PHYSICS, 2020, 152 (02)
  • [27] A NON-LINEAR DIGITAL OPTIMIZING PROGRAM FOR PROCESS CONTROL
    MUGELE, RA
    COMMUNICATIONS OF THE ACM, 1962, 5 (03) : 134 - 134
  • [28] PROBLEM OF OPTIMIZING ENERGY CHARACTERISTICS OF NON-LINEAR ACCELERATORS
    BATYGIN, YV
    IVANOV, GM
    MAKHNENKO, LA
    ZHURNAL TEKHNICHESKOI FIZIKI, 1980, 50 (03): : 544 - 548
  • [29] Evaluating Linear and Non-linear Genotype–Environment Interactions in Pearl Millet
    Abdalla E.A.
    Virk D.S.
    Abera F.A.
    Proceedings of the National Academy of Sciences, India Section B: Biological Sciences, 2019, 89 (1) : 231 - 241
  • [30] Optimizing switching of non-linear properties with hyperbolic metamaterials
    Ethridge, James A.
    Jones, John G.
    Ferdinandus, Manuel R.
    Havrilla, Michael J.
    Marciniak, Michael A.
    OPTICAL MATERIALS EXPRESS, 2022, 12 (11) : 4471 - 4483