Enabling and Optimizing Non-linear Feature Interactions in Factorized Linear Algebra

被引:20
|
作者
Li, Side [1 ]
Chen, Lingjiao [2 ]
Kumar, Arun [1 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] Univ Wisconsin, Madison, WI USA
来源
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2019年
关键词
D O I
10.1145/3299869.3319878
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accelerating machine learning (ML) over relational data is a key focus of the database community. While many real-world datasets are multi-table, most ML tools expect single-table inputs, forcing users to materialize joins before ML, leading to data redundancy and runtime waste. Recent works on "factorized ML" address such issues by pushing ML through joins. However, they have hitherto been restricted to ML models linear in the feature space, rendering them less effective when users construct non-linear feature interactions such as pairwise products to boost ML accuracy. In this work, we take a first step towards closing this gap by introducing a new abstraction to enable pairwise feature interactions in multi-table data and present an extensive framework of algebraic rewrite rules for factorized LA operators over feature interactions. Our rewrite rules carefully exploit the interplay of the redundancy caused by both joins and interactions. We prototype our framework in Python to build a tool we call MorpheusFI. An extensive empirical evaluation with both synthetic and real datasets shows that MorpheusFI yields up to 5x speedups over materialized execution for a popular second-order gradient method and even an order of magnitude speedups over a popular stochastic gradient method.
引用
收藏
页码:1571 / 1588
页数:18
相关论文
共 50 条
  • [1] Linear and non-linear algebra
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2001, 1981 : 271 - 272
  • [2] Algebra of compositions and non-linear equations
    Daletskii, YL
    ALGEBRAIC AND GEOMETRIC METHODS IN MATHEMATICAL PHYSICS, 1996, 19 : 277 - 291
  • [3] Linear and non-linear models of brain interactions
    Büchel, C
    BIOLOGICAL PSYCHIATRY, 2000, 47 (08) : 64S - 64S
  • [4] Non-linear image feature tracking
    van Wyk, BJ
    van Wyk, MA
    SEVENTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2005, : 371 - 375
  • [5] Non-linear generalization of the sl(2) algebra
    Curado, EMF
    Rego-Monteiro, MA
    PHYSICS LETTERS A, 2002, 300 (2-3) : 205 - 212
  • [6] Fault diagnosis for multivariable non-linear systems based on non-linear spectrum feature
    Zhang, Jialiang
    Cao, Jianfu
    Gao, Feng
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2017, 39 (07) : 1017 - 1026
  • [7] Linear and Non-Linear Visual Feature Learning in Rat and Humans
    Bossens, Christophe
    Op de Beeck, Hans P.
    FRONTIERS IN BEHAVIORAL NEUROSCIENCE, 2016, 10
  • [8] Feature Extraction Using Linear and Non-linear Subspace Techniques
    Teixeira, Ana R.
    Tome, Ana Maria
    Lang, E. W.
    ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT II, 2009, 5769 : 115 - +
  • [9] A NON-LINEAR THEORY OF STRONG INTERACTIONS
    SKYRME, THR
    PROCEEDINGS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL AND PHYSICAL SCIENCES, 1958, 247 (1249): : 260 - 278
  • [10] NON-LINEAR INTERACTIONS AND NUCLEAR SATURATION
    CLEMENTEL, E
    VILLI, C
    NUOVO CIMENTO, 1955, 1 (06): : 1273 - 1276