Enabling and Optimizing Non-linear Feature Interactions in Factorized Linear Algebra

被引:20
作者
Li, Side [1 ]
Chen, Lingjiao [2 ]
Kumar, Arun [1 ]
机构
[1] Univ Calif San Diego, La Jolla, CA 92093 USA
[2] Univ Wisconsin, Madison, WI USA
来源
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2019年
关键词
D O I
10.1145/3299869.3319878
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accelerating machine learning (ML) over relational data is a key focus of the database community. While many real-world datasets are multi-table, most ML tools expect single-table inputs, forcing users to materialize joins before ML, leading to data redundancy and runtime waste. Recent works on "factorized ML" address such issues by pushing ML through joins. However, they have hitherto been restricted to ML models linear in the feature space, rendering them less effective when users construct non-linear feature interactions such as pairwise products to boost ML accuracy. In this work, we take a first step towards closing this gap by introducing a new abstraction to enable pairwise feature interactions in multi-table data and present an extensive framework of algebraic rewrite rules for factorized LA operators over feature interactions. Our rewrite rules carefully exploit the interplay of the redundancy caused by both joins and interactions. We prototype our framework in Python to build a tool we call MorpheusFI. An extensive empirical evaluation with both synthetic and real datasets shows that MorpheusFI yields up to 5x speedups over materialized execution for a popular second-order gradient method and even an order of magnitude speedups over a popular stochastic gradient method.
引用
收藏
页码:1571 / 1588
页数:18
相关论文
共 50 条
  • [41] NON-LINEAR INTERACTIONS IN INDIUM-ANTIMONIDE
    HASSELBACH, M
    HSU, SC
    KWOK, HS
    [J]. APPLIED PHYSICS B-PHOTOPHYSICS AND LASER CHEMISTRY, 1982, 28 (2-3): : 253 - 254
  • [42] MANY BODY FORCES AND NON-LINEAR INTERACTIONS
    CLEMENTEL, E
    VILLI, C
    [J]. NUOVO CIMENTO, 1955, 1 (02): : 344 - 346
  • [43] Non-linear interactions in imperfect beams at veering
    Lacarbonara, W
    Arafat, HN
    Nayfeh, AH
    [J]. INTERNATIONAL JOURNAL OF NON-LINEAR MECHANICS, 2005, 40 (07) : 987 - 1003
  • [44] Clustering non-linear interactions in factor analysis
    Erick da Conceição Amorim
    Vinícius Diniz Mayrink
    [J]. METRON, 2020, 78 : 329 - 352
  • [45] Linear and non-linear properties of feature selectivity in V4 neurons
    Touryan, Jon
    Mazer, James A.
    [J]. FRONTIERS IN SYSTEMS NEUROSCIENCE, 2015, 9 : 1 - 12
  • [46] UNIFYING AND OPTIMIZING LINEAR ALGEBRA ALGORITHMS
    ANGELACCIO, M
    COLAJANNI, M
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1993, 4 (12) : 1382 - 1397
  • [47] LINEAR AND NON-LINEAR RESPONSE
    KALLIO, A
    PUOSKARI, M
    LANTTO, L
    PIETILAINEN, P
    HALONEN, V
    [J]. LECTURE NOTES IN PHYSICS, 1984, 198 : 210 - 218
  • [48] Linear, or Non-Linear, That is the Question!
    Kong, Taeyong
    Kim, Taeri
    Jeon, Jinsung
    Choi, Jeongwhan
    Lee, Yeon-Chang
    Park, Noseong
    Kim, Sang-Wook
    [J]. WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 517 - 525
  • [49] On non-linear stability in unconstrained non-linear elasticity
    Haughton, DM
    [J]. INTERNATIONAL JOURNAL OF NON-LINEAR MECHANICS, 2004, 39 (07) : 1181 - 1192
  • [50] NON-LINEAR PROGRAMMING AND NON-LINEAR REGRESSION PROCEDURES
    EDWARDS, C
    [J]. JOURNAL OF FARM ECONOMICS, 1962, 44 (01): : 100 - 114