Quaternion Factorization Machines: A Lightweight Solution to Intricate Feature Interaction Modeling

被引:8
作者
Chen, Tong [1 ]
Yin, Hongzhi [1 ]
Zhang, Xiangliang [2 ]
Huang, Zi [1 ]
Wang, Yang [3 ,4 ]
Wang, Meng [3 ,4 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
[2] Univ Notre Dame, Coll Engn, Notre Dame, IN 46556 USA
[3] Hefei Univ Technol, Minist Educ, Key Lab Knowledge Engn Big Data, Hefei 230009, Anhui, Peoples R China
[4] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
基金
澳大利亚研究理事会;
关键词
Quaternions; Frequency modulation; Predictive models; Predictive analytics; Computational modeling; Task analysis; Internet of Things; Factorization machines (FMs); predictive analytics; quaternion representations; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.1109/TNNLS.2021.3118706
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the sparsity of available features in web-scale predictive analytics, combinatorial features become a crucial means for deriving accurate predictions. As a well-established approach, a factorization machine (FM) is capable of automatically learning high-order interactions among features to make predictions without the need for manual feature engineering. With the prominent development of deep neural networks (DNNs), there is a recent and ongoing trend of enhancing the expressiveness of FM-based models with DNNs. However, though better results are obtained with DNN-based FM variants, such performance gain is paid off by an enormous amount (usually millions) of excessive model parameters on top of the plain FM. Consequently, the heavy parameterization impedes the real-life practicality of those deep models, especially efficient deployment on resource-constrained Internet of Things (IoT) and edge devices. In this article, we move beyond the traditional real space where most deep FM-based models are defined and seek solutions from quaternion representations within the hypercomplex space. Specifically, we propose the quaternion factorization machine (QFM) and quaternion neural factorization machine (QNFM), which are two novel lightweight and memory-efficient quaternion-valued models for sparse predictive analytics. By introducing a brand new take on FM-based models with the notion of quaternion algebra, our models not only enable expressive inter-component feature interactions but also significantly reduce the parameter size due to lower degrees of freedom in the hypercomplex Hamilton product compared with real-valued matrix multiplication. Extensive experimental results on three large-scale datasets demonstrate that QFM achieves 4.36% performance improvement over the plain FM without introducing any extra parameters, while QNFM outperforms all baselines with up to two magnitudes' parameter size reduction in comparison to state-of-the-art peer methods.
引用
收藏
页码:4345 / 4358
页数:14
相关论文
共 61 条
[1]  
[Anonymous], 2015, ARXIV PREPRINT ARXIV
[2]  
[Anonymous], 2015, ACS SYM SER
[3]  
Arjovsky M, 2016, PR MACH LEARN RES, V48
[4]  
Blondel M, 2016, ADV NEUR IN, V29
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]   Efficient Non-Sampling Factorization Machines for Optimal Context-Aware Recommendation [J].
Chen, Chong ;
Zhang, Min ;
Ma, Weizhi ;
Liu, Yiqun ;
Ma, Shaoping .
WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, :2400-2410
[7]   Sequence-Aware Factorization Machines for Temporal Predictive Analytics [J].
Chen, Tong ;
Yin, Hongzhi ;
Quoc Viet Hung Nguyen ;
Peng, Wen-Chih ;
Li, Xue ;
Zhou, Xiaofang .
2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, :1405-1416
[8]   TADA: Trend Alignment with Dual-Attention Multi-Task Recurrent Neural Networks for Sales Prediction [J].
Chen, Tong ;
Yin, Hongzhi ;
Chen, Hongxu ;
Wu, Lin ;
Wang, Hao ;
Zhou, Xiaofang ;
Li, Xue .
2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, :49-58
[9]  
Cheng H.-T., 2016, P 1 WORKSH DEEP LEAR, P7
[10]   Aspect-Aware Latent Factor Model: Rating Prediction with Ratings and Reviews [J].
Cheng, Zhiyong ;
Ding, Ying ;
Zhu, Lei ;
Kankanhalli, Mohan .
WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, :639-648