Random Interaction Forest (RIF)-A Novel Machine Learning Strategy Accounting for Feature Interaction

被引:5
|
作者
Guo, Chao-Yu [1 ]
Lin, Yi-Jyun [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Inst Publ Hlth, Coll Med, Div Biostat & Data Sci, Taipei 112304, Taiwan
关键词
Interaction; random forest; linear regression; logistic regression; machine learning;
D O I
10.1109/ACCESS.2022.3233194
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
If an interaction exists in medical and health sciences, a proper statistical approach is required to avoid an erroneous conclusion. For example, different genders may introduce modified therapeutic effects of drugs, or an adverse interaction between two medicines changes the pharmacological activity, reduces the therapeutic effect, or induces toxicity. Therefore, if the analysis does not account for the impact of the interaction, it may introduce significant prediction errors or bias. Regression models deal with a two-way interaction by adding the product of the two interactive variables. Since machine learning models demonstrate a superior predictive ability to regression models, this study proposes a new method based on the random forest to account for interaction, called random interaction forest (RIF). This new strategy modifies the structure of the random forest, where the interaction features are forced to be in the first two nodes. Simulation studies examined the predictive ability of the linear regression model, logistic regression model, random forest, and the RIF under various scenarios. The results showed that the RIF consistently outperforms random forest and logistic regression when interactions are present. The RIF also performs better in many scenarios than the linear regression model. When the effect of interaction is more significant, the performance of RIF could be superior.
引用
收藏
页码:1806 / 1813
页数:8
相关论文
共 50 条
  • [41] Leptospirosis modelling using hydrometeorological indices and random forest machine learning
    Veianthan Jayaramu
    Zed Zulkafli
    Simon De Stercke
    Wouter Buytaert
    Fariq Rahmat
    Ribhan Zafira Abdul Rahman
    Asnor Juraiza Ishak
    Wardah Tahir
    Jamalludin Ab Rahman
    Nik Mohd Hafiz Mohd Fuzi
    International Journal of Biometeorology, 2023, 67 : 423 - 437
  • [42] Machine Learning for Drug-Target Interaction Prediction
    Chen, Ruolan
    Liu, Xiangrong
    Jin, Shuting
    Lin, Jiawei
    Liu, Juan
    MOLECULES, 2018, 23 (09):
  • [43] Inferring colloidal interaction from scattering by machine learning
    Tung, Chi-Huan
    Chang, Shou-Yi
    Chang, Ming-Ching
    Carrillo, Jan-Michael
    Sumpter, Bobby
    Do, Changwoo
    Chen, Wei-Ren
    CARBON TRENDS, 2023, 10
  • [44] A novel focus encoding scheme for addressee detection in multiparty interaction using machine learning algorithms
    Malik, Usman
    Barange, Mukesh
    Saunier, Julien
    Pauchet, Alexandre
    JOURNAL ON MULTIMODAL USER INTERFACES, 2021, 15 (02) : 175 - 188
  • [45] Predicting network of drug-enzyme interaction based on machine learning method
    Niu, Bing
    Zhang, Yuchao
    Ding, Juan
    Lu, Yin
    Wang, Miao
    Lu, Wencong
    Yuan, Xiaochen
    Yin, Jinyuan
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2014, 1844 (01): : 214 - 223
  • [46] Multifidelity aerodynamic flow field prediction using random forest-based machine learning
    Nagawkar, Jethro
    Leifsson, Leifur
    AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 123
  • [47] Prediction & optimization of alkali-activated concrete based on the random forest machine learning algorithm
    Sun, Yubo
    Cheng, Hao
    Zhang, Shizhe
    Mohan, Manu K.
    Ye, Guang
    De Schutter, Geert
    CONSTRUCTION AND BUILDING MATERIALS, 2023, 385
  • [48] Heterogeneous feature fusion based machine learning strategy for ECG diagnosis
    Ren, He
    Sun, Qi
    Xiao, Zhengguang
    Yu, Miao
    Wang, Siqi
    Yuan, Linrong
    Li, Yiming
    Tu, Huating
    Tu, Mengting
    Yang, Hui
    Li, Ping
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 271
  • [49] A novel focus encoding scheme for addressee detection in multiparty interaction using machine learning algorithms
    Usman Malik
    Mukesh Barange
    Julien Saunier
    Alexandre Pauchet
    Journal on Multimodal User Interfaces, 2021, 15 : 175 - 188
  • [50] A combined strategy of feature selection and machine learning to identify predictors of prediabetes
    De Silva, Kushan
    Jonsson, Daniel
    Demmer, Ryan T.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (03) : 396 - 406