Random Interaction Forest (RIF): A Novel Machine Learning Strategy Accounting for Feature Interaction

Cited by: 5
Authors
Guo, Chao-Yu [1]
Lin, Yi-Jyun [1]
Affiliations
[1] Natl Yang Ming Chiao Tung Univ, Inst Publ Hlth, Coll Med, Div Biostat & Data Sci, Taipei 112304, Taiwan
Keywords
Interaction; random forest; linear regression; logistic regression; machine learning
DOI
10.1109/ACCESS.2022.3233194
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
When variables interact in medical and health sciences data, a proper statistical approach is required to avoid erroneous conclusions. For example, the therapeutic effect of a drug may differ by gender, or an adverse interaction between two medicines may change the pharmacological activity, reduce the therapeutic effect, or induce toxicity. An analysis that does not account for the interaction may therefore introduce substantial prediction error or bias. Regression models handle a two-way interaction by adding the product of the two interacting variables as a term. Since machine learning models demonstrate predictive ability superior to that of regression models, this study proposes a new method based on the random forest to account for interaction, called the random interaction forest (RIF). The strategy modifies the structure of the random forest so that the interaction features are forced into the first two nodes. Simulation studies examined the predictive ability of the linear regression model, the logistic regression model, the random forest, and the RIF under various scenarios. The results showed that the RIF consistently outperforms the random forest and logistic regression when interactions are present. The RIF also performs better than the linear regression model in many scenarios. When the interaction effect is larger, the advantage of the RIF can be greater.
Pages: 1806-1813
Number of pages: 8