Enhancing Software Defect Prediction accuracy using Modified Entropy Calculation in Random Forest Algorithm

被引:0
|
作者
Suryawanshi, Ranjeetsingh [1 ]
Kadam, Amol [1 ]
机构
[1] Bharati Vidyapeeth Deemed Be Univ, Coll Engn, Pune, India
关键词
Random forest; decision tree; classification; prediction; entropy; Taylor series; NETWORKS;
D O I
10.52783/jes.754
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Imagine you are trying to classify software defect for a large dataset. How will you choose the best algorithm to do that? For the above problem we have various algorithms like Random Forest, Support Vector Machine, Neural Networks, Naive Bayes, K -Nearest Neighbours, Decision Tree, Logistic Regression etc. One of the most used methods is Random Forest algorithm, which uses multiple Decision Trees to make predictions. However, this algorithm relies on a complex calculation called Entropy, which measures the uncertainty in the data. Entropy function that uses natural logarithm which may be time consuming calculation. Is there a better way to calculate entropy? In this research, have explored a different way to calculate the natural logarithm using the Taylor series expression. It is a series consisting of sum of infinite terms that approximates any function by using its derivatives. We further modified the Random Forest algorithm by replacing the natural logarithm the Taylor series expression in the Entropy formula. We tested our modified algorithm on dataset and compared its performance with the original Entropy formula. We found that our modification in the algorithm has improved the accuracy of the algorithm on software defect prediction.
引用
收藏
页码:84 / 91
页数:8
相关论文
共 50 条
  • [21] Prediction of Permeability Using Random Forest and Genetic Algorithm Model
    Wang, Junhui
    Yan, Wanzi
    Wan, Zhijun
    Wang, Yi
    Lv, Jiakun
    Zhou, Aiping
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2020, 125 (03): : 1135 - 1157
  • [22] Optimizing Air Pollution Prediction With Random Forest Algorithm
    Singh, Sukhendra
    Kumar, Manoj
    Verma, Birendra Kumar
    Kumar, Sushil
    AEROSOL SCIENCE AND ENGINEERING, 2025,
  • [23] Performance Analysis of Heart Disease Prediction System using Novel Random Forest Over Naive Bayes Algorithm with an Improved Accuracy Rate
    Poojitha, T.
    Mahaveerakannan, R.
    CARDIOMETRY, 2022, (25): : 1562 - 1569
  • [24] Disease Prediction: Smart Disease Prediction System using Random Forest Algorithm
    Swarupa, A. N. V. K.
    Sree, V. Heina
    Nookambika, S.
    Kishore, Y. Kiran Sai
    Teja, U. Ravi
    2021 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, SMART AND GREEN TECHNOLOGIES (ICISSGT 2021), 2021, : 48 - 51
  • [25] Epileptic Seizure Prediction from the Scalp EEG Signals by using Random Forest Algorithm
    Hu, Ziyu
    Han, Chunxiao
    Guo, Fengjuan
    Qin, Qing
    Li, Shanshan
    Qin, Yingmei
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 669 - 674
  • [26] Fast Defect Detection Algorithm on the Variety Surface with Random Forest using GPUs
    Kwon, Bae-guen
    Kang, Dong-joong
    2011 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2011, : 1135 - 1136
  • [27] An Analysis of Software Bug Reports Using Random Forest
    Ha Manh Tran
    Sinh Van Nguyen
    Synh Viet Uyen Ha
    Thanh Quoc Le
    FUTURE DATA AND SECURITY ENGINEERING, FDSE 2018, 2018, 11251 : 273 - 285
  • [28] Prediction of the concrete compressive strength using improved random forest algorithm
    Khodaparasti M.
    Alijamaat A.
    Pouraminian M.
    Journal of Building Pathology and Rehabilitation, 2023, 8 (2)
  • [29] Prediction of IIoT traffic using a modified whale optimization approach integrated with random forest classifier
    Ikram, Sumaiya Thaseen
    Priya, V
    Anbarasu, B.
    Cheng, Xiaochun
    Ghalib, Muhammad Rukunuddin
    Shankar, Achyut
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (08): : 10725 - 10756
  • [30] Controller Monitoring System In Software Defined Networks Using Random Forest Algorithm
    Kirutika, K.
    Vetriselvi, V.
    Parthasarathi, Ranjani
    Rao, G. Subrahmanya V. R. K.
    2019 IEEE 53RD INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST 2019), 2019,