EHHR: an efficient evolutionary hyper-heuristic based recommender framework for short-text classifier selection

被引:2
作者
Almas, Bushra [1 ,2 ]
Mujtaba, Hasan [1 ]
Khan, Kifayat Ullah [1 ]
机构
[1] Natl Univ Comp & Emerging Sci, Dept Comp Sci, AK Brohi Rd,H-11-4, Islamabad, Pakistan
[2] Quaid I Azam Univ, Inst Informat Technol, Islamabad, Pakistan
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2023年 / 26卷 / 02期
关键词
Machine learning; Social media; Hyper-heuristics; Short-text classification; Evolutionary algorithm; LEXICAL RICHNESS; ALGORITHM; LANGUAGE;
D O I
10.1007/s10586-022-03754-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With various machine learning heuristics, it becomes difficult to choose an appropriate heuristic to classify short-text emerging from various social media sources in the form of tweets and reviews. The No Free Lunch theorem asserts that no heuristic applies to all problems indiscriminately. Regardless of their success, the available classifier recommendation algorithms only deal with numeric data. To cater to these limitations, an umbrella classifier recommender must determine the best heuristic for short-text data. This paper presents an efficient reminisce-enabled classifier recommender framework to recommend a heuristic for new short-text data classification. The proposed framework, "Efficient Evolutionary Hyper-heuristic based Recommender Framework for Short-text Classifier Selection (EHHR)," reuses the previous solutions to predict the performance of various heuristics for an unseen problem. The Hybrid Adaptive Genetic Algorithm (HAGA) in EHHR facilitates dataset-level feature optimization and performance prediction. HAGA reveals that the influential features for recommending the best short-text heuristic are the average entropy, mean length of the word string, adjective variation, verb variation II, and average hard examples. The experimental results show that HAGA is 80% more accurate when compared to the standard Genetic Algorithm (GA). Additionally, EHHR clusters datasets and rank heuristics cluster-wise. EHHR clusters 9 out of 10 problems correctly.
引用
收藏
页码:1425 / 1446
页数:22
相关论文
共 71 条
  • [1] Hyper-heuristic method for multilevel thresholding image segmentation
    Abd Elaziz, Mohamed
    Ewees, Ahmed A.
    Oliva, Diego
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 146
  • [2] No Free Lunch Theorem: A Review
    Adam, Stavros P.
    Alexandropoulos, Stamatios-Aggelos N.
    Pardalos, Panos M.
    Vrahatis, Michael N.
    [J]. APPROXIMATION AND OPTIMIZATION: ALGORITHMS, COMPLEXITY AND APPLICATIONS, 2019, 145 : 57 - 82
  • [3] Ahmed F, 2022, Arxiv, DOI arXiv:2202.11928
  • [4] Ali M, 2018, 2018 IEEE 8TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), P742, DOI 10.1109/CCWC.2018.8301712
  • [5] A Case-Based Meta-Learning and Reasoning Framework for Classifiers Selection
    Ali, Rahman
    Khatak, Aasad Masood
    Chow, Francis
    Lee, Sungyoung
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2018), 2018,
  • [6] Accurate multi-criteria decision making methodology for recommending machine learning algorithm
    Ali, Rahman
    Lee, Sungyoung
    Chung, Tae Choong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 71 : 257 - 278
  • [7] Alsmadi Issa M., 2020, International Journal of Web Engineering and Technology, V15, P59
  • [8] Review of short-text classification
    Alsmadi, Issa
    Gan, Keng Hoon
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2019, 15 (02) : 155 - 182
  • [9] [Anonymous], Your machine learning and Data Science Community (no date)
  • [10] Hyper-heuristics: a survey of the state of the art
    Burke, Edmund K.
    Gendreau, Michel
    Hyde, Matthew
    Kendall, Graham
    Ochoa, Gabriela
    Oezcan, Ender
    Qu, Rong
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2013, 64 (12) : 1695 - 1724