Multi-Type Feature Extraction and Early Fusion Framework for SMS Spam Detection

被引:5
作者
Al-Kabbi, Hussein Alaa [1 ]
Feizi-Derakhshi, Mohammad-Reza [1 ]
Pashazadeh, Saeid [2 ]
机构
[1] Univ Tabriz, Dept Comp Engn, Computerized Intelligence Syst Lab, Tabriz 5166616471, Iran
[2] Univ Tabriz, Dept Comp Engn, Tabriz 5166616471, Iran
关键词
Feature extraction; Deep learning; Task analysis; Text categorization; Hidden Markov models; Classification algorithms; Data integration; CNN; data fusion; deep learning; LSTM; SMS spam detection; LSTM; CLASSIFICATION;
D O I
10.1109/ACCESS.2023.3327897
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
SMS spam is a pervasive issue that affects millions worldwide, leading to significant inconvenience, time wastage, and potential financial scams. Given the prevalence and potential harm, accurate and real-time detection of SMS spam is crucial. This paper proposes a novel approach to SMS spam detection involving five steps: preprocessing, feature extraction, feature fusion, feature selection, and classification. Our model is designed to simultaneously capture local, temporal, and global text message features using a hybrid deep learning model to enhance feature representation. We evaluated our model using the UCI dataset, comparing it with traditional and deep learning algorithms such as RF and BERT using cross-validation to ensure the robustness of our results. Our proposed method exhibited superior performance, achieving a good accuracy of 99.56%, surpassing other methods. The effectiveness of this method in SMS spam detection proved its potential for real-world implementation, where it could substantially mitigate the prevalence and impact of SMS spam.
引用
收藏
页码:123756 / 123765
页数:10
相关论文
共 49 条
[1]   A review of soft techniques for SMS spam classification: Methods, approaches and applications [J].
Abayomi-Alli, Olusola ;
Misra, Sanjay ;
Abayomi-Alli, Adebayo ;
Odusami, Modupe .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 86 :197-212
[2]   Immunocomputing-Based Approach for Optimizing the Topologies of LSTM Networks [J].
Al Bataineh, Ali ;
Kaur, Devinder .
IEEE ACCESS, 2021, 9 :78993-79004
[3]   Interpretable Multimodal Sentiment Classification Using Deep Multi-View Attentive Network of Image and Text Data [J].
Al-Tameemi, Israa Khalaf Salman ;
Feizi-Derakhshi, Mohammad-Reza ;
Pashazadeh, Saeid ;
Asadpour, Mohammad .
IEEE ACCESS, 2023, 11 :91060-91081
[4]   Data Fusion and IoT for Smart Ubiquitous Environments: A Survey [J].
Alam, Furqan ;
Mehmood, Rashid ;
Katib, Iyad ;
Albogami, Nasser N. ;
Albeshri, Aiiad .
IEEE ACCESS, 2017, 5 :9533-9554
[5]  
Almeida TA, 2011, DOCENG 2011: PROCEEDINGS OF THE 2011 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, P259
[6]   Review of deep learning: concepts, CNN architectures, challenges, applications, future directions [J].
Alzubaidi, Laith ;
Zhang, Jinglan ;
Humaidi, Amjad J. ;
Al-Dujaili, Ayad ;
Duan, Ye ;
Al-Shamma, Omran ;
Santamaria, J. ;
Fadhel, Mohammed A. ;
Al-Amidie, Muthana ;
Farhan, Laith .
JOURNAL OF BIG DATA, 2021, 8 (01)
[7]   Attention and Feature Selection for Automatic Speech Emotion Recognition Using Utterance and Syllable-Level Prosodic Features [J].
Ben Alex, Starlet ;
Mary, Leena ;
Babu, Ben P. .
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (11) :5681-5709
[8]   Data Fusion [J].
Bleiholder, Jens ;
Naumann, Felix .
ACM COMPUTING SURVEYS, 2008, 41 (01) :1-41
[9]   Feature selection for high-dimensional data [J].
Bolón-Canedo V. ;
Sánchez-Maroño N. ;
Alonso-Betanzos A. .
Progress in Artificial Intelligence, 2016, 5 (02) :65-75
[10]   Comparing Deep Learning and Shallow Learning Techniques for API Calls Malware Prediction: A Study [J].
Cannarile, Angelo ;
Dentamaro, Vincenzo ;
Galantucci, Stefano ;
Iannacone, Andrea ;
Impedovo, Donato ;
Pirlo, Giuseppe .
APPLIED SCIENCES-BASEL, 2022, 12 (03)