Blog Backlinks Malicious Domain Name Detection via Supervised Learning

被引:9
作者
Alshdadi, Abdulrahman A. [1 ]
Alghamdi, Ahmed S. [2 ]
Daud, Ali [3 ]
Hussain, Saqib [4 ]
机构
[1] Univ Jeddah, Comp Sci Comp Sci & Engn CCSE, Jeddah, Saudi Arabia
[2] Univ Jeddah, Jeddah, Saudi Arabia
[3] Univ Jeddah, Coll Comp Sci & Engn, Dept Comp Sci & Artificial Intelligence, Jeddah, Saudi Arabia
[4] Int Islamic Univ, Islamabad, Pakistan
关键词
Blog Backlinks; Google Webmaster Tools; Keyword Rankings; Malicious Domain Name Detection; Social Computing; Social Media Platforms; Supervised Learning; Web Spam; WEB SPAM; CLASSIFICATION;
D O I
10.4018/IJSWIS.2021070101
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Web spam is the unwanted request on websites, low-quality backlinks, emails, and reviews which is generated by an automated program. It is the big threat for website owners; because of it, they can lose their top keywords ranking from search engines, which will result in huge financial loss to the business. Over the years, researchers have tried to identify malicious domains based on specific features. However, lighthouse plugin, Ahrefs tool, and social media platforms features are ignored. In this paper, the authors are focused on detection of the spam domain name from a mixture of legit and spam domain name dataset. The dataset is taken from Google webmaster tools. Machine learning models are applied on individual, distributed, and hybrid features, which significantly improved the performance of existing malicious domain machine learning techniques. Better accuracy is achieved for support vector machine (SVM) classifier, as compared to Naive Bayes, C4.5, AdaBoost, LogitBoost.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 31 条
[1]  
Abaqus, 2018, About us
[2]   Using Case-Based Reasoning for Phishing Detection [J].
Abutair, Hassan Y. A. ;
Belghith, Abdelfettah .
8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 :281-288
[3]  
ahrefs, 2018, US
[4]  
[Anonymous], 2011, P WWW 11
[5]  
[Anonymous], 2014, INT J COMPUT APPL
[6]   Web Spam Detection: New Classification Features Based on Qualified Link Analysis and Language Models [J].
Araujo, Lourdes ;
Martinez-Romo, Juan .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2010, 5 (03) :581-590
[7]   Heuristic nonlinear regression strategy for detecting phishing websites [J].
Babagoli, Mehdi ;
Aghababa, Mohammad Pourmahmood ;
Solouk, Vahid .
SOFT COMPUTING, 2019, 23 (12) :4315-4327
[8]  
Blackhatlinks, 2018, US
[9]  
Choi H., 2011, WEBAPPS 11 P 2 USENI, V11
[10]   Machine learning for email spam filtering: review, approaches and open research problems [J].
Dada, Emmanuel Gbenga ;
Bassi, Joseph Stephen ;
Chiroma, Haruna ;
Abdulhamid, Shafi'i Muhammad ;
Adetunmbi, Adebayo Olusola ;
Ajibuwa, Opeyemi Emmanuel .
HELIYON, 2019, 5 (06)