Birds of a Feather Flock Together: Generating Pornographic and Gambling Domain Names Based on Character Composition Similarity

被引:1
作者
Cheng, Yanan [1 ]
Jiang, Hao [1 ]
Zhang, Zhaoxin [1 ]
Du, Yuejin [2 ]
Chai, Tingting [1 ]
机构
[1] Harbin Inst Technol, Fac Comp, Harbin 150001, Peoples R China
[2] Beijing Qihoo Technol Co Ltd, Beijing 100015, Peoples R China
关键词
COVID-19;
D O I
10.1155/2022/4408987
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cybercriminals often register many pornographic or gambling domains (known as abusive domains) with similar character compositions in bulk to reduce their investment in buying domains and make it easier for clients to remember and spread them. Therefore, this study combines the ideas of text similarity and text generation and proposes an abusive domain generation model based on GRU for rapidly generating new abusive domain names from known ones. Additionally, we develop a two-layer detection system for pornography and gambling domains using fastText and CNN models to obtain an abusive domain dataset for model training and validation. In the end, our detection system identifies pornographic and gambling domains with 99% precision while balancing correctness and speed. By inputting 40,000 random keywords into the abusive domain generation model, we obtained 130,220 online domains that served web pages, of which about 66% were pornographic or gambling domains. The results show that by exploiting cybercriminals' behaviors in registering abusive domain names, such as bulk registration of similar domain names, we can prospectively acquire a large number of new abusive domains based on known ones. This study demonstrates that predicting new abusive domains not only expands the domain blacklist but also allows researchers to target the generated suspicious domains and dispose of them in time before they show abusive behavior.
引用
收藏
页数:17
相关论文
共 36 条
[1]  
[Anonymous], 2022, EUCLIDEAN DISTANCE W
[2]  
[Anonymous], 2022, FASTTEXT PYPI
[3]  
[Anonymous], 2022, FXSJY JIEBA JIEBA CH
[4]  
[Anonymous], 2022, ALEXA RANKING WEBSIT
[5]  
[Anonymous], 2022, DOWNLOAD LIST ALL DO
[6]  
[Anonymous], 2022, PORNOGRAPHY IS BOOMI
[7]  
[Anonymous], 2022, REPORTING ABUSIVE DO
[8]  
[Anonymous], 2022, selenium
[9]  
[Anonymous], 2022, REQUESTS HTTP HUMANS
[10]   Internet and Pornography Use During the COVID-19 Pandemic: Presumed Impact and What Can Be Done [J].
Awan, Hashir Ali ;
Aamir, Alifiya ;
Diwan, Mufaddal Najmuddin ;
Ullah, Irfan ;
Pereira-Sanchez, Victor ;
Ramalho, Rodrigo ;
Orsolini, Laura ;
de Filippis, Renato ;
Ojeahere, Margaret Isioma ;
Ransing, Ramdas ;
Vadsaria, Aftab Karmali ;
Virani, Sanya .
FRONTIERS IN PSYCHIATRY, 2021, 12