Making Java']JavaScript Render Decisions to Optimize Security-Oriented Crawler Process

被引:0
作者
Aktas, Onur [1 ]
Can, Ahmet Burak [2 ]
机构
[1] S4E, TR-06800 Ankara, Turkiye
[2] Hacettepe Univ, Dept Comp Engn, TR-06800 Ankara, Turkiye
关键词
Crawlers; Computer security; Web pages; Rendering (computer graphics); Libraries; Application security; Uniform resource locators; Surveys; Indexes; !text type='Java']Java[!/text; Machine learning; Crawler; cyber security; !text type='Java']Java[!/text]Script; machine learning; rendering; web application security;
D O I
10.1109/ACCESS.2024.3481646
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The widespread use of web applications requires important changes in cybersecurity to protect online services and data. In the process of identifying security vulnerabilities in web applications, a systematic approach is employed to detect and mitigate cybersecurity risks. This approach utilizes web crawlers to identify attack vectors. Traditional web crawling methods are resource-intensive and often need to be more efficient in handling dynamic JavaScript-rich content. Addressing this crucial gap, our study introduces an innovative approach to predict the necessity of JavaScript rendering, thereby enhancing the effectiveness and efficiency of security-oriented web crawlers. This approach seeks to reduce computational requirements and quicken the security evaluation process through the use of machine learning algorithms. By utilizing a dataset containing the source code from the main pages of 17,160 websites, our experimental results demonstrate a 20% reduction in execution time compared to full JavaScript rendering, indicating an improvement in resource usage without any significant reduction in coverage. Our methodology significantly improves the efficiency of security-focused web crawlers and helps security scanners to detect security risks of web applications with fewer resources.
引用
收藏
页码:161688 / 161696
页数:9
相关论文
共 35 条
[1]  
Abdullah HS, 2020, Academic Journal of Nawroz University, V9, P47, DOI [10.25007/ajnu.v9n1a532, 10.25007/ajnu.v9n1a532, DOI 10.25007/AJNU.V9N1A532]
[2]   An Efficient Focused Web Crawling Approach [J].
Aggarwal, Kompal .
SOFTWARE ENGINEERING (CSI 2015), 2019, 731 :131-138
[3]   A Systematic Literature Review on the Characteristics and Effectiveness of Web Application Vulnerability Scanners [J].
Alazmi, Suliman ;
De Leon, Daniel Conte .
IEEE ACCESS, 2022, 10 :33200-33219
[4]   Deep Reinforcement Learning for Web Crawling [J].
Avrachenkov, Konstantin ;
Borkar, Vivek ;
Patil, Kishor .
2021 SEVENTH INDIAN CONTROL CONFERENCE (ICC), 2021, :201-206
[5]   Security testing of web applications: A systematic mapping of the literature [J].
Aydos, Murat ;
Aldan, Cigdem ;
Coskun, Evren ;
Soydan, Alperen .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (09) :6775-6792
[6]  
Bin Wang, 2019, 2019 IEEE 4th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), P1524, DOI 10.1109/IAEAC47372.2019.8997964
[7]   WEB DATA MINING FOR MONITORING BUSINESS EXPORT ORIENTATION [J].
Blazquez, Desamparados ;
Domenech, Josep .
TECHNOLOGICAL AND ECONOMIC DEVELOPMENT OF ECONOMY, 2018, 24 (02) :406-428
[8]  
Brunelle JF, 2017, ACM-IEEE J CONF DIG, P1
[9]  
Chang Zixiang, 2022, CCEAI 2022: The 6th International Conference on Control Engineering and Artificial Intelligence., P21, DOI 10.1145/3522749.3523076
[10]   Web application testing: A systematic literature review [J].
Dogan, Serdar ;
Betin-Can, Aysu ;
Garousi, Vahid .
JOURNAL OF SYSTEMS AND SOFTWARE, 2014, 91 :174-201