High-performance Classification of Phishing URLs Using a Multi-modal Approach with MapReduce

被引:4
作者
Shrestha, Niju [1 ]
Kharel, Rajan Kumar [1 ]
Britt, Jason [1 ]
Hasan, Ragib [1 ]
机构
[1] Univ Alabama Birmingham, Dept Comp & Informat Sci, Birmingham, AL 35294 USA
来源
2015 IEEE World Congress on Services | 2015年
关键词
Phishing; Map Reduce; Color code;
D O I
10.1109/SERVICES.2015.38
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Classifying phishing websites can be expensive both computationally and financially given a large enough volume of suspect sites. A distributed cloud environment can reduce the computational time and financial cost significantly. To test this idea, we apply a multi-modal feature classification algorithm to classify phishing websites in a non-distributed and several distributed environments. A multi-modal approach combines both visual and text features for classification. The implementation extracts color feature and histogram feature from the screenshot of a phishing website and text from its html source code. Feature extraction and comparison is accomplished by applying the MapReduce framework. Implementing the multimodal approach in a distributed environment proves to reduce the runtime as well as the financial costs. We present results that show our work is 30 times faster than existing state of the art systems in phishing website classification problem.
引用
收藏
页码:206 / 212
页数:7
相关论文
共 16 条
  • [1] Alkhozae M. G., 2011, INT J INFORM COMMUNI
  • [2] [Anonymous], P 5 INT C INT MON PR
  • [3] Basnet R, 2008, STUD FUZZ SOFT COMP, V226, P373, DOI 10.1007/978-3-540-77465-5_19
  • [4] Britt J., 2012, P 4 USENIX WORKSH LA, P10
  • [5] CORDERO A, 2007, P 16 INT C WORLD WID
  • [6] Mapreduce: Simplified data processing on large clusters
    Dean, Jeffrey
    Ghemawat, Sanjay
    [J]. COMMUNICATIONS OF THE ACM, 2008, 51 (01) : 107 - 113
  • [7] Deng X., 2006, IEEE T DEPENDABLE SE
  • [8] Gyawali B., 2011, PROC ANN CEAS C, P176
  • [9] Kerr D. A., 2010, COLORIMETRY, V1, P1
  • [10] Suriya R., 2009, P 2 INT C CYPR TURK