E-Mail Spam Detection Based on Part of Speech Tagging

被引:0
作者
Parsaei, Mohammad Reza [1 ]
Salehi, Mohammad [1 ]
机构
[1] Shiraz Univ Technol, Sch Comp Sci & IT, Shiraz, Iran
来源
2015 2ND INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED ENGINEERING AND INNOVATION (KBEI) | 2015年
关键词
K-Mean algorithm; Spam e-mail; data mining; pus tagging; vector model;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Ever since the emails became well-known tools in communication field, the problem of spams was associated with them. One of the most significant methods for filtering such junk email is diagnostic of those e-mails by applying some especial technics named as Data-Mining. In the presented paper, a new approach based on this strategy that how frequently words are repeated is proposed in which the key words in the evidence are found by usage of their repetition number (frequency). The key sentences, those with the key words, of the incoming e-mails have to be tagged and thereafter the grammatical roles of the entire words in the sentence need to be determined, finally they will be put together in a vector in order to indicate the similarity between the received emails. The proposed paper takes advantage of an extraordinary algorithm called K-Mean algorithm to classify the received e-mails. It is worthwhile to note that the so-called K-Mean algorithm follows some simple and understandable rules which are too easy to work with and this stands as a great privilege for this paper. The precision of the applied algorithm in diagnostic of the e-mails is 83 percent.
引用
收藏
页码:1010 / 1013
页数:4
相关论文
共 42 条
  • [21] Multiple criteria programming models for VIP E-Mail behavior analysis
    Zhang P.
    Zhu X.
    Zhang Z.
    Shi Y.
    Web Intelligence and Agent Systems, 2010, 8 (01): : 69 - 78
  • [22] Tree-based text stream clustering with application to spam mail classification
    Taninpong, Phimphaka
    Ngamsuriyaroj, Sudsanguan
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2018, 10 (04) : 353 - 370
  • [23] Advances in spam detection for email spam, web spam, social network spam, and review spam: ML-based and nature-inspired-based techniques
    Akinyelu, Andronicus A.
    JOURNAL OF COMPUTER SECURITY, 2021, 29 (05) : 473 - 529
  • [24] A novel approach of mining write-prints for authorship attribution in e-mail forensics
    Iqbal, Farkhund
    Hadjidj, Rachid
    Fung, Benjamin C. M.
    Debbabi, Mourad
    DIGITAL INVESTIGATION, 2008, 5 (S42-S51) : S42 - S51
  • [25] Fast and effective spam sender detection with granular SVM on highly imbalanced mail server behavior data
    Tang, Yuchun
    Krasser, Sven
    Judge, Paul
    2006 INTERNATIONAL CONFERENCE ON COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, 2006, : 18 - +
  • [26] A Data Mining Based Spam Detection System for YouTube
    Chowdury, Rashid
    Adnan, Md. Nuruddin Monsur
    Mahmud, G. A. N.
    Rahman, Rashedur M.
    2013 EIGHTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM), 2013, : 373 - 378
  • [27] Support Vector Machine Based Spam SMS Detection
    Tekerek, Adem
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2019, 22 (03): : 779 - 784
  • [28] A new multi-criteria quadratic-programming linear classification model for VIP E-mail analysis
    Zhang, Peng
    Zhang, Juliang
    Shi, Yong
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 2, PROCEEDINGS, 2007, 4488 : 499 - +
  • [29] Graph-based KNN Algorithm for Spam SMS Detection
    Tran Phuc Ho
    Kang, Ho-Seok
    Kim, Sung-Ryul
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2013, 19 (16) : 2404 - 2419
  • [30] Web spam detection based on discriminative content and link features
    Mahmoudi M.
    Yari A.
    Khadivi S.
    2010 5th International Symposium on Telecommunications, IST 2010, 2010, : 542 - 546