A fuzzy Dempster-Shafer classifier for detecting Web spams

被引:8
作者
Chatterjee, Moitrayee [1 ]
Namin, Akbar Siami [2 ]
机构
[1] New Jersey City Univ, Comp Sci Dept, 2039 John F Kennedy Blvd, Jersey City, NJ 07305 USA
[2] Texas Tech Univ, Comp Sci Dept, 1012 Boston Ave, Lubbock, TX 79409 USA
基金
美国国家科学基金会;
关键词
Dempster-Shafer Theory; Dempster-Shafer Combination; Basic probability assignment; Mass function; Belief; Plausibility; Fuzzy reasoning; Classification; Web spam;
D O I
10.1016/j.jisa.2021.102793
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Web spam identification problem can be modeled as an instance of the conventional classification problem. Web spams aim at deceiving web crawlers by advertising certain Web pages through elevation of their page rankings superficially than their actual weights. Web spams are intended to produce fraudulent results of web search queries and degenerate the client's experience by directing users to fake Web pages. We present a fuzzy evidence-based methodology for identifying Web spams by which the spamicity of web hosts is formulated as a reasoning problem in the presence of uncertainty. However, any classification task intrinsically suffers from incomplete or vague evidence and ambiguity in the class assignment based on evidence. In this work, we combine fuzzy reasoning as the decision maker for selecting the most suitable evidence in a multi-source Dempster-Shafer (DS) based classification algorithm. The introduced approach has the benefit of providing more reliable solution to detect spams without any prior information. The evidence theory offers flexible support that takes into account the multi-dimensional nature of implementation decisions. The experimental results show that the fuzzy reasoning in combination with DS theory, reduces the conflicts among evidence leading to enhanced classification results. The aim of this paper is to describe the potential of fuzzy reasoning and the Dempster-Shafer Theory (DST) as a decision model for the web spams classification problem.
引用
收藏
页数:9
相关论文
共 39 条
[11]  
EIRON N., 2004, P 13 INT C WORLD WID, P309, DOI DOI 10.1145/988672.988714
[12]  
ELISABETTA B, 1999, INT J INTELL SYST, V0014, P00559
[13]  
ENRIQUE HV, 2007, INT J UNCERTAIN FUZZ, V0015, P00225
[14]  
FERNANDEZ GSA, 1999, FUZZY SET SYST, V0106, P00179
[15]  
FUYUAN X, 2020, IEEE T FUZZY SYST
[16]  
Fuyuan Xiao, 2020, IEEE T NEURAL NETW L
[17]  
Glenn S., 1976, MATH THEORY EVIDENCE, V42
[18]  
Hall DL, 1997, P IEEE, V85, P6, DOI 10.1109/ISCAS.1998.705329
[19]   Choice processes for non-homogeneous group decision making in linguistic setting [J].
Herrera, F ;
Herrera-Viedma, E ;
Verdegay, JL .
FUZZY SETS AND SYSTEMS, 1998, 94 (03) :287-308
[20]  
JAMES MGK, 1994, FUZZY SET SYST, V0065, P00273