Filtering objectionable information access based on click-through behaviours with deep learning methods

被引:0
|
作者
Lee, Lung-Hao [1 ]
Li, Jian-Hong [2 ]
Ku, Szu-Wei [3 ]
Tseng, Yuen-Hsien [4 ,5 ]
机构
[1] Natl Cent Univ, Dept Elect Engn, Taipei, Taiwan
[2] Natl Cent Univ, Dept Elect Engn, Taoyuan, Taiwan
[3] Natl Cent Univ, Dept Elect Engn, Taoyuan, Taiwan
[4] Natl Taiwan Normal Univ, Grad Inst Lib & Informat Studies, New Taipei, Taiwan
[5] Natl Taiwan Normal Univ, Grad Inst Lib & Informat Studies, 162,Sect 1,Heping East Rd, Taipei 106, Taiwan
关键词
Click-through data; objectionable content filtering; online information access; user behaviours; web content categorization;
D O I
10.1177/01655515231160041
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study explores URL click-through behaviour to predict the category of users' online information accesses and applies the results to progressively filter objectionable accesses during web surfing. Each clicked URL is represented by the embedding technique and fed into the Bidirectional Long Short-Term Memory neural network cascaded with a Conditional Random Field (BiLSTM-CRF) model to predict the category of a user's access. Large-scale experiments on click-through data from nearly one million real users show that our proposed BiLSTM-CRF model achieves promising results. The proposed method outperforms related approaches by a high accuracy of 0.9492 (near 27% relative improvement) for context-aware category prediction and an F1-score of 0.8995 (about 29% relative improvement) for objectionable access identification. In addition, in real-time filtering simulations, our model gradually achieves a macro-averaging blocking rate of 0.9221, while maintaining a favourably low false-positive rate of 0.0041.
引用
收藏
页数:15
相关论文
共 11 条
  • [1] Learning to rank with click-through features in a reinforcement learning framework
    Keyhanipour, Amir Hosein
    Moshiri, Behzad
    Piroozmand, Maryam
    Oroumchian, Farhad
    Moeini, Ali
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2016, 12 (04) : 448 - 476
  • [2] Approach for collection selection based on click-through data
    Chen, L. (lingchen@cs.zju.edu.cn), 1600, Zhejiang University (47): : 23 - 28+161
  • [3] CF-Rank: Learning to rank by classifier fusion on click-through data
    Keyhanipour, Amir Hosein
    Moshiri, Behzad
    Rahgozar, Maseud
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) : 8597 - 8608
  • [4] Modeling Click-through Based Word-pairs for Web Search
    Jagarlamudi, Jagadeesh
    Gao, Jianfeng
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 483 - 492
  • [6] Learning to rank: new approach with the layered multi-population genetic programming on click-through features
    Keyhanipour, Amir Hosein
    Moshiri, Behzad
    Oroumchian, Farhad
    Rahgozar, Maseud
    Badie, Kambiz
    GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 2016, 17 (03) : 203 - 230
  • [7] Learning to rank: new approach with the layered multi-population genetic programming on click-through features
    Amir Hosein Keyhanipour
    Behzad Moshiri
    Farhad Oroumchian
    Maseud Rahgozar
    Kambiz Badie
    Genetic Programming and Evolvable Machines, 2016, 17 : 203 - 230
  • [9] Click-through-based Subspace Learning for Image Search
    Pan, Yingwei
    Yao, Ting
    Tian, Xinmei
    Li, Houqiang
    Ngo, Chong-Wah
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 233 - 236
  • [10] Click-through-based Cross-view Learning for Image Search
    Pan, Yingwei
    Yao, Ting
    Mei, Tao
    Li, Houqiang
    Ngo, Chong-Wah
    Rui, Yong
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 717 - 726