FilteredWeb: A Framework for the Automated Search-Based Discovery of Blocked URLs

被引:0
|
作者
Darer, Alexander [1 ]
Farnan, Oliver [1 ]
Wright, Joss [2 ]
机构
[1] Univ Oxford, Dept Comp Sci, Oxford, England
[2] Univ Oxford, Oxford Internet Inst, Oxford, England
来源
TMA CONFERENCE 2017 - PROCEEDINGS OF THE 1ST NETWORK TRAFFIC MEASUREMENT AND ANALYSIS CONFERENCE | 2017年
基金
英国工程与自然科学研究理事会;
关键词
censorship; filtering; DNS; Chinese Internet; search; CHINA; CENSORSHIP;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Various methods have been proposed for creating and maintaining lists of potentially filtered URLs to allow for measurement of ongoing internet censorship around the world. Whilst testing a known resource for evidence of filtering can be relatively simple, given appropriate vantage points, discovering previously unknown filtered web resources remains an open challenge. We present a novel framework for automating the process of discovering filtered resources through the use of adaptive queries to well-known search engines. Our system applies information retrieval algorithms to isolate characteristic linguistic patterns in known filtered web pages; these are used as the basis for web search queries. The resulting URLs of these searches are checked for evidence of filtering, and newly discovered blocked resources will be fed back into the system to detect further filtered content. Our implementation of this framework, applied to China as a case study, shows the approach is demonstrably effective at detecting significant numbers of previously unknown filtered web pages, making a significant contribution to the ongoing detection of internet filtering as it develops. When deployed, this system was used to discover 1355 poisoned domains within China as of Feb 2017-30 times more than in the most widely-used published filter list of the time. Of these, 759 are outside of the Alexa Top 1000 domains list, demonstrating the capability of this framework to find more obscure filtered content. Further, our initial analysis of filtered URLs, and the search terms that were used to discover them, gives further insight into the nature of the content currently being blocked in China.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Experience Paper: Search-based Testing in Automated Driving Control Applications
    Gladisch, Christoph
    Heinz, Thomas
    Heinzemann, Christian
    Oehlerking, Jens
    von Vietinghoff, Anne
    Pfitzer, Tim
    34TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE 2019), 2019, : 26 - 37
  • [22] Automated generation of presentations through a search-based software visualization system
    Adobbati, R
    HUMAN-COMPUTER INTERACTION - INTERACT '99, 1999, : 663 - 665
  • [23] Search-based optimization
    Wheeler, WC
    CLADISTICS-THE INTERNATIONAL JOURNAL OF THE WILLI HENNIG SOCIETY, 2003, 19 (04): : 348 - 355
  • [24] Self-Adaptive Systems Framework Based on Agent and Search-Based Optimization
    He, Liu
    Li, Qingshan
    Wang, Lu
    Wan, Jiewen
    2017 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), 2017, : 557 - 558
  • [25] Search-Based Algorithm With Scatter Search Strategy for Automated Test Case Generation of NLP Toolkit
    Liu, Fangqing
    Huang, Han
    Yang, Zhongming
    Hao, Zhifeng
    Wang, Jiangping
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2021, 5 (03): : 491 - 503
  • [26] EMOA*: A framework for search-based multi-objective path planning
    Ren, Zhongqiang
    Hernandez, Carlos
    Likhachev, Maxim
    Felner, Ariel
    Koenig, Sven
    Salzman, Oren
    Rathinam, Sivakumar
    Choset, Howie
    ARTIFICIAL INTELLIGENCE, 2025, 339
  • [27] Search-based framework for transparent non-overlapping ensemble models
    Gulowaty, Bogdan
    Wozniak, Michal
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [28] Search-Based Automated Play Testing of Computer Games: A Model-Based Approach
    Ferdous, Raihana
    Kifetew, Fitsum
    Prandi, Davide
    Prasetya, I. S. W. B.
    Shirzadehhajimahmood, Samira
    Susi, Angelo
    SEARCH-BASED SOFTWARE ENGINEERING (SSBSE 2021), 2021, 12914 : 56 - 71
  • [29] Search-based Adaptation Planning Framework for Self-Adaptive Systems
    Wang, Lu
    PROCEEDINGS OF THE 2017 IEEE/ACM 39TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C 2017), 2017, : 465 - 466
  • [30] A Model Independent S/W Framework for Search-Based Software Testing
    Oh, Jungsup
    Baik, Jongmoon
    Lim, Sung-Hwa
    SCIENTIFIC WORLD JOURNAL, 2014,