Spatio-temporal Mining of Software Adoption & Penetration

被引:0
|
作者
Papalexakis, Evangelos E. [1 ]
Dumitras, Tudor [2 ]
Chau, Duen Horng [3 ]
Prakash, B. Aditya [4 ]
Faloutsos, Christos [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Symantec Res Labs, Culver City, CA USA
[3] Georgia Tech, Atlanta, GA 30332 USA
[4] Virginia Tech, Blacksburg, VA 24061 USA
关键词
Malware Propagation; Internet Security; Data Analysis;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
How does malware propagate? Does it form spikes over time? Does it resemble the propagation pattern of benign files, such as software patches? Does it spread uniformly over countries? How long does it take for a URL that distributes malware to be detected and shut down? In this work, we answer these questions by analyzing patterns from 22 million malicious (and benign) files, found on 1.6 million hosts worldwide during the month of June 2011. We conduct this study using the WINE database available at Symantec Research Labs. Additionally, we explore the research questions raised by sampling on such large databases of executables; the importance of studying the implications of sampling is twofold: First, sampling is a means of reducing the size of the database hence making it more accessible to researchers; second, because every such data collection can be perceived as a sample of the real world. Finally, we discover the SHARKFIN temporal propagation pattern of executable files, the GEOSPLIT pattern in the geographical spread of machines that report executables to Symantec's servers, the Periodic Power Law (PPL) distribution of the life-time of URLs, and we show how to efficiently extrapolate crucial properties of the data from a small sample. To the best of our knowledge, our work represents the largest study of propagation patterns of executables.
引用
收藏
页码:884 / 891
页数:8
相关论文
共 50 条
  • [1] SHARKFIN: Spatio-temporal mining of software adoption and penetration
    Papalexakis, Evangelos E.
    Dumitras, Tudor
    Chau, Duen Horng
    Prakash, B. Aditya
    Faloutsos, Christos
    SOCIAL NETWORK ANALYSIS AND MINING, 2014, 4 (01) : 1 - 15
  • [2] Software for spatio-temporal trajectory analysis and pattern mining
    Sidorova, Marina
    Pidhornyi, Pavlo
    2018 14TH INTERNATIONAL CONFERENCE ON ADVANCED TRENDS IN RADIOELECTRONICS, TELECOMMUNICATIONS AND COMPUTER ENGINEERING (TCSET), 2018, : 958 - 961
  • [3] Mining spatio-temporal data
    Gennady Andrienko
    Donato Malerba
    Michael May
    Maguelonne Teisseire
    Journal of Intelligent Information Systems, 2006, 27 : 187 - 190
  • [4] Mining spatio-temporal data
    Andrienko, Gennady
    Malerba, Donato
    May, Michael
    Teisseire, Maguelonne
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2006, 27 (03) : 187 - 190
  • [5] Mining Trajectories for Spatio-temporal Analytics
    Xing, Songhua
    Liu, Xuan
    He, Qing
    Hampapur, Arun
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 910 - 913
  • [6] Mining generalized spatio-temporal patterns
    Wang, JM
    Hsu, WN
    Lee, ML
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2005, 3453 : 649 - 661
  • [7] A survey on spatio-temporal data mining
    Vasavi M.
    Murugan A.
    Materials Today: Proceedings, 2023, 80 : 2769 - 2772
  • [8] Mining contacts from spatio-temporal trajectories
    Madanayake, Adikarige Randil Sanjeewa
    Lee, Kyungmi
    Lee, Ickjai
    AI OPEN, 2024, 5 : 197 - 207
  • [9] Mining frequent spatio-temporal sequential patterns
    Cao, HP
    Mamoulis, N
    Cheung, DW
    FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 82 - 89
  • [10] Exploratory spatio-temporal data mining and visualization
    Compieta, P.
    Di Martino, S.
    Bertolotto, M.
    Ferrucci, F.
    Kechadi, T.
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2007, 18 (03): : 255 - 279