TEXT MESSAGE CORPUS: APPLYING NATURAL LANGUAGE PROCESSING TO MOBILE DEVICE FORENSICS

Cited by: 0
Authors
O'Day, Daniel R. [1]
Calix, Ricardo A. [1]
Affiliations
[1] Purdue Univ Calumet, Hammond, IN 46323 USA
Source
ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW) | 2013
Keywords
Short Message Analysis; Text Message Corpus; Natural Language Processing; Semantic Analysis
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The average mobile device user sends a large quantity of text and other short messages. These text message data are of great value to law enforcement investigators who may be analyzing a suspect's mobile device or social media profile for evidence of criminal activity. Current tools and methodologies for analyzing text and other short message data generally allow only simple keyword searches, which are often time-consuming for law enforcement investigators. In addition, few corpora containing text message data are available. An initial corpus of text message data for experimental purposes has been developed and made available to the research community. A simple methodology is proposed for feature extraction. The format of the data is given, along with basic statistics, suggestions for possible use, and future work.
Pages: 6