Similarity analysis of frequent sequential activity pattern mining

被引:31
|
作者
Shou, Zhenyu [1 ]
Di, Xuan [1 ,2 ]
机构
[1] Columbia Univ, Dept Civil Engn & Engn Mech, New York, NY 10027 USA
[2] Columbia Univ, Data Sci Inst, Ctr Smart Cities, New York, NY USA
关键词
Similarity; Frequent pattern mining; Clustering; Connected vehicles; ACTIVITY SEQUENCES; TRAVEL BEHAVIOR; BIG DATA; RECOGNITION; CLASSIFICATION; TRAJECTORIES; PATH;
D O I
10.1016/j.trc.2018.09.018
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Activity pattern classification is extensively studied using multi-person single-day mobile traces. However, human mobility exhibits intra-personal variability and thus single-day activity may not fully capture one's activity patterns. This paper creates a methodological framework to analyze similarity of activity patterns using frequent sequential pattern mining when multi-person multi day data is available. Frequent sequential pattern mining discovers the frequently occurring ordered subsequences and is a natural approach of analyzing multi-day travel patterns. Prefix Span algorithm is implemented to extract frequent patterns for each individual. Then similarity measures are defined to describe the extent to which two travelers' activity patterns are alike and the regularity of how one repeats her activity patterns from day to day. Based upon the pairwise similarity values between two individuals, hierarchical clustering is conducted to divide travelers into communities. To illustrate these methodologies, 349 travelers' 19,130 travel activity sequences are extracted from the world's first connected vehicle testbed in Ann Arbor, Michigan. Three major clusters are identified. Coupled with demographics, these clusters are characterized as "seniors", "working class", and "parents", respectively. Multinomial logistic regression is employed to model to what extent the similarity of socio-demographics can explain that of travel patterns. This work can be extended to either infer an unknown user's demographics (or customer profiling) based on her activity patterns, or to reconstruct an unknown user's frequent activity patterns based on her demographics and other similar travelers' patterns.
引用
收藏
页码:122 / 143
页数:22
相关论文
共 50 条
  • [1] Weighted frequent sequential pattern mining
    Md Ashraful Islam
    Mahfuzur Rahman Rafi
    Al-amin Azad
    Jesan Ahammed Ovi
    Applied Intelligence, 2022, 52 : 254 - 281
  • [2] Weighted frequent sequential pattern mining
    Islam, Md Ashraful
    Rafi, Mahfuzur Rahman
    Azad, Al-amin
    Ovi, Jesan Ahammed
    APPLIED INTELLIGENCE, 2022, 52 (01) : 254 - 281
  • [3] Mining frequent sequential patterns under a similarity constraint
    Capelle, M
    Masson, C
    Boulicaut, JF
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 1 - 6
  • [4] Frequent sequential pattern mining under differential privacy
    Lu, Guoqing
    Zhang, Xiaojian
    Ding, Liping
    Li, Yanfeng
    Liao, Xin
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (12): : 2789 - 2801
  • [5] Finding frequent trajectories by clustering and sequential pattern mining
    Arthur A.Shaw
    N.P.Gopalan
    Journal of Traffic and Transportation Engineering(English Edition) , 2014, (06) : 393 - 403
  • [6] Finding frequent trajectories by clustering and sequential pattern mining
    Shaw, Arthur A.
    Gopalan, N. P.
    JOURNAL OF TRAFFIC AND TRANSPORTATION ENGINEERING-ENGLISH EDITION, 2014, 1 (06) : 393 - 403
  • [7] Identification of process improvement opportunity based on frequent activity sequential pattern mining technology
    Chen, Liang
    Gao, Jianmin
    Li, Qing
    Chen, Kun
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2006, 40 (11): : 1310 - 1314
  • [8] Mining Order-Preserving Submatrices Based on Frequent Sequential Pattern Mining
    Xue, Yun
    Li, Yuting
    Deng, Weijun
    Li, Jiejin
    Tang, Jianxiong
    Liao, Zhengling
    Li, Tiechen
    HEALTH INFORMATION SCIENCE, HIS 2014, 2014, 8423 : 184 - 193
  • [9] Efficient Adaptive Frequent Pattern Mining Techniques for Market Analysis in Sequential and Parallel Systems
    Kuriakose, Sherly
    Nedunchezhian, Raju
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (02) : 175 - 185
  • [10] Super-sequence Frequent Pattern Mining on Sequential Dataset
    Yu, Xinran
    Korkmaz, Turgay
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,