Spell Sequences, State Proximities, and Distance Metrics

被引:36
作者
Elzinga, Cees H. [1 ]
Studer, Matthias [2 ]
机构
[1] Vrije Univ Amsterdam, Fac Social Sci, NL-1081 HV Amsterdam, Netherlands
[2] Univ Geneva, Inst Demog & Life Course Studies, Geneva, Switzerland
基金
瑞士国家科学基金会;
关键词
sequence analysis; OM; subsequence; soft-matching; duration-weighting; OPTIMAL MATCHING ANALYSIS; SOCIAL-SCIENCE DATA; LIFE-COURSE; SIMILARITY; TRAJECTORIES; CAREERS; COST;
D O I
10.1177/0049124114540707
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Because optimal matching (OM) distance is not very sensitive to differences in the order of states, we introduce a subsequence-based distance measure that can be adapted to subsequence length, to subsequence duration, and to soft-matching of states. Using a simulation technique developed by Studer, we investigate the sensitivity, relative to OM, of several variants of this metric to variations in order, timing, and duration of states. The results show that the behavior of the metric is as intended. Furthermore, we use family formation data from the Swiss Household Panel to compare a few variants of the new metric to OM. The new metrics have been implemented in the freely available TraMineR-package.
引用
收藏
页码:3 / 47
页数:45
相关论文
共 57 条
[1]   MEASURING RESEMBLANCE IN SEQUENCE DATA - AN OPTIMAL MATCHING ANALYSIS OF MUSICIANS CAREERS [J].
ABBOTT, A ;
HRYCAK, A .
AMERICAN JOURNAL OF SOCIOLOGY, 1990, 96 (01) :144-185
[2]  
Aisenbrey Silke, 2010, SOCIOLOGICAL METHODS, V38, P430
[3]  
[Anonymous], 2001, J. Am. Stat. Assoc.
[4]  
[Anonymous], 2006, SEQUENCE ANAL METRIC
[5]  
[Anonymous], 2000, Matrix analysis and applied linear algebra
[6]  
[Anonymous], 2000, Pattern Classification
[7]   The subsequence composition of a string [J].
Apostolico, Alberto ;
Cunial, Fabio .
THEORETICAL COMPUTER SCIENCE, 2009, 410 (43) :4360-4371
[8]  
Banerjee A, 2005, J MACH LEARN RES, V6, P1705
[9]  
Billari F.C., 2001, International Journal of Population Geography, V7, P339, DOI DOI 10.1002/IJPG.231
[10]   Parametric and Nonparametric Analysis of Life Courses: An Application to Family Formation Patterns [J].
Bonetti, Marco ;
Piccarreta, Raffaella ;
Salford, Gaia .
DEMOGRAPHY, 2013, 50 (03) :881-902