Clustering Methods with Qualitative Data: a Mixed-Methods Approach for Prevention Research with Small Samples

被引:83
作者
Henry, David [1 ]
Dymnicki, Allison B. [2 ]
Mohatt, Nathaniel [3 ]
Allen, James [4 ]
Kelly, James G. [1 ]
机构
[1] Univ Illinois, Chicago, IL 60607 USA
[2] Amer Inst Res, Washington, DC USA
[3] Univ Colorado, Nederland, CO USA
[4] Univ Minnesota, Minneapolis, MN USA
关键词
Mixed methods; Cluster analysis; Simulation; Community leadership; BINARY DATA;
D O I
10.1007/s11121-015-0561-z
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Qualitative methods potentially add depth to prevention research but can produce large amounts of complex data even with small samples. Studies conducted with culturally distinct samples often produce voluminous qualitative data but may lack sufficient sample sizes for sophisticated quantitative analysis. Currently lacking in mixed-methods research are methods allowing for more fully integrating qualitative and quantitative analysis techniques. Cluster analysis can be applied to coded qualitative data to clarify the findings of prevention studies by aiding efforts to reveal such things as the motives of participants for their actions and the reasons behind counterintuitive findings. By clustering groups of participants with similar profiles of codes in a quantitative analysis, cluster analysis can serve as a key component in mixed-methods research. This article reports two studies. In the first study, we conduct simulations to test the accuracy of cluster assignment using three different clustering methods with binary data as produced when coding qualitative interviews. Results indicated that hierarchical clustering, K-means clustering, and latent class analysis produced similar levels of accuracy with binary data and that the accuracy of these methods did not decrease with samples as small as 50. Whereas the first study explores the feasibility of using common clustering methods with binary data, the second study provides a "real-world" example using data from a qualitative study of community leadership connected with a drug abuse prevention project. We discuss the implications of this approach for conducting prevention research, especially with small samples and culturally distinct communities.
引用
收藏
页码:1007 / 1016
页数:10
相关论文
共 25 条
  • [1] Anderberg M.R., 1973, Probability and Mathematical Statistics, DOI DOI 10.1016/C2013-0-06161-0
  • [2] [Anonymous], DAT MIN KNOWL DISC W, DOI DOI 10.1145/882082.882087
  • [3] [Anonymous], 1990, Basics of Qualitative Research
  • [4] [Anonymous], 1999, P EMPS 99 C LUN GERM
  • [5] [Anonymous], 2013, R LANG ENV STAT COMP
  • [6] [Anonymous], GROUPING METHODS
  • [7] [Anonymous], CLUSTER ANAL SAGE U
  • [8] CONVERGENT AND DISCRIMINANT VALIDATION BY THE MULTITRAIT-MULTIMETHOD MATRIX
    CAMPBELL, DT
    FISKE, DW
    [J]. PSYCHOLOGICAL BULLETIN, 1959, 56 (02) : 81 - 105
  • [9] An examination of indexes for determining the number of clusters in binary data sets
    Dimitriadou, E
    Dolnicar, S
    Weingessel, A
    [J]. PSYCHOMETRIKA, 2002, 67 (01) : 137 - 159
  • [10] Eshghi A., 2011, Journal of Data Science, V9, P271, DOI [DOI 10.6339/JDS.201104_09(2).0009, https://doi.org/10.6339/JDS.20110409(2).0009, DOI 10.6339/JDS.20110409(2).0009]