Dimensionality Reduction Techniques for Visualizing Morphometric Data: Comparing Principal Component Analysis to Nonlinear Methods

被引:13
作者
Du, Trina Y. [1 ]
机构
[1] Univ Ottawa, Dept Biol, 30 Marie Curie, Ottawa, ON K1N 6N5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Data visualization; Morphological variation; Multivariate data; Theoretical biology; GEOMETRIC MORPHOMETRICS; DEVELOPMENTAL DYNAMICS; PHENOTYPE; EVOLUTIONARY; MORPHOSPACE; PATTERNS; SYSTEMS; ORIGINS; POINTS; SPACES;
D O I
10.1007/s11692-018-9464-9
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Principal component analysis (PCA) is the most widely used dimensionality reduction technique in the biological sciences, and is commonly employed to create 2D visualizations of geometric morphometric data. However, interesting biological information may be lost or misrepresented in these plots due to PCA's inability to summarize nonlinear dependencies between variables. Nonlinear alternative methods exist, but their effectiveness has never been tested on morphometric data. Here, the performance of PCA on the task of visualizing morphometric variation is compared to four nonlinear techniques: Sammon Mapping, Isomap, Locally Linear Embedding, and Laplacian Eigenmaps. The performance of methods is assessed on the basis of global and local preservation of pairwise distances for a variety of simulated and empirical datasets. The relative performance of PCA varies in function of the distribution of variation, complexity, and size of datasets. Overall, nonlinear methods show superior preservation of small differences between morphologies compared to PCA.
引用
收藏
页码:106 / 121
页数:16
相关论文
共 44 条
[1]  
Adams D.C., 2017, geomorph: geometric Morphometric Analyses of 2D/3D Landmark Data
[2]   Multivariate Phylogenetic Comparative Methods: Evaluations, Comparisons, and Recommendations [J].
Adams, Dean C. ;
Collyer, Michael L. .
SYSTEMATIC BIOLOGY, 2018, 67 (01) :14-31
[3]   FROM GENES TO PHENOTYPE - DYNAMIC-SYSTEMS AND EVOLVABILITY [J].
ALBERCH, P .
GENETICA, 1991, 84 (01) :5-11
[4]  
Altenberg Lee, 2005, P99
[5]  
Bartholomew D, 2011, WILEY SER PROBAB ST, P1, DOI 10.1002/9781119970583
[6]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396
[7]  
Bookstein FL, 1996, NATO ADV SCI INST SE, V284, P131
[8]   Shaping space: the possible and the attainable in RNA genotype-phenotype mapping [J].
Fontana, W ;
Schuster, P .
JOURNAL OF THEORETICAL BIOLOGY, 1998, 194 (04) :491-515
[9]   Comparing the differential filling of morphospace and allometric space through time: the morphological and developmental dynamics of Early Jurassic ammonoids [J].
Gerber, Sylvain .
PALEOBIOLOGY, 2011, 37 (03) :369-382
[10]   Analysis of a complex of statistical variables into principal components [J].
Hotelling, H .
JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1933, 24 :417-441