Information Preserving Component Analysis: Data Projections for Flow Cytometry Analysis

被引:14
作者
Carter, Kevin M. [1 ]
Raich, Raviv [2 ]
Finn, William G. [3 ]
Hero, Alfred O., III [1 ]
机构
[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[2] Oregon State Univ, Sch Elect Engn & Comp Sci, Corvallis, OR 97331 USA
[3] Univ Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
Dimensionality reduction; flow cytometry; information geometry; multivariate data analysis; statistical manifold; DIMENSIONALITY REDUCTION;
D O I
10.1109/JSTSP.2008.2011112
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Flow cytometry is often used to characterize the malignant cells in leukemia and lymphoma patients, traced to the level of the individual cell. Typically, flow-cytometric data analysis is performed through a series of 2-D projections onto the axes of the data set. Through the years, clinicians have determined combinations of different fluorescent markers which generate relatively known expression patterns for specific subtypes of leukemia and lymphoma - cancers of the hematopoietic system. By only viewing a series of 2-D projections, the high-dimensional nature of the data is rarely exploited. In this paper we present a means of determining a low-dimensional projection which maintains the high-dimensional relationships (i.e., information distance) between differing oncological data sets. By using machine learning techniques, we allow clinicians to visualize data in a low dimension defined by a linear combination of all of the available markers, rather than just two at a time. This provides an aid in diagnosing similar forms of cancer, as well as a means for variable selection in exploratory flow-cytometric research. We refer to our method as information preserving component analysis (IPCA).
引用
收藏
页码:148 / 158
页数:11
相关论文
共 28 条
[1]  
Amari S.-i., 2000, Methods of Information Geometry, V191, DOI DOI 10.1090/MMONO/191
[2]  
[Anonymous], ADV NEURAL INFORM PR
[3]   Fine: Information embedding for document classification [J].
Carter, Kevin M. ;
Raich, Raviv ;
Hero, Alfred O., III .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :1861-+
[4]  
CARTER KM, 2008, ARXIV08094866 U MICH
[5]   Ig V gene mutation status and CD38 expression as novel prognostic indicators in chronic lymphocytic leukemia [J].
Damle, RN ;
Wasil, T ;
Fais, F ;
Ghiotto, F ;
Valetto, A ;
Allen, SL ;
Buchbinder, A ;
Budman, D ;
Dittmar, K ;
Kolitz, J ;
Lichtman, SM ;
Schulman, P ;
Vinciguerra, VP ;
Rai, KR ;
Ferrarini, M ;
Chiorazzi, N .
BLOOD, 1999, 94 (06) :1840-1847
[6]  
Douglas SC, 2007, INT CONF ACOUST SPEE, P1401
[7]   The geometry of algorithms with orthogonality constraints [J].
Edelman, A ;
Arias, TA ;
Smith, ST .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1998, 20 (02) :303-353
[8]  
FINN WG, 2009, CYTOM PT B, V76
[9]   PROJECTION PURSUIT ALGORITHM FOR EXPLORATORY DATA-ANALYSIS [J].
FRIEDMAN, JH ;
TUKEY, JW .
IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (09) :881-890
[10]   REGULARIZED DISCRIMINANT-ANALYSIS [J].
FRIEDMAN, JH .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1989, 84 (405) :165-175