Classifying antibodies using flow cytometry data: Class prediction and class discovery

被引：1

作者：

Salganik, MP

Milford, EL

Hardie, DL

Shaw, S

Wand, MP

机构：

[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA

[2] Harvard Univ, Brigham & Womens Hosp, Sch Med, Dept Med, Boston, MA 02115 USA

[3] Univ Birmingham, Biomed Res Inst, MRC, Ctr Immune Regulat, Birmingham 2TT, W Midlands, England

[4] NCI, Expt Immunol Branch, NIH, Bethesda, MD 20892 USA

[5] Univ New S Wales, Sch Math, Dept Stat, Sydney, NSW 2052, Australia

来源：

BIOMETRICAL JOURNAL | 2005年 / 47卷 / 05期

关键词：

classification; monoclonal antibodies; flow cytometry; dissimilarity measure; kernel smoothing; SiZer; class discovery; class prediction; unsupervised learning; supervised learning;

D O I：

10.1002/bimj.200310142

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Classifying monoclonal antibodies, based on the similarity of their binding to the proteins (antigens) on the surface of blood cells, is essential for progress in immunology, hematology and clinical medicine. The collaborative efforts of researchers from many countries have led to the classification of thousands of antibodies into 247 clusters of differentiation (CD). Classification is based on flow cytometry and biochemical data. In preliminary classifications of antibodies based on flow cytometry data, the object requiring classification (an antibody) is described by a set of random samples from unknown densities of fluorescence intensity. An individual sample is collected in the experiment, where a population of cells of a certain type is stained by the identical fluorescently marked replicates of the antibody of interest. Samples are collected for multiple cell types. The classification problems of interest include identifying new CDs (class discovery or unsupervised learning) and assigning new antibodies to the known CD clusters (class prediction or supervised learning). These problems have attracted limited attention from statisticians. We recommend a novel approach to the classification process in which a computer algorithm suggests to the analyst the subset of the "most appropriate" classifications of an antibody in class prediction problems or the "most similar" pairs/groups of antibodies in class discovery problems. The suggested algorithm speeds up the analysis of a flow cytometry data by a factor 10-20. This allows the analyst to focus on the interpretation of the automatically suggested preliminary classification solutions and on planning the subsequent biochemical experiments.

引用

页码：740 / 754

页数：15