CP-CHARM: segmentation-free image classification made accessible

Cited by: 35
Authors
Uhlmann, Virginie [1, 2]
Singh, Shantanu [2 ]
Carpenter, Anne E. [2 ]
Affiliations
[1] Swiss Fed Inst Technol EPFL, Biomed Imaging Grp, Lausanne, Switzerland
[2] Broad Inst Harvard & MIT, Imaging Platform, Cambridge, MA USA
Source
BMC BIOINFORMATICS | 2016 / Volume 17
Funding
US National Institutes of Health (NIH)
Keywords
Image classification; Biological imaging; Image features; Segmentation-free analysis; High-dimensional classification; SUBCELLULAR LOCATION PATTERNS; SOFTWARE; RECOGNITION;
DOI
10.1186/s12859-016-0895-y
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Automated classification using machine learning often relies on features derived from segmenting individual objects, which can be difficult to automate. WND-CHARM is a previously developed classification algorithm in which features are computed on the whole image, thereby avoiding the need for segmentation. The algorithm obtained encouraging results but requires considerable computational expertise to execute. Furthermore, some benchmark sets have been shown to be subject to confounding artifacts that overestimate classification accuracy.
Results: We developed CP-CHARM, a user-friendly image-based classification algorithm inspired by WND-CHARM in (i) its ability to capture a wide variety of morphological aspects of the image, and (ii) the absence of requirement for segmentation. In order to make such an image-based classification method easily accessible to the biological research community, CP-CHARM relies on the widely-used open-source image analysis software CellProfiler for feature extraction. To validate our method, we reproduced WND-CHARM's results and ensured that CP-CHARM obtained comparable performance. We then successfully applied our approach on cell-based assay data and on tissue images. We designed these new training and test sets to reduce the effect of batch-related artifacts.
Conclusions: The proposed method preserves the strengths of WND-CHARM - it extracts a wide variety of morphological features directly on whole images thereby avoiding the need for cell segmentation, but additionally, it makes the methods easily accessible for researchers without computational expertise by implementing them as a CellProfiler pipeline. It has been demonstrated to perform well on a wide range of bioimage classification problems, including on new datasets that have been carefully selected and annotated to minimize batch effects. This provides for the first time a realistic and reliable assessment of the whole image classification strategy.
Pages: 12