Deep Phenotyping - Harnessing Data Richness for Unsupervised High-Content Analysis
Poster May 05, 2016
Huang Dong, Wang Yi, Maciej Hermanowicz, Ke Yiping, Maja Choma, Lee Kee Khoon, Frederic Bard
High-content analysis (HCA) has recently arisen as a novel systematic approach to cell biology research. Due to the intrinsically high variability and noise level of the assay, it has been primarily applied to supervised analysis, where the phenotypes are known beforehand and manual labels are provided to guide the analysis. Furthermore, most existing research uses only a few parameters, failing to exploit the richness of the raw images.
In this work, we develop an end-to-end computational framework focused on unsupervised HCA. The framework can identify unknown subtle phenotypes from massive multi-parametric data, thus dubbed “Deep Phenotyping”, and correlate them with gene perturbations. It consists of a series of modules: high-dimensional parameter extraction for each cell from raw images, multi-level quality control using both supervised and unsupervised methods with minimal labeling effort, a novelty detection algorithm to automatically retrieve cells that differ from the control, and an unsupervised clustering algorithm to identify unknown phenotypes in the multi-parametric space.
The framework is modularized and applicable to a variety of HCA tasks. We demonstrate its use on the phenotypic analysis of the Golgi apparatus. Novel Golgi phenotypes beyond the three known ones are identified, and potential protein-protein interactions regulating Golgi functions are revealed.
Multiplexing cell-based assays is possible using 3D culture models that are larger and more complex than monolayers
Real-time detection methods to measure live or dead cells provide much flexibility for multiplexing
All multiplexed assay combinations should be verified using appropriate controls for each 3D cell culture model.