Cell morphology-based machine learning models for human cell state classification

Herein, we implement and access machine learning architectures to ascertain models that differentiate healthy from apoptotic cells using exclusively forward (FSC) and side (SSC) scatter flow cytometry information. To generate training data, colorectal cancer HCT116 cells were subjected to miR-34a tr...

Full description

Bibliographic Details
Main Authors: Bleris, L. (Author), Li, Y. (Author), Nguyen, K. (Author), Nowak, C.M (Author), Pham, U. (Author)
Format: Article
Language:English
Published: Nature Research 2021
Subjects:
Online Access:View Fulltext in Publisher
LEADER 02702nam a2200313Ia 4500
001 10.1038-s41540-021-00180-y
008 220427s2021 CNT 000 0 und d
020 |a 20567189 (ISSN) 
245 1 0 |a Cell morphology-based machine learning models for human cell state classification 
260 0 |b Nature Research  |c 2021 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1038/s41540-021-00180-y 
520 3 |a Herein, we implement and access machine learning architectures to ascertain models that differentiate healthy from apoptotic cells using exclusively forward (FSC) and side (SSC) scatter flow cytometry information. To generate training data, colorectal cancer HCT116 cells were subjected to miR-34a treatment and then classified using a conventional Annexin V/propidium iodide (PI)-staining assay. The apoptotic cells were defined as Annexin V-positive cells, which include early and late apoptotic cells, necrotic cells, as well as other dying or dead cells. In addition to fluorescent signal, we collected cell size and granularity information from the FSC and SSC parameters. Both parameters are subdivided into area, height, and width, thus providing a total of six numerical features that informed and trained our models. A collection of logistical regression, random forest, k-nearest neighbor, multilayer perceptron, and support vector machine was trained and tested for classification performance in predicting cell states using only the six aforementioned numerical features. Out of 1046 candidate models, a multilayer perceptron was chosen with 0.91 live precision, 0.93 live recall, 0.92 live f value and 0.97 live area under the ROC curve when applied on standardized data. We discuss and highlight differences in classifier performance and compare the results to the standard practice of forward and side scatter gating, typically performed to select cells based on size and/or complexity. We demonstrate that our model, a ready-to-use module for any flow cytometry-based analysis, can provide automated, reliable, and stain-free classification of healthy and apoptotic cells using exclusively size and granularity information. © 2021, The Author(s). 
650 0 4 |a cell size 
650 0 4 |a Cell Size 
650 0 4 |a flow cytometry 
650 0 4 |a Flow Cytometry 
650 0 4 |a human 
650 0 4 |a Humans 
650 0 4 |a machine learning 
650 0 4 |a Machine Learning 
650 0 4 |a Neural Networks, Computer 
650 0 4 |a Propidium 
650 0 4 |a propidium iodide 
700 1 |a Bleris, L.  |e author 
700 1 |a Li, Y.  |e author 
700 1 |a Nguyen, K.  |e author 
700 1 |a Nowak, C.M.  |e author 
700 1 |a Pham, U.  |e author 
773 |t npj Systems Biology and Applications