Novel machine learning approach to differential cell flow cytometry analysis based on projection pursuit

  • Mahan Dastgiri
  • , Javier Cabrera
  • , Yajie Duan
  • , Davit Sargsyan
  • , Craig W. Gambogi
  • , Abraham Adokwei
  • , Rebecca Mary Peter
  • , Po Chung Chou
  • , Ge Cheng
  • , Chun Pang Lin
  • , Jocelyn Sendecki
  • , Helena Geys
  • , Kanaka Tatikola
  • , Ah Ng Kong

Research output: Contribution to journalArticlepeer-review

Abstract

This paper introduces the novel methodology of differential projection pursuit and its applications to the analysis of large datasets. The method was applied to a cell flow cytometry dataset as an alternative approach to analyze this type of data. Multicolor cell flow cytometry is a well-established laboratory technique to identify cell subpopulations by measuring their physical and biochemical characteristics. Differential projection pursuit helps to find regions with maximal differences between two or more treatments or distributions. Data analysis in flow cytometry relies on gating, the process of manually selecting successive subpopulations of cells using two-dimensional plots. Plotting the variables only two at a time could mask the hidden structure present in the data, and manual selection makes the analysis inconsistent and arbitrary. The new methodology could automate flow cytometry analysis by utilizing the combination of projection pursuit, data nuggets, and factor analysis. When applied to flow cytometry data, differential projection pursuit allows researchers to quickly identify differences in cell populations exposed to different experimental conditions. This methodology could create a platform to explore differences in large datasets and improve the cell flow cytometry analysis clarity and reproducibility by considering the data in its true dimensional space and through automation, respectively.

Original languageEnglish (US)
JournalJournal of Biopharmaceutical Statistics
DOIs
StateAccepted/In press - 2025
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Pharmacology
  • Pharmacology (medical)

Keywords

  • Big data
  • data nuggets
  • differential projection pursuit
  • flow cytometry
  • machine learning
  • varimax rotation

Fingerprint

Dive into the research topics of 'Novel machine learning approach to differential cell flow cytometry analysis based on projection pursuit'. Together they form a unique fingerprint.

Cite this