Identification of optimal classification functions for biological sample and state discrimination from metabolic profiling data

Kyongbum Lee, Daehee Hwang, Tadaaki Yokoyama, George Stephanopoulos, Gregory N. Stephanopoulos, Martin L. Yarmush

Research output: Contribution to journalArticlepeer-review

12 Scopus citations


Motivations: Classification of biological samples for diagnostic purposes is a difficult task because of the many decisions involved on the number, type and functional manipulations of the input variables. This study presents a generally applicable strategy for systematic formulation of optimal diagnostic indexes. To this end, we develop a novel set of computational tools by integrating regression optimization, stepwise variable selection and cross-validation algorithms. Results: The proposed discrimination methodology was applied to plasma and tissue (liver) metabolic profiling data describing the time progression of liver dysfunction in a rat model of acute hepatic failure generated by D-galactosamine (GaIN) injection. From the plasma data, our methodology identified seven (out of a total of 23) metabolites, and the corresponding transform functions, as the best inputs to the optimal diagnostic index. This index showed better time resolution and increased noise robustness compared with an existing metabolic index, Fischer's BCAA/AAA molar ratio, as well as indexes generated using other commonly used discriminant analysis tools. Comparison of plasma and liver indexes found two consensus metabolites, lactate and glucose, which implicate glycolysis and/or gluconeogenesis in mediating the metabolic effects of GaIN.

Original languageEnglish (US)
Pages (from-to)959-969
Number of pages11
Issue number6
StatePublished - Apr 12 2004
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics


Dive into the research topics of 'Identification of optimal classification functions for biological sample and state discrimination from metabolic profiling data'. Together they form a unique fingerprint.

Cite this