Robust multi-tissue gene panel for cancer detection

Joseph Irgon, C. Chris Huang, Yi Zhang, Dmitri Talantov, Gyan Bhanot, Sándor Szalma

Research output: Contribution to journalArticle

10 Citations (Scopus)

Abstract

Background: We have identified a set of genes whose relative mRNA expression levels in various solid tumors can be used to robustly distinguish cancer from matching normal tissue. Our current feature set consists of 113 gene probes for 104 unique genes, originally identified as differentially expressed in solid primary tumors in microarray data on Affymetrix HG-U133A platform in five tissue types: breast, colon, lung, prostate and ovary. For each dataset, we first identified a set of genes significantly differentially expressed in tumor vs. normal tissue at p-value = 0.05 using an experimentally derived error model. Our common cancer gene panel is the intersection of these sets of significantly dysregulated genes and can distinguish tumors from normal tissue on all these five tissue types.Methods: Frozen tumor specimens were obtained from two commercial vendors Clinomics (Pittsfield, MA) and Asterand (Detroit, MI). Biotinylated targets were prepared using published methods (Affymetrix, CA) and hybridized to Affymetrix U133A GeneChips (Affymetrix, CA). Expression values for each gene were calculated using Affymetrix GeneChip analysis software MAS 5.0. We then used a software package called Genes@Work for differential expression discovery, and SVM light linear kernel for building classification models.Results: We validate the predictability of this gene list on several publicly available data sets generated on the same platform. Of note, when analysing the lung cancer data set of Spira et al, using an SVM linear kernel classifier, our gene panel had 94.7% leave-one-out accuracy compared to 87.8% using the gene panel in the original paper. In addition, we performed high-throughput validation on the Dana Farber Cancer Institute GCOD database and several GEO datasets.Conclusions: Our result showed the potential for this panel as a robust classification tool for multiple tumor types on the Affymetrix platform, as well as other whole genome arrays. Apart from possible use in diagnosis of early tumorigenesis, some other potential uses of our methodology and gene panel would be in assisting pathologists in diagnosis of pre-cancerous lesions, determining tumor boundaries, assessing levels of contamination in cell populations in vitro and identifying transformations in cell cultures after multiple passages. Moreover, based on the robustness of this gene panel in identifying normal vs. tumor, mislabelled or misinterpreted samples can be pinpointed with high confidence.

Original languageEnglish (US)
Article number319
JournalBMC cancer
Volume10
DOIs
StatePublished - Jun 22 2010

Fingerprint

Neoplasm Genes
Genes
Neoplasms
Software
Prostate
Ovary
Lung Neoplasms
Colon
Carcinogenesis
Breast
Cell Culture Techniques
Genome
Databases

All Science Journal Classification (ASJC) codes

  • Genetics
  • Oncology
  • Cancer Research

Cite this

Irgon, J., Huang, C. C., Zhang, Y., Talantov, D., Bhanot, G., & Szalma, S. (2010). Robust multi-tissue gene panel for cancer detection. BMC cancer, 10, [319]. https://doi.org/10.1186/1471-2407-10-319
Irgon, Joseph ; Huang, C. Chris ; Zhang, Yi ; Talantov, Dmitri ; Bhanot, Gyan ; Szalma, Sándor. / Robust multi-tissue gene panel for cancer detection. In: BMC cancer. 2010 ; Vol. 10.
@article{40dc2430b17f43c8846eeddbb1dc9813,
title = "Robust multi-tissue gene panel for cancer detection",
abstract = "Background: We have identified a set of genes whose relative mRNA expression levels in various solid tumors can be used to robustly distinguish cancer from matching normal tissue. Our current feature set consists of 113 gene probes for 104 unique genes, originally identified as differentially expressed in solid primary tumors in microarray data on Affymetrix HG-U133A platform in five tissue types: breast, colon, lung, prostate and ovary. For each dataset, we first identified a set of genes significantly differentially expressed in tumor vs. normal tissue at p-value = 0.05 using an experimentally derived error model. Our common cancer gene panel is the intersection of these sets of significantly dysregulated genes and can distinguish tumors from normal tissue on all these five tissue types.Methods: Frozen tumor specimens were obtained from two commercial vendors Clinomics (Pittsfield, MA) and Asterand (Detroit, MI). Biotinylated targets were prepared using published methods (Affymetrix, CA) and hybridized to Affymetrix U133A GeneChips (Affymetrix, CA). Expression values for each gene were calculated using Affymetrix GeneChip analysis software MAS 5.0. We then used a software package called Genes@Work for differential expression discovery, and SVM light linear kernel for building classification models.Results: We validate the predictability of this gene list on several publicly available data sets generated on the same platform. Of note, when analysing the lung cancer data set of Spira et al, using an SVM linear kernel classifier, our gene panel had 94.7{\%} leave-one-out accuracy compared to 87.8{\%} using the gene panel in the original paper. In addition, we performed high-throughput validation on the Dana Farber Cancer Institute GCOD database and several GEO datasets.Conclusions: Our result showed the potential for this panel as a robust classification tool for multiple tumor types on the Affymetrix platform, as well as other whole genome arrays. Apart from possible use in diagnosis of early tumorigenesis, some other potential uses of our methodology and gene panel would be in assisting pathologists in diagnosis of pre-cancerous lesions, determining tumor boundaries, assessing levels of contamination in cell populations in vitro and identifying transformations in cell cultures after multiple passages. Moreover, based on the robustness of this gene panel in identifying normal vs. tumor, mislabelled or misinterpreted samples can be pinpointed with high confidence.",
author = "Joseph Irgon and Huang, {C. Chris} and Yi Zhang and Dmitri Talantov and Gyan Bhanot and S{\'a}ndor Szalma",
year = "2010",
month = "6",
day = "22",
doi = "10.1186/1471-2407-10-319",
language = "English (US)",
volume = "10",
journal = "BMC Cancer",
issn = "1471-2407",
publisher = "BioMed Central",

}

Irgon, J, Huang, CC, Zhang, Y, Talantov, D, Bhanot, G & Szalma, S 2010, 'Robust multi-tissue gene panel for cancer detection', BMC cancer, vol. 10, 319. https://doi.org/10.1186/1471-2407-10-319

Robust multi-tissue gene panel for cancer detection. / Irgon, Joseph; Huang, C. Chris; Zhang, Yi; Talantov, Dmitri; Bhanot, Gyan; Szalma, Sándor.

In: BMC cancer, Vol. 10, 319, 22.06.2010.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Robust multi-tissue gene panel for cancer detection

AU - Irgon, Joseph

AU - Huang, C. Chris

AU - Zhang, Yi

AU - Talantov, Dmitri

AU - Bhanot, Gyan

AU - Szalma, Sándor

PY - 2010/6/22

Y1 - 2010/6/22

N2 - Background: We have identified a set of genes whose relative mRNA expression levels in various solid tumors can be used to robustly distinguish cancer from matching normal tissue. Our current feature set consists of 113 gene probes for 104 unique genes, originally identified as differentially expressed in solid primary tumors in microarray data on Affymetrix HG-U133A platform in five tissue types: breast, colon, lung, prostate and ovary. For each dataset, we first identified a set of genes significantly differentially expressed in tumor vs. normal tissue at p-value = 0.05 using an experimentally derived error model. Our common cancer gene panel is the intersection of these sets of significantly dysregulated genes and can distinguish tumors from normal tissue on all these five tissue types.Methods: Frozen tumor specimens were obtained from two commercial vendors Clinomics (Pittsfield, MA) and Asterand (Detroit, MI). Biotinylated targets were prepared using published methods (Affymetrix, CA) and hybridized to Affymetrix U133A GeneChips (Affymetrix, CA). Expression values for each gene were calculated using Affymetrix GeneChip analysis software MAS 5.0. We then used a software package called Genes@Work for differential expression discovery, and SVM light linear kernel for building classification models.Results: We validate the predictability of this gene list on several publicly available data sets generated on the same platform. Of note, when analysing the lung cancer data set of Spira et al, using an SVM linear kernel classifier, our gene panel had 94.7% leave-one-out accuracy compared to 87.8% using the gene panel in the original paper. In addition, we performed high-throughput validation on the Dana Farber Cancer Institute GCOD database and several GEO datasets.Conclusions: Our result showed the potential for this panel as a robust classification tool for multiple tumor types on the Affymetrix platform, as well as other whole genome arrays. Apart from possible use in diagnosis of early tumorigenesis, some other potential uses of our methodology and gene panel would be in assisting pathologists in diagnosis of pre-cancerous lesions, determining tumor boundaries, assessing levels of contamination in cell populations in vitro and identifying transformations in cell cultures after multiple passages. Moreover, based on the robustness of this gene panel in identifying normal vs. tumor, mislabelled or misinterpreted samples can be pinpointed with high confidence.

AB - Background: We have identified a set of genes whose relative mRNA expression levels in various solid tumors can be used to robustly distinguish cancer from matching normal tissue. Our current feature set consists of 113 gene probes for 104 unique genes, originally identified as differentially expressed in solid primary tumors in microarray data on Affymetrix HG-U133A platform in five tissue types: breast, colon, lung, prostate and ovary. For each dataset, we first identified a set of genes significantly differentially expressed in tumor vs. normal tissue at p-value = 0.05 using an experimentally derived error model. Our common cancer gene panel is the intersection of these sets of significantly dysregulated genes and can distinguish tumors from normal tissue on all these five tissue types.Methods: Frozen tumor specimens were obtained from two commercial vendors Clinomics (Pittsfield, MA) and Asterand (Detroit, MI). Biotinylated targets were prepared using published methods (Affymetrix, CA) and hybridized to Affymetrix U133A GeneChips (Affymetrix, CA). Expression values for each gene were calculated using Affymetrix GeneChip analysis software MAS 5.0. We then used a software package called Genes@Work for differential expression discovery, and SVM light linear kernel for building classification models.Results: We validate the predictability of this gene list on several publicly available data sets generated on the same platform. Of note, when analysing the lung cancer data set of Spira et al, using an SVM linear kernel classifier, our gene panel had 94.7% leave-one-out accuracy compared to 87.8% using the gene panel in the original paper. In addition, we performed high-throughput validation on the Dana Farber Cancer Institute GCOD database and several GEO datasets.Conclusions: Our result showed the potential for this panel as a robust classification tool for multiple tumor types on the Affymetrix platform, as well as other whole genome arrays. Apart from possible use in diagnosis of early tumorigenesis, some other potential uses of our methodology and gene panel would be in assisting pathologists in diagnosis of pre-cancerous lesions, determining tumor boundaries, assessing levels of contamination in cell populations in vitro and identifying transformations in cell cultures after multiple passages. Moreover, based on the robustness of this gene panel in identifying normal vs. tumor, mislabelled or misinterpreted samples can be pinpointed with high confidence.

UR - http://www.scopus.com/inward/record.url?scp=77953667368&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77953667368&partnerID=8YFLogxK

U2 - 10.1186/1471-2407-10-319

DO - 10.1186/1471-2407-10-319

M3 - Article

C2 - 20569444

AN - SCOPUS:77953667368

VL - 10

JO - BMC Cancer

JF - BMC Cancer

SN - 1471-2407

M1 - 319

ER -

Irgon J, Huang CC, Zhang Y, Talantov D, Bhanot G, Szalma S. Robust multi-tissue gene panel for cancer detection. BMC cancer. 2010 Jun 22;10. 319. https://doi.org/10.1186/1471-2407-10-319