AVA,Dx: Analysis of Variation for Association with Disease

Project Details


? DESCRIPTION (provided by applicant): Every individual genome predisposes its carrier to some set of diseases. Despite all research efforts, however, heritable causes of complex disease remain elusive. This is largely due to the inherent complexity of pathogenesis pathways and the interaction of individual genomic determinants with the environment. Elucidating causative genetics of pathogenesis will spur the development of better treatments and prevention tactics, modulating the presence of individual-specific stressors. Here, we propose to build AVA, Dx (Analysis of Variation for Association with Disease) a computational method for defining the functional role of DNA variation in complex diseases. AVA, Dx will use exome sequence data to pinpoint the molecular pathways affected in disease and to predict individual disease predisposition. As a proof of concept, we will use the nearly two thousand available sequenced exomes of Tourette Disorder, Crohn's Disease, and Chronic Obstructive Pulmonary Disease cohorts to build separate AVA, Dx instances. For each individual disease cohort we will first build a predictor of the impact of genetic variation on molecular gene function. This predictor will be unique in its ability to account for variant genotype in evaluating the impact of all kinds of gene-associated variants, rare and common, coding and non-coding. We will further encode each exome in our set as a vector of function impact scores for all genes. Based on this set of vectors, feature selection techniques will identify disease-genes; i.e. genes with exome-specific function changes correlating best to the clinical annotation of individual disease status (disease/healthy). Note that in this manner we expect to find a sizeable set of novel disease genes. We will train an artificial learning classifier to recognize the functional differences in sts of selected genes to distinguish the clinical status of the newly sequenced exomes (individuals). As the exome sequencing techniques used in our study vary by cohort, we will build experimental setup flexibility into our analysis structure. As a result, AVA, Dx techniques will be useful for drawing conclusions on existing sequencing data. AVA, Dx will generate experimentally testable hypotheses of disease pathogenesis by pinpointing the affected molecular functions. Moreover, AVA, Dx will be prognostic, allowing determination of disease predisposition prior to clinical diagnosis.
Effective start/end date9/4/155/31/19


  • National Institute of General Medical Sciences: $300,276.00
  • National Institute of General Medical Sciences: $306,125.00
  • National Institute of General Medical Sciences: $304,782.00
  • National Institute of General Medical Sciences: $270,132.00


  • Genetics
  • Molecular Biology


Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.