On Statistical Modeling of Sequencing Noise in High Depth Data to Assess Tumor Evolution

Raul Rabadan, Gyan Bhanot, Sonia Marsilio, Nicholas Chiorazzi, Laura Pasqualucci, Hossein Khiabanian

Research output: Contribution to journalArticlepeer-review

3 Scopus citations


One cause of cancer mortality is tumor evolution to therapy-resistant disease. First line therapy often targets the dominant clone, and drug resistance can emerge from preexisting clones that gain fitness through therapy-induced natural selection. Such mutations may be identified using targeted sequencing assays by analysis of noise in high-depth data. Here, we develop a comprehensive, unbiased model for sequencing error background. We find that noise in sufficiently deep DNA sequencing data can be approximated by aggregating negative binomial distributions. Mutations with frequencies above noise may have prognostic value. We evaluate our model with simulated exponentially expanded populations as well as data from cell line and patient sample dilution experiments, demonstrating its utility in prognosticating tumor progression. Our results may have the potential to identify significant mutations that can cause recurrence. These results are relevant in the pretreatment clinical setting to determine appropriate therapy and prepare for potential recurrence pretreatment.

Original languageEnglish (US)
Pages (from-to)143-155
Number of pages13
JournalJournal of Statistical Physics
Issue number1
StatePublished - Jul 1 2018

All Science Journal Classification (ASJC) codes

  • Statistical and Nonlinear Physics
  • Mathematical Physics


  • 02.50.-r
  • 87.18.Tt
  • 87.18.Vf
  • 87.18.Wd
  • 87.23.Kg

Fingerprint Dive into the research topics of 'On Statistical Modeling of Sequencing Noise in High Depth Data to Assess Tumor Evolution'. Together they form a unique fingerprint.

Cite this