Joint base-calling of Two DNA Sequences with Factor Graphs

Xiaomeng Shi, Desmond S. Lun, Muriel Médard, Ralf Kötter, James C. Meldrim, Andrew J. Barry

Research output: Contribution to journal › Article › peer-review

Abstract

Automated estimation of DNA base sequences is an important step in genomics and in many emerging fields in the biological and medical sciences. Current automated sequencers process single strands only. To improve the utility of existing technologies, we propose mixing two independent strands prior to electrophoresis and base-calling them jointly by applying the sum-product algorithm on factor graphs. We first present a statistical model for DNA sequencing data and examine its parameters. We then propose a practical heuristic to estimate the peaks, which are separated into two source sequences (major and minor) by passing messages on a factor graph. Simulation results show that joint base-calling can provide less accurate but valid results for the minor sequence. The algorithm presented provides a basis for future investigation of joint sequencing techniques.
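
As a rough illustration of the message-passing step described in the abstract, the following is a minimal sketch of the sum-product algorithm on a chain-shaped factor graph. It is not the statistical model or the heuristic from the article. Everything specific here is an assumption made for illustration: the peak amplitudes (1.0 for the major, 0.5 for the minor), the Gaussian noise level `sigma`, the placeholder coupling `pair_factor` between adjacent positions, and all function names.

```python
import math
from itertools import product

BASES = "ACGT"
PAIRS = list(product(BASES, repeat=2))  # joint (major, minor) hypotheses


def expected_peaks(major, minor, a_major=1.0, a_minor=0.5):
    """Assumed noiseless peak height per base at one position."""
    peaks = {b: 0.0 for b in BASES}
    peaks[major] += a_major
    peaks[minor] += a_minor
    return peaks


def obs_factor(y, pair, sigma=0.2):
    """Unnormalized Gaussian likelihood of observed heights y for a pair."""
    exp_y = expected_peaks(*pair)
    sq_err = sum((y[b] - exp_y[b]) ** 2 for b in BASES)
    return math.exp(-sq_err / (2.0 * sigma ** 2))


def pair_factor(prev, cur, repeat_weight=0.8):
    """Hypothetical coupling between adjacent positions (1.0 = none)."""
    w = 1.0
    if prev[0] == cur[0]:
        w *= repeat_weight
    if prev[1] == cur[1]:
        w *= repeat_weight
    return w


def chain_marginals(unary, pairwise):
    """Sum-product on a chain: returns normalized per-position marginals."""
    n, k = len(unary), len(unary[0])
    fwd = [[1.0] * k for _ in range(n)]
    bwd = [[1.0] * k for _ in range(n)]
    for t in range(1, n):  # left-to-right messages
        msg = [sum(fwd[t - 1][i] * unary[t - 1][i] * pairwise[i][j]
                   for i in range(k)) for j in range(k)]
        z = sum(msg)
        fwd[t] = [m / z for m in msg]
    for t in range(n - 2, -1, -1):  # right-to-left messages
        msg = [sum(bwd[t + 1][j] * unary[t + 1][j] * pairwise[i][j]
                   for j in range(k)) for i in range(k)]
        z = sum(msg)
        bwd[t] = [m / z for m in msg]
    marginals = []
    for t in range(n):  # combine incoming messages with the local factor
        belief = [fwd[t][i] * unary[t][i] * bwd[t][i] for i in range(k)]
        z = sum(belief)
        marginals.append([b / z for b in belief])
    return marginals


def joint_base_call(observations):
    """Pick the most probable (major, minor) pair at each position."""
    unary = [[obs_factor(y, p) for p in PAIRS] for y in observations]
    pairwise = [[pair_factor(p, q) for q in PAIRS] for p in PAIRS]
    marginals = chain_marginals(unary, pairwise)
    best = [PAIRS[max(range(len(PAIRS)), key=m.__getitem__)]
            for m in marginals]
    return "".join(p[0] for p in best), "".join(p[1] for p in best)


# Toy usage: noiseless peaks synthesized from two made-up sequences.
observations = [expected_peaks(a, b) for a, b in zip("ACGT", "TTAC")]
print(joint_base_call(observations))
```

On this noiseless toy input the sketch recovers both made-up sequences, because the observation factors dominate the weak coupling. With real electrophoresis traces the peak model, noise, and inter-position dependencies would have to come from the article's statistical model, not from these placeholder values.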

Original language: English (US)
Article number: 5420276
Pages (from-to): 724-733
Number of pages: 10
Journal: IEEE Transactions on Information Theory
Volume: 56
Issue number: 2
State: Published - Feb 2010
Externally published: Yes

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Computer Science Applications
  • Library and Information Sciences

Keywords

  • Base-calling
  • DNA modeling
  • DNA sequencing
  • Factor graphs
  • Sum-product algorithm
