Fast motif selection for biological sequences

Pavel Kuksa, Vladimir Pavlovic

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

We consider the problem of identifying motifs, recurring or conserved patterns, in the sets of biological sequences. To solve this task, we present new deterministic and exact algorithms for finding patterns that are embedded as exact or inexact instances in all or most of the input strings. The proposed algorithms (1) improve search efficiency compared to existing exact algorithms by focusing search on a selected set of potential motif instances, and (2) scale well with the input length and the size of alphabet. Our algorithms are orders of magnitude faster than existing exact algorithms for common pattern identification. We evaluate our algorithms on benchmark motif finding problems and real applications in biological sequence analysis and show that they exhibit significant running time improvements compared to the state-of-the-art approaches.

Original languageEnglish (US)
Title of host publication2009 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2009
Pages79-82
Number of pages4
DOIs
StatePublished - Dec 1 2009
Event2009 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2009 - Washington, D.C., United States
Duration: Nov 1 2009Nov 4 2009

Publication series

Name2009 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2009

Other

Other2009 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2009
CountryUnited States
CityWashington, D.C.
Period11/1/0911/4/09

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Software
  • Biomedical Engineering
  • Health Informatics

Keywords

  • Algorithms
  • Sequences
  • Tree searching

Fingerprint Dive into the research topics of 'Fast motif selection for biological sequences'. Together they form a unique fingerprint.

Cite this