Skip to main navigation Skip to search Skip to main content

PDC: Pattern discovery with confidence in DNA sequences

  • Lu Yi
  • , Lu Shiyong
  • , Fotouhi Farshad
  • , Sun Yon
  • , Yang Zijiang
  • , Lily R. Liang
  • Wayne State University
  • University of the District of Columbia

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Pattern discovery in DNA sequences is one of the most challenging tasks in molecular biology and computer science. The main goal of pattern discovery in DNA sequences is to identify sequences of important biological function hidden in the huge amounts of genomic sequences. Several methods and techniques have been proposed and implemented in this field. However, in order to reduce computational time and complexity, most of them either focus on finding short DNA patterns or require explicit specification of pattern lengths in advance. Scientists need to find longer patterns without specifying pattern lengths in advance and still have good performance. In this paper, we propose a pattern discovery algorithm called Pattern Discovery with Confidence (PDC). Based on biological studies, we propose a new measurement system that can identify over-represented patterns inside DNA sequences. Using this measurement, PDC algorithm can narrow the search space by checking dependency along the pattern, thus extending the pattern as long as possible without the need to restrict or specify the length of a pattern in advance. Experimental tests demonstrate that this approach can find long, interesting patterns within a reasonable computation time.

Original languageEnglish
Title of host publicationProceedings of the Seventh IASTED International Conference on Advances in Computer Science and Technology
Pages345-350
Number of pages6
StatePublished - 2006
Event7th IASTED International Conference on Advances in Computer Science and Technology - Puerto Vallarta, Mexico
Duration: 23 Jan 200625 Jan 2006

Publication series

NameProceedings of the Seventh IASTED International Conference on Advances in Computer Science and Technology

Conference

Conference7th IASTED International Conference on Advances in Computer Science and Technology
Country/TerritoryMexico
CityPuerto Vallarta
Period23/01/0625/01/06

Keywords

  • Confidence
  • DNA sequence
  • Pattern discovery

Fingerprint

Dive into the research topics of 'PDC: Pattern discovery with confidence in DNA sequences'. Together they form a unique fingerprint.

Cite this