Skip to main navigation Skip to search Skip to main content

PVTree: A sequential pattern mining method for alignment independent phylogeny reconstruction

  • Xi'an Jiaotong University

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Phylogenetic tree is essential to understand evolution and it is usually constructed through multiple sequence alignment, which suffers from heavy computational burdens and requires sophisticated parameter tuning. Recently, alignment free methods based on k-mer profiles or common substrings provide alternative ways to construct phylogenetic trees. However, most of these methods ignore the global similarities between sequences or some specific valuable features, e.g., frequent patterns overall datasets. To make further improvement, we propose an alignment free algorithm based on sequential pattern mining, where each sequence is converted into a binary representation of sequential patterns among sequences. The phylogenetic tree is further constructed via clustering distance matrix which is calculated from pattern vectors. To increase accuracy for highly divergent sequences, we consider pattern weight and filtering redundancy sub-patterns. Both simulated and real data demonstrates our method outperform other alignment free methods, especially for large sequence set with low similarity.

Original languageEnglish
Article number73
JournalGenes
Volume10
Issue number2
DOIs
StatePublished - 1 Jan 2019

Keywords

  • Alignment free
  • Multiple sequence alignment
  • Phylogenetic tree
  • Sequential pattern mining

Fingerprint

Dive into the research topics of 'PVTree: A sequential pattern mining method for alignment independent phylogeny reconstruction'. Together they form a unique fingerprint.

Cite this