Detection of False-Positive Deletions from the Database of Genomic Variants

  • Junbo Duan
  • , Han Liu
  • , Lanling Zhao
  • , Xiguo Yuan
  • , Yu Ping Wang
  • , Mingxi Wan

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Next generation sequencing is an emerging technology that has been widely used in the detection of genomic variants. However, since its depth of coverage, a main signature used for variant calling, is affected greatly by biases such as GC content and mappability, some callings are false positives. In this study, we utilized paired-end read mapping, another signature that is not affected by the aforementioned biases, to detect false-positive deletions in the database of genomic variants. We first identified 1923 suspicious variants that may be false positives and then conducted validation studies on each suspicious variant, which detected 583 false-positive deletions. Finally we analysed the distribution of these false positives by chromosome, sample, and size. Hopefully, incorrect documentation and annotations in downstream studies can be avoided by correcting these false positives in public repositories.

Original languageEnglish
Article number8420547
JournalBioMed Research International
Volume2019
DOIs
StatePublished - 2019

Fingerprint

Dive into the research topics of 'Detection of False-Positive Deletions from the Database of Genomic Variants'. Together they form a unique fingerprint.

Cite this