Skip to main navigation Skip to search Skip to main content

Modeling exome sequencing data with generalized Gaussian distribution with application to copy number variation detection

  • Junbo Duan
  • , Mingxi Wan
  • , Hong Wen Deng
  • , Yu Ping Wang
  • Xi'an Jiaotong University
  • Tulane University
  • University of Shanghai for Science and Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Exome sequencing provides us an effective way to discover genetic factors that might be associated with phenotypes for complex diseases. Compared with the whole-genome sequencing, exome sequencing can satisfy the high sequencing coverage requirement while under the limited budge constraint. However, due to the nature that exons are distributed sparsely along the genome, and the technical variability between samples, the analysis of exome sequencing data is complicated and direct utilization of current whole-genome sequencing targeted methods yields wrong results. In this paper, we propose a novel model to represent the exome sequencing data. Under this model, we show that the technical variability as well as random sequencing error follow the generalized Gaussian distribution. Based on this observation, we propose a method to detect the copy number variation. Studies on real data from 1000 Genomes Projects validate the proposed algorithm.

Original languageEnglish
Title of host publicationProceedings - 2013 IEEE International Conference on Bioinformatics and Biomedicine, IEEE BIBM 2013
Pages1-7
Number of pages7
DOIs
StatePublished - 2013
Event2013 IEEE International Conference on Bioinformatics and Biomedicine, IEEE BIBM 2013 - Shanghai, China
Duration: 18 Dec 201321 Dec 2013

Publication series

NameProceedings - 2013 IEEE International Conference on Bioinformatics and Biomedicine, IEEE BIBM 2013

Conference

Conference2013 IEEE International Conference on Bioinformatics and Biomedicine, IEEE BIBM 2013
Country/TerritoryChina
CityShanghai
Period18/12/1321/12/13

Keywords

  • 1000 Genomes Project
  • Next generation sequencing
  • copy number variation
  • exome sequencing
  • generalized Gaussian distribution
  • iteratively reweighted least squares

Fingerprint

Dive into the research topics of 'Modeling exome sequencing data with generalized Gaussian distribution with application to copy number variation detection'. Together they form a unique fingerprint.

Cite this