TY - JOUR
T1 - A sparse model based detection of copy number variations from exome sequencing data
AU - Duan, Junbo
AU - Wan, Mingxi
AU - Deng, Hong Wen
AU - Wang, Yu Ping
N1 - Publisher Copyright:
© 1964-2012 IEEE.
PY - 2016/3
Y1 - 2016/3
N2 - Goal: Whole-exome sequencing provides a more cost-effective way than whole-genome sequencing for detecting genetic variants, such as copy number variations (CNVs). Although a number of approaches have been proposed to detect CNVs from whole-genome sequencing, a direct adoption of these approaches to whole-exome sequencing will often fail because exons are separately located along a genome. Therefore, an appropriate method is needed to target the specific features of exome sequencing data. Methods: In this paper, a novel sparse model based method is proposed to discover CNVs from multiple exome sequencing data. First, exome sequencing data are represented with a penalized matrix approximation, and technical variability and random sequencing errors are assumed to follow a generalized Gaussian distribution. Second, an iteratively reweighted least squares algorithm is used to estimate the solution. Results: The method is tested and validated on both synthetic and real data, and compared with other approaches including CoNIFER, XHMM, and cn.MOPS. The test demonstrates that the proposed method outperform other approaches. Conclusion: The proposed sparse model can detect CNVs from exome sequencing data with high power and precision. Significance: Sparse model can target the specific features of exome sequencing data.
AB - Goal: Whole-exome sequencing provides a more cost-effective way than whole-genome sequencing for detecting genetic variants, such as copy number variations (CNVs). Although a number of approaches have been proposed to detect CNVs from whole-genome sequencing, a direct adoption of these approaches to whole-exome sequencing will often fail because exons are separately located along a genome. Therefore, an appropriate method is needed to target the specific features of exome sequencing data. Methods: In this paper, a novel sparse model based method is proposed to discover CNVs from multiple exome sequencing data. First, exome sequencing data are represented with a penalized matrix approximation, and technical variability and random sequencing errors are assumed to follow a generalized Gaussian distribution. Second, an iteratively reweighted least squares algorithm is used to estimate the solution. Results: The method is tested and validated on both synthetic and real data, and compared with other approaches including CoNIFER, XHMM, and cn.MOPS. The test demonstrates that the proposed method outperform other approaches. Conclusion: The proposed sparse model can detect CNVs from exome sequencing data with high power and precision. Significance: Sparse model can target the specific features of exome sequencing data.
KW - Copy number variation (CNV)
KW - Exome sequencing
KW - Iteratively reweighted least squares
KW - Matrix approximation
KW - Sparse modeling
UR - https://www.scopus.com/pages/publications/84963553357
U2 - 10.1109/TBME.2015.2464674
DO - 10.1109/TBME.2015.2464674
M3 - 文章
C2 - 26258935
AN - SCOPUS:84963553357
SN - 0018-9294
VL - 63
SP - 496
EP - 505
JO - IEEE Transactions on Biomedical Engineering
JF - IEEE Transactions on Biomedical Engineering
IS - 3
M1 - 7180343
ER -