Efficient detection of copy-number variations using exome data: Batch- and sex-based analyses

  • Yuri Uchiyama
  • , Daisuke Yamaguchi
  • , Kazuhiro Iwama
  • , Satoko Miyatake
  • , Kohei Hamanaka
  • , Naomi Tsuchida
  • , Hiromi Aoi
  • , Yoshiteru Azuma
  • , Toshiyuki Itai
  • , Ken Saida
  • , Hiromi Fukuda
  • , Futoshi Sekiguchi
  • , Tomohiro Sakaguchi
  • , Ming Lei
  • , Sachiko Ohori
  • , Masamune Sakamoto
  • , Mitsuhiro Kato
  • , Takayoshi Koike
  • , Yukitoshi Takahashi
  • , Koichi Tanda
  • Yuki Hyodo, Rachel S. Honjo, Debora Romeo Bertola, Chong Ae Kim, Masahide Goto, Tetsuya Okazaki, Hiroyuki Yamada, Yoshihiro Maegaki, Hitoshi Osaka, Lock Hock Ngu, Ch'ng G. Siew, Keng W. Teik, Manami Akasaka, Hiroshi Doi, Fumiaki Tanaka, Tomohide Goto, Long Guo, Shiro Ikegawa, Kazuhiro Haginoya, Muzhirah Haniffa, Nozomi Hiraishi, Yoko Hiraki, Satoru Ikemoto, Atsuro Daida, Shin ichiro Hamano, Masaki Miura, Akihiko Ishiyama, Osamu Kawano, Akane Kondo, Hiroshi Matsumoto, Nobuhiko Okamoto, Tohru Okanishi, Yukimi Oyoshi, Eri Takeshita, Toshifumi Suzuki, Yoshiyuki Ogawa, Hiroshi Handa, Yayoi Miyazono, Eriko Koshimizu, Atsushi Fujita, Atsushi Takata, Noriko Miyake, Takeshi Mizuguchi, Naomichi Matsumoto

Research output: Contribution to journalArticlepeer-review

29 Scopus citations

Abstract

Many algorithms to detect copy number variations (CNVs) using exome sequencing (ES) data have been reported and evaluated on their sensitivity and specificity, reproducibility, and precision. However, operational optimization of such algorithms for a better performance has not been fully addressed. ES of 1199 samples including 763 patients with different disease profiles was performed. ES data were analyzed to detect CNVs by both the eXome Hidden Markov Model (XHMM) and modified Nord's method. To efficiently detect rare CNVs, we aimed to decrease sequencing biases by analyzing, at the same time, the data of all unrelated samples sequenced in the same flow cell as a batch, and to eliminate sex effects of X-linked CNVs by analyzing female and male sequences separately. We also applied several filtering steps for more efficient CNV selection. The average number of CNVs detected in one sample was <5. This optimization together with targeted CNV analysis by Nord's method identified pathogenic/likely pathogenic CNVs in 34 patients (4.5%, 34/763). In particular, among 142 patients with epilepsy, the current protocol detected clinically relevant CNVs in 19 (13.4%) patients, whereas the previous protocol identified them in only 14 (9.9%) patients. Thus, this batch-based XHMM analysis efficiently selected rare pathogenic CNVs in genetic diseases.

Original languageEnglish
Pages (from-to)50-65
Number of pages16
JournalHuman Mutation
Volume42
Issue number1
DOIs
StatePublished - Jan 2021
Externally publishedYes

Keywords

  • XHMM
  • copy number variation
  • exome sequencing
  • jNord
  • mendelian disorder

Fingerprint

Dive into the research topics of 'Efficient detection of copy-number variations using exome data: Batch- and sex-based analyses'. Together they form a unique fingerprint.

Cite this