跳到主要导航 跳到搜索 跳到主要内容

Haplotype-resolved diverse human genomes and integrated analysis of structural variation

  • Peter Ebert
  • , Peter A. Audano
  • , Qihui Zhu
  • , Bernardo Rodriguez-Martin
  • , David Porubsky
  • , Marc Jan Bonder
  • , Arvis Sulovari
  • , Jana Ebler
  • , Weichen Zhou
  • , Rebecca Serra Mari
  • , Feyza Yilmaz
  • , Xuefang Zhao
  • , Ping Hsun Hsieh
  • , Joyce Lee
  • , Sushant Kumar
  • , Jiadong Lin
  • , Tobias Rausch
  • , Yu Chen
  • , Jingwen Ren
  • , Martin Santamarina
  • Wolfram Höps, Hufsah Ashraf, Nelson T. Chuang, Xiaofei Yang, Katherine M. Munson, Alexandra P. Lewis, Susan Fairley, Luke J. Tallon, Wayne E. Clarke, Anna O. Basile, Marta Byrska-Bishop, André Corvelo, Uday S. Evani, Tsung Yu Lu, Mark J.P. Chaisson, Junjie Chen, Chong Li, Harrison Brand, Aaron M. Wenger, Maryam Ghareghani, William T. Harvey, Benjamin Raeder, Patrick Hasenfeld, Allison A. Regier, Haley J. Abel, Ira M. Hall, Paul Flicek, Oliver Stegle, Mark B. Gerstein, Jose M.C. Tubio, Zepeng Mu, Yang I. Li, Xinghua Shi, Alex R. Hastie, Kai Ye, Zechen Chong, Ashley D. Sanders, Michael C. Zody, Michael E. Talkowski, Ryan E. Mills, Scott E. Devine, Charles Lee, Jan O. Korbel, Tobias Marschall, Evan E. Eichler
  • Heinrich Heine University Düsseldorf
  • University of Washington
  • Jackson Laboratory
  • European Molecular Biology Laboratory
  • German Cancer Research Center
  • University of Michigan, Ann Arbor
  • Harvard University
  • Broad Institute
  • Bionano Genomics Inc.
  • Yale University
  • Xi'an Jiaotong University
  • University of Alabama at Birmingham
  • University of Southern California
  • University of Santiago de Compostela
  • University of Maryland, Baltimore
  • New York Genome Center
  • Temple University
  • Pacific Biosciences
  • Max Planck Institute for Informatics
  • Saarland University
  • Washington University St. Louis
  • The University of Chicago
  • Ewha Womans University

科研成果: 期刊稿件文章同行评审

492 引用 (Scopus)

摘要

Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent-child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average minimum contig length needed to cover 50% of the genome: 26 million base pairs) integrate all forms of genetic variation, even across complex loci. We identified 107,590 structural variants (SVs), of which 68% were not discovered with short-read sequencing, and 278 SV hotspots (spanning megabases of gene-rich sequence). We characterized 130 of the most active mobile element source elements and found that 63% of all SVs arise through homology-mediated mechanisms. This resource enables reliable graph-based genotyping from short reads of up to 50,340 SVs, resulting in the identification of 1526 expression quantitative trait loci as well as SV candidates for adaptive selection within the human population.

源语言英语
文章编号48
期刊Science
372
6537
DOI
出版状态已出版 - 2 4月 2021

学术指纹

探究 'Haplotype-resolved diverse human genomes and integrated analysis of structural variation' 的科研主题。它们共同构成独一无二的指纹。

引用此