TY - JOUR
T1 - Genome diversity and signatures of natural selection in mainland Southeast Asia
AU - Consortium of Anthropological Research in Southeast Asia and Southwest China (CASEAC)
AU - He, Yaoxi
AU - Zhang, Xiaoming
AU - Peng, Min Sheng
AU - Li, Yu Chun
AU - Liu, Kai
AU - Zhang, Yu
AU - Mao, Leyan
AU - Guo, Yongbo
AU - Ma, Yujie
AU - Zhou, Bin
AU - Zheng, Wangshan
AU - Yue, Tian
AU - Liao, Yuwen
AU - Liang, Shen Ao
AU - Chen, Lu
AU - Zhang, Weijie
AU - Chen, Xiaoning
AU - Tang, Bixia
AU - Yang, Xiaofei
AU - Ye, Kai
AU - Gao, Shenghan
AU - Lu, Yurun
AU - Wang, Yong
AU - Wan, Shijie
AU - Hao, Rushan
AU - Wang, Xuankai
AU - Mao, Yafei
AU - Dai, Shanshan
AU - Gao, Zongliang
AU - Yang, Li Qin
AU - Guo, Jianxin
AU - Li, Jiangguo
AU - Liu, Chao
AU - Wang, Jianhua
AU - Sovannary, Tuot
AU - Bunnath, Long
AU - Kampuansai, Jatupol
AU - Inta, Angkhana
AU - Srikummool, Metawee
AU - Kutanan, Wibhu
AU - Ho, Huy Quang
AU - Pham, Khoa Dang
AU - Singthong, Sommay
AU - Sochampa, Somphad
AU - Kyaing, U. Win
AU - Pongamornkul, Wittaya
AU - Morlaeku, Chutima
AU - Rattanakrajangsri, Kittisak
AU - Kong, Qing Peng
AU - Zhang, Ya Ping
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Nature Limited 2025.
PY - 2025/7/10
Y1 - 2025/7/10
N2 - Mainland Southeast Asia (MSEA) has rich ethnic and cultural diversity with a population of nearly 300 million1,2. However, people from MSEA are underrepresented in the current human genomic databases. Here we present the SEA3K genome dataset (phase I), generated by deep short-read whole-genome sequencing of 3,023 individuals from 30 MSEA populations, and long-read whole-genome sequencing of 37 representative individuals. We identified 79.59 million small variants and 96,384 structural variants, among which 22.83 million small variants and 24,622 structural variants are unique to this dataset. We observed a high genetic heterogeneity across MSEA populations, reflected by the varied combinations of genetic components. We identified 44 genomic regions with strong signatures of Darwinian positive selection, covering 89 genes involved in varied physiological systems such as physical traits and immune response. Furthermore, we observed varied patterns of archaic Denisovan introgression in MSEA populations, supporting the proposal of at least two distinct instances of Denisovan admixture into modern humans in Asia3. We also detected genomic regions that suggest adaptive archaic introgressions in MSEA populations. The large number of novel genomic variants in MSEA populations highlight the necessity of studying regional populations that can help answer key questions related to prehistory, genetic adaptation and complex diseases.
AB - Mainland Southeast Asia (MSEA) has rich ethnic and cultural diversity with a population of nearly 300 million1,2. However, people from MSEA are underrepresented in the current human genomic databases. Here we present the SEA3K genome dataset (phase I), generated by deep short-read whole-genome sequencing of 3,023 individuals from 30 MSEA populations, and long-read whole-genome sequencing of 37 representative individuals. We identified 79.59 million small variants and 96,384 structural variants, among which 22.83 million small variants and 24,622 structural variants are unique to this dataset. We observed a high genetic heterogeneity across MSEA populations, reflected by the varied combinations of genetic components. We identified 44 genomic regions with strong signatures of Darwinian positive selection, covering 89 genes involved in varied physiological systems such as physical traits and immune response. Furthermore, we observed varied patterns of archaic Denisovan introgression in MSEA populations, supporting the proposal of at least two distinct instances of Denisovan admixture into modern humans in Asia3. We also detected genomic regions that suggest adaptive archaic introgressions in MSEA populations. The large number of novel genomic variants in MSEA populations highlight the necessity of studying regional populations that can help answer key questions related to prehistory, genetic adaptation and complex diseases.
UR - https://www.scopus.com/pages/publications/105005397486
U2 - 10.1038/s41586-025-08998-w
DO - 10.1038/s41586-025-08998-w
M3 - 文章
C2 - 40369069
AN - SCOPUS:105005397486
SN - 0028-0836
VL - 643
SP - 417
EP - 426
JO - Nature
JF - Nature
IS - 8071
ER -