TY - JOUR
T1 - HRD-MILN
T2 - Accurately estimate tumor homologous recombination deficiency status from targeted panel sequencing data
AU - Wang, Xuwen
AU - Xu, Ying
AU - Zhang, Yinbin
AU - Wang, Shenjie
AU - Zhang, Xuanping
AU - Yi, Xin
AU - Zhang, Shuqun
AU - Wang, Jiayin
N1 - Publisher Copyright:
Copyright © 2022 Wang, Xu, Zhang, Wang, Zhang, Yi, Zhang and Wang.
PY - 2022/9/28
Y1 - 2022/9/28
N2 - Homologous recombination deficiency (HRD) is a critical feature guiding drug and treatment selection, mainly for ovarian and breast cancers. As it cannot be directly observed, HRD status is estimated on a small set of genomic instability features from sequencing data. The existing methods often perform poorly when handling targeted panel sequencing data; however, the targeted panel is the most popular sequencing strategy in clinical practices. Thus, we proposed HRD-MILN to overcome the computational challenges from targeted panel sequencing. HRD-MILN incorporated a multi-instance learning framework to discover as many loss of heterozygosity (LOH) associated with HRD status to cluster as possible. Then the HRD score is obtained based on the association between the LOHs and the cluster in the sample to be estimated, and finally, the HRD status is estimated based on the score. In comparison experiments on targeted panel sequencing data, the Precision of HRD-MILN could achieve 87%, significantly improved from 63% reported by the existing methods, where the highest margin of improvement reached 14%. It also presented advantages on whole exome sequencing data. Based on our best knowledge, HRD-MILN is the first practical tool for estimating HRD status from targeted panel sequencing data and could benefit clinical applications.
AB - Homologous recombination deficiency (HRD) is a critical feature guiding drug and treatment selection, mainly for ovarian and breast cancers. As it cannot be directly observed, HRD status is estimated on a small set of genomic instability features from sequencing data. The existing methods often perform poorly when handling targeted panel sequencing data; however, the targeted panel is the most popular sequencing strategy in clinical practices. Thus, we proposed HRD-MILN to overcome the computational challenges from targeted panel sequencing. HRD-MILN incorporated a multi-instance learning framework to discover as many loss of heterozygosity (LOH) associated with HRD status to cluster as possible. Then the HRD score is obtained based on the association between the LOHs and the cluster in the sample to be estimated, and finally, the HRD status is estimated based on the score. In comparison experiments on targeted panel sequencing data, the Precision of HRD-MILN could achieve 87%, significantly improved from 63% reported by the existing methods, where the highest margin of improvement reached 14%. It also presented advantages on whole exome sequencing data. Based on our best knowledge, HRD-MILN is the first practical tool for estimating HRD status from targeted panel sequencing data and could benefit clinical applications.
KW - cancer genomics
KW - homologous recombination deficiency
KW - multi-instance learning model
KW - sequencing data analysis
KW - targeted panel sequencing
UR - https://www.scopus.com/pages/publications/85141840525
U2 - 10.3389/fgene.2022.990244
DO - 10.3389/fgene.2022.990244
M3 - 文章
AN - SCOPUS:85141840525
SN - 1664-8021
VL - 13
JO - Frontiers in Genetics
JF - Frontiers in Genetics
M1 - 990244
ER -