TY - JOUR
T1 - Genome organization of the SARS-CoV.
AU - Xu, Jing
AU - Hu, Jianfei
AU - Wang, Jing
AU - Han, Yujun
AU - Hu, Yongwu
AU - Wen, Jie
AU - Li, Yan
AU - Ji, Jia
AU - Ye, Jia
AU - Zhang, Zizhang
AU - Wei, Wei
AU - Li, Songgang
AU - Wang, Jun
AU - Wang, Jian
AU - Yu, Jun
AU - Yang, Huanming
PY - 2003/8
Y1 - 2003/8
N2 - Annotation of the genome sequence of the SARS-CoV (severe acute respiratory syndrome-associated coronavirus) is indispensable to understand its evolution and pathogenesis. We have performed a full annotation of the SARS-CoV genome sequences by using annotation programs publicly available or developed by ourselves. Totally, 21 open reading frames (ORFs) of genes or putative uncharacterized proteins (PUPs) were predicted. Seven PUPs had not been reported previously, and two of them were predicted to contain transmembrane regions. Eight ORFs partially overlapped with or embedded into those of known genes, revealing that the SARS-CoV genome is a small and compact one with overlapped coding regions. The most striking discovery is that an ORF locates on the minus strand. We have also annotated non-coding regions and identified the transcription regulating sequences (TRS) in the intergenic regions. The analysis of TRS supports the minus strand extending transcription mechanism of coronavirus. The SNP analysis of different isolates reveals that mutations of the sequences do not affect the prediction results of ORFs.
AB - Annotation of the genome sequence of the SARS-CoV (severe acute respiratory syndrome-associated coronavirus) is indispensable to understand its evolution and pathogenesis. We have performed a full annotation of the SARS-CoV genome sequences by using annotation programs publicly available or developed by ourselves. Totally, 21 open reading frames (ORFs) of genes or putative uncharacterized proteins (PUPs) were predicted. Seven PUPs had not been reported previously, and two of them were predicted to contain transmembrane regions. Eight ORFs partially overlapped with or embedded into those of known genes, revealing that the SARS-CoV genome is a small and compact one with overlapped coding regions. The most striking discovery is that an ORF locates on the minus strand. We have also annotated non-coding regions and identified the transcription regulating sequences (TRS) in the intergenic regions. The analysis of TRS supports the minus strand extending transcription mechanism of coronavirus. The SNP analysis of different isolates reveals that mutations of the sequences do not affect the prediction results of ORFs.
UR - https://www.scopus.com/pages/publications/13744262562
U2 - 10.1016/S1672-0229(03)01028-3
DO - 10.1016/S1672-0229(03)01028-3
M3 - 文章
C2 - 15629035
AN - SCOPUS:13744262562
SN - 1672-0229
VL - 1
SP - 226
EP - 235
JO - Genomics, proteomics & bioinformatics / Beijing Genomics Institute
JF - Genomics, proteomics & bioinformatics / Beijing Genomics Institute
IS - 3
ER -