Skip to main navigation Skip to search Skip to main content

TNSim: A Tumor Sequencing Data Simulator for Incorporating Clonality Information

  • Yu Geng
  • , Zhongmeng Zhao
  • , Mingzhe Xu
  • , Xuanping Zhang
  • , Xiao Xiao
  • , Jiayin Wang
  • Xi'an Jiaotong University
  • Jinzhou Medical University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

In recent years, the next generation sequencing enables us to obtain high resolution landscapes of the genetic changes at single-nucleotide level. More and more novel methods are proposed for efficient and effective analyses on cancer sequencing data. To facilitate such development, data simulator is a crucial tool, which not only tests and evaluates proposed approaches, but provides the feedbacks for further improvements as well. Several simulators are released to generate the next generation sequencing data. However, based on our best knowledge, none of them considers clonality information. It is suggested that clonal heterogeneity does widely exist in tumor samples. The patterns of somatic mutational events usually expose a wide spectrum of variant allelic frequencies, while some of them are only detectable in one or multiple clonal lineages. In this article, we introduce a Tumor-Normal sequencing Simulator, TNSim, to generate the next generation sequencing data by involving clonality information. The simulator is able to mimic a tumor sample and the paired normal sample, where the germline variants and somatic mutations can be settled respectively. Tumor purity is adjustable. Clonal architecture is preassigned as one or more clonal lineages, where each lineage consists of a set of somatic mutations whose variant allelic frequencies are similar. A group of experiments are conducted to evaluate its performance. The statistical features of the artificial sequencing reads are comparable to the real tumor sequencing data whose sample consists of multiple sub-clones. The source codes are available at http://github.com/lnmxgy/TNSim and for academic use only.

Original languageEnglish
Title of host publicationIntelligent Computing Theories and Application - 14th International Conference, ICIC 2018, Proceedings
EditorsKang-Hyun Jo, De-Shuang Huang, Xiao-Long Zhang
PublisherSpringer Verlag
Pages371-382
Number of pages12
ISBN (Print)9783319959320
DOIs
StatePublished - 2018
Event14th International Conference on Intelligent Computing, ICIC 2018 - Wuhan, China
Duration: 15 Aug 201818 Aug 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10955 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Conference on Intelligent Computing, ICIC 2018
Country/TerritoryChina
CityWuhan
Period15/08/1818/08/18

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Cancer genomics
  • Cancer sequencing data
  • Clonal structure
  • Data simulator

Fingerprint

Dive into the research topics of 'TNSim: A Tumor Sequencing Data Simulator for Incorporating Clonality Information'. Together they form a unique fingerprint.

Cite this