TY - JOUR
T1 - AutoGenome
T2 - An AutoML tool for genomic research
AU - Liu, Denghui
AU - Xu, Chi
AU - He, Wenjun
AU - Xu, Zhimeng
AU - Fu, Wenqi
AU - Zhang, Lei
AU - Yang, Jie
AU - Wang, Zhihao
AU - Liu, Bing
AU - Peng, Guangdun
AU - Han, Dali
AU - Bai, Xiaolong
AU - Qiao, Nan
N1 - Publisher Copyright: © 2021
PY - 2021/12
Y1 - 2021/12
AB - Deep learning has achieved great success in established fields such as computer vision (CV), natural language processing (NLP), and speech processing. These advances have inspired researchers in genomics and made deep learning an active and popular topic in the field. Convolutional neural networks (CNN) and recurrent neural networks (RNN) are frequently used to solve genomic sequencing and prediction problems, while multilayer perceptrons (MLP) and autoencoders (AE) are frequently used for genomic profiling data such as RNA expression data and gene mutation data. Here, we introduce a new neural network architecture, the residual fully-connected neural network (RFCN), and describe its advantages in modeling genomic profiling data. We also incorporate AutoML algorithms and implement AutoGenome, an end-to-end, automated deep learning framework for genomic studies. By combining the proposed RFCN architecture with automatic hyper-parameter search and neural architecture search algorithms, AutoGenome can automatically train high-performance deep learning models for various kinds of genomic profiling data. To help researchers better understand the trained models, AutoGenome can assess the importance of different features and export the most critical features for supervised learning tasks and the representative latent vectors for unsupervised learning tasks. We expect AutoGenome to become a popular tool in genomic studies.
KW - AutoGenome
KW - AutoML
KW - Deep learning
KW - Genomic
KW - Residual fully-connected neural network
UR - https://www.scopus.com/pages/publications/85136591199
U2 - 10.1016/j.ailsci.2021.100017
DO - 10.1016/j.ailsci.2021.100017
M3 - Article
AN - SCOPUS:85136591199
SN - 2667-3185
VL - 1
JO - Artificial Intelligence in the Life Sciences
JF - Artificial Intelligence in the Life Sciences
M1 - 100017
ER -