Related Researcher

김동혁

Kim, Donghyuk
Systems Biology and Machine Learning Lab.

Design of synthetic promoters for cyanobacteria with generative deep-learning model

Author(s)
Seo, Euijin; Choi, Yun-Nam; Shin, Ye Rim; Kim, Donghyuk; Lee, Jeong Wook
Issued Date
2023-07
DOI
10.1093/nar/gkad451
URI
https://scholarworks.unist.ac.kr/handle/201301/64717
Citation
NUCLEIC ACIDS RESEARCH, v.51, no.13, pp.7071 - 7082
Abstract
Deep generative models, which can approximate complex data distributions from large datasets, are widely used in biological dataset analysis. In particular, they can identify and unravel hidden traits encoded within a complicated nucleotide sequence, allowing us to design genetic parts with accuracy. Here, we provide a deep-learning-based generic framework to design and evaluate synthetic promoters for cyanobacteria using generative models, which was in turn validated with a cell-free transcription assay. We developed a deep generative model and a predictive model using a variational autoencoder and a convolutional neural network, respectively. Using native promoter sequences of the model unicellular cyanobacterium Synechocystis sp. PCC 6803 as a training dataset, we generated 10 000 synthetic promoter sequences and predicted their strengths. By position weight matrix and k-mer analyses, we confirmed that our model captured valid features of cyanobacteria promoters from the dataset. Furthermore, critical subregion identification analysis consistently revealed the importance of the -10 box sequence motif in cyanobacteria promoters. Moreover, we validated via a cell-free transcription assay that a generated promoter sequence can efficiently drive transcription. This approach, combining in silico and in vitro studies, will provide a foundation for the rapid design and validation of synthetic promoters, especially for non-model organisms.
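The abstract mentions position weight matrix and k-mer analyses as checks on the generated sequences. The following is a minimal sketch of what such analyses could look like, not the authors' implementation: the function names, the pseudocount, the uniform background frequency, and the toy sequences are all assumptions made for illustration, and none of the data comes from the paper.

```python
from collections import Counter
import math

BASES = "ACGT"


def kmer_counts(seqs, k):
    """Count every overlapping k-mer across a list of sequences."""
    counts = Counter()
    for s in seqs:
        for i in range(len(s) - k + 1):
            counts[s[i:i + k]] += 1
    return counts


def position_weight_matrix(seqs, pseudocount=1.0, background=0.25):
    """Build a log2-odds PWM from equal-length, pre-aligned sequences.

    The pseudocount and uniform background are illustrative assumptions.
    """
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("all sequences must have the same length")
    total = len(seqs) + pseudocount * len(BASES)
    pwm = []
    for pos in range(length):
        column = Counter(s[pos] for s in seqs)
        pwm.append({
            b: math.log2(((column[b] + pseudocount) / total) / background)
            for b in BASES
        })
    return pwm


# Hypothetical toy sequences resembling a -10 box; not data from the study.
toy = ["TATAAT", "TATACT", "TAAAAT", "TATGAT"]
dimers = kmer_counts(toy, 2)
pwm = position_weight_matrix(toy)
```

Comparing the k-mer counts or PWM columns of generated sequences against those of native promoters is one simple way to judge whether a generator has reproduced known motifs.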
Publisher
OXFORD UNIV PRESS
ISSN
0305-1048
Keyword
CONVOLUTIONAL NEURAL-NETWORKS; GENE-EXPRESSION

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.