File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

김동혁

Kim, Donghyuk
Systems Biology and Machine Learning Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Global transcriptional regulatory network for Escherichia coli robustly connects gene expression to transcription factor activities

Author(s)
Fang, XinSastry, AnandMih, NathanKim, DonghyukTan, JustinYurkovich, James T.Lloyd, Colton J.Gao, YeYang, LaurencePalsson, Bernhard O.
Issued Date
2017-09
DOI
10.1073/pnas.1702581114
URI
https://scholarworks.unist.ac.kr/handle/201301/24275
Fulltext
http://www.pnas.org/content/114/38/10286
Citation
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, v.114, no.38, pp.10286 - 10291
Abstract
Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for the Escherichia coli TRN-probably the best characterized TRN-several questions remain. Here, we address three questions: (i) How complete is our knowledge of the E. coli TRN; (ii) how well can we predict gene expression using this TRN; and (iii) how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism's TRN from disparate data types.
Publisher
NATL ACAD SCIENCES
ISSN
0027-8424
Keyword (Author)
transcriptional regulationtranscriptomicsmatrix factorizationregression
Keyword
NONNEGATIVE MATRIX FACTORIZATIONGENOME-SCALEMYCOBACTERIUM-TUBERCULOSISHIGH-THROUGHPUTK-12 MG1655RECONSTRUCTIONMETABOLISMDISCOVERY

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.