File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

김동혁

Kim, Donghyuk
Systems Biology and Machine Learning Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Deep learning for NAD/NADP cofactor prediction and engineering using transformer attention analysis in enzymes

Author(s)
Kim, JaehyungWoo, JihoonPark, Joon YoungKim, Kyung-JinKim, Donghyuk
Issued Date
2025-01
DOI
10.1016/j.ymben.2024.11.007
URI
https://scholarworks.unist.ac.kr/handle/201301/85272
Citation
METABOLIC ENGINEERING, v.87, pp.86 - 94
Abstract
Understanding and manipulating the cofactor preferences of NAD(P)-dependent oxidoreductases, the most widely distributed enzyme group in nature, is increasingly crucial in bioengineering. However, large-scale identification of the cofactor preferences and the design of mutants to switch cofactor specificity remain as complex tasks. Here, we introduce DISCODE (Deep learning-based Iterative pipeline to analyze Specificity of COfactors and to Design Enzyme), a novel transformer-based deep learning model to predict NAD(P) cofactor preferences. For model training, a total of 7,132 NAD(P)-dependent enzyme sequences were collected. Leveraging whole-length sequence information, DISCODE classifies the cofactor preferences of NAD(P)dependent oxidoreductase protein sequences without structural or taxonomic limitation. The model showed 97.4% and 97.3% of accuracy and F1 score, respectively. A notable feature of DISCODE is the interpretability of its transformer layers. Analysis of attention layers in the model enables identification of several residues that showed significantly higher attention weights. They were well aligned with structurally important residues that closely interact with NAD(P), facilitating the identification of key residues for determining cofactor specificities. These key residues showed high consistency with verified cofactor switching mutants. Integrated into an enzyme design pipeline, DISCODE coupled with attention analysis, enables a fully automated approach to redesign cofactor specificity.
Publisher
ACADEMIC PRESS INC ELSEVIER SCIENCE
ISSN
1096-7176
Keyword (Author)
NAD(P) specificityCofactor switchingDeep learningExplainable AIProtein engineeringSynthetic biology
Keyword
COENZYME SPECIFICITYREDUCTASEBINDINGCLASSIFICATIONDEHYDROGENASEPREFERENCEPHOSPHATESUBSTRATESEQUENCE

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.