File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Prediction and interpretation of pathogenic bacteria occurrence at a recreational beach using data-driven algorithms

Author(s)
Jang, JiyiAbbas, AtherKim, HyeinRhee, ChaeyoungShin, Seung GuChun, Jong AhnBaek, SangsooCho, Kyung Hwa
Issued Date
2023-12
DOI
10.1016/j.ecoinf.2023.102370
URI
https://scholarworks.unist.ac.kr/handle/201301/68046
Citation
ECOLOGICAL INFORMATICS, v.78, pp.102370
Abstract
Recreational beaches face a threat from pathogenic bacteria that harbor antibiotic resistance genes (ARGs). To predict bacterial occurrence and comprehend their non-linear relationship with hydrometeorological features, advanced machine- and deep-learning algorithms were employed. These algorithms include regression trees (RT), as well as interpretable deep-learning algorithms such as the 'Input Attention-Long Short-Term Memory (IA-LSTM)' and 'Temporal Fusion Transformer (TFT)'. Our focus was on predicting the occurrence of Prevotella, a prevalent pathogenic bacterium found at the beaches. Utilizing model-dependent and model-agnostic interpretation methods, which encompass sensitivity analysis, permutation, and the SHapley Additive exPlanations (SHAP) importance, we evaluated model behavior. RT-based algorithms exhibited predictive capabilities comparable to those of IA-LSTM and TFT, achieving validation Nash-Sutcliffe efficiencies of 0.93, 0.94, and 0.96, respectively. However, the deep-learning algorithms (IA-LSTM and TFT) are surpassed in terms of interpretability. The model-dependent interpretation method identified heavy precipitation as a pivotal hydrometeorological feature linked to increased Prevotella occurrence. Notably, the IA-LSTM identified Prevotella as a potential host for the sulfonamide resistance gene (sul1), suggesting the potential of Prevotella as an indicator for sul1. This research, leveraging interpretable data-driven models, advances our understanding of the hydrometeorological features influencing the occurrence of pathogenic bacteria and the prevalence of ARGs at the beach, and enhances predictive capabilities for bacterial occurrence.
Publisher
ELSEVIER
ISSN
1574-9541
Keyword (Author)
Interpretable modelsDeep-learning algorithmsStrategy modelingCumulative importance featuresSimulation
Keyword
GLOBAL SENSITIVITY-ANALYSISESCHERICHIA-COLIMODELSINDEXESWATERS

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.