Deep-learning optimized DEOCSU suite provides an iterable pipeline for accurate ChIP-exo peak calling
Cited 0 times in
Cited 0 times in
- Title
- Deep-learning optimized DEOCSU suite provides an iterable pipeline for accurate ChIP-exo peak calling
- Author
- Bang, Ina; Lee, Sang-Mok; Park, Seojoung; Park, Joon Young; Nong, Linh Khanh; Gao, Ye; Palsson, Bernhard O; Kim, Donghyuk
- Issue Date
- 2023-03
- Publisher
- Oxford University Press
- Citation
- BRIEFINGS IN BIOINFORMATICS, v.24, no.2, pp.bbad024
- Abstract
- Recognizing binding sites of DNA-binding proteins is a key factor for elucidating transcriptional regulation in organisms. ChIP-exo enables researchers to delineate genome-wide binding landscapes of DNA-binding proteins with near single base-pair resolution. However, the peak calling step hinders ChIP-exo application since the published algorithms tend to generate false-positive and false-negative predictions. Here, we report the development of DEOCSU (DEep-learning Optimized ChIP-exo peak calling SUite), a novel machine learning-based ChIP-exo peak calling suite. DEOCSU entails the deep convolutional neural network model which was trained with curated ChIP-exo peak data to distinguish the visualized data of bona fide peaks from false ones. Performance validation of the trained deep-learning model indicated its high accuracy, high precision and high recall of over 95%. Applying the new suite to both in-house and publicly available ChIP-exo datasets obtained from bacteria, eukaryotes and archaea revealed an accurate prediction of peaks containing canonical motifs, highlighting the versatility and efficiency of DEOCSU. Furthermore, DEOCSU can be executed on a cloud computing platform or the local environment. With visualization software included in the suite, adjustable options such as the threshold of peak probability, and iterable updating of the pre-trained model, DEOCSU can be optimized for users’ specific needs.
- URI
- https://scholarworks.unist.ac.kr/handle/201301/62174
- DOI
- 10.1093/bib/bbad024
- ISSN
- 1467-5463
- Appears in Collections:
- ECHE_Journal Papers
- Files in This Item:
- There are no files associated with this item.
can give you direct access to the published full text of this article. (UNISTARs only)
Show full item record
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.