File Download

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

GrzybowskiBartosz Andrzej

Grzybowski, Bartosz A.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Full metadata record

DC Field Value Language
dc.citation.endPage 4827 -
dc.citation.number 11 -
dc.citation.startPage 4819 -
dc.citation.title JOURNAL OF THE AMERICAN CHEMICAL SOCIETY -
dc.citation.volume 144 -
dc.contributor.author Beker, Wiktor -
dc.contributor.author Roszak, Rafal -
dc.contributor.author Wolos, Agnieszka -
dc.contributor.author Angello, Nicholas H. -
dc.contributor.author Rathore, Vandana -
dc.contributor.author Burke, Martin D. -
dc.contributor.author Grzybowski, Bartosz A. -
dc.date.accessioned 2023-12-21T14:22:44Z -
dc.date.available 2023-12-21T14:22:44Z -
dc.date.created 2022-05-03 -
dc.date.issued 2022-03 -
dc.description.abstract Applications of machine learning (ML) to synthetic chemistry rely on the assumption that large numbers ofliterature-reported examples should enable construction of accurate and predictive models of chemical reactivity. This paperdemonstrates that abundance of carefully curated literature data may be insufficient for this purpose. Using an example of Suzuki-Miyaura coupling with heterocyclic building blocks & xe0d5;and a carefully selected database of >10,000 literature examples & xe0d5;we show thatML models cannot offer any meaningful predictions of optimum reaction conditions, even if the search space is restricted to onlysolvents and bases. This result holds irrespective of the ML model applied (from simple feed-forward to state-of-the-art graph-convolution neural networks) or the representation to describe the reaction partners (variousfingerprints, chemical descriptors,latent representations, etc.). In all cases, the ML methods fail to perform significantly better than naive assignments based on thesheer frequency of certain reaction conditions reported in the literature. These unsatisfactory results likely reflect subjectivepreferences of various chemists to use certain protocols, other biasing factors as mundane as availability of certain solvents/reagents,and/or a lack of negative data. Thesefindings highlight the likely importance of systematically generating reliable and standardizeddata sets for algorithm training. -
dc.identifier.bibliographicCitation JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, v.144, no.11, pp.4819 - 4827 -
dc.identifier.doi 10.1021/jacs.1c12005 -
dc.identifier.issn 0002-7863 -
dc.identifier.scopusid 2-s2.0-85126567677 -
dc.identifier.uri https://scholarworks.unist.ac.kr/handle/201301/61157 -
dc.identifier.url https://pubs.acs.org/doi/10.1021/jacs.1c12005 -
dc.identifier.wosid 000777169400018 -
dc.language 영어 -
dc.publisher AMER CHEMICAL SOC -
dc.title Machine Learning May Sometimes Simply Capture LiteraturePopularity Trends: A Case Study of Heterocyclic Suzuki-MiyauraCoupling br -
dc.type Article -
dc.description.isOpenAccess TRUE -
dc.relation.journalWebOfScienceCategory Chemistry, Multidisciplinary -
dc.relation.journalResearchArea Chemistry -
dc.type.docType Article -
dc.description.journalRegisteredClass scie -
dc.description.journalRegisteredClass scopus -
dc.subject.keywordPlus CROSS-COUPLING REACTIONS -
dc.subject.keywordPlus PREDICTION -
dc.subject.keywordPlus CLASSIFICATION -
dc.subject.keywordPlus ALGORITHM -
dc.subject.keywordPlus COMPUTER -
dc.subject.keywordPlus ALLOWS -
dc.subject.keywordPlus GO -

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.