Full metadata record
DC Field | Value | Language |
---|---|---|
dc.citation.endPage | 4827 | - |
dc.citation.number | 11 | - |
dc.citation.startPage | 4819 | - |
dc.citation.title | JOURNAL OF THE AMERICAN CHEMICAL SOCIETY | - |
dc.citation.volume | 144 | - |
dc.contributor.author | Beker, Wiktor | - |
dc.contributor.author | Roszak, Rafal | - |
dc.contributor.author | Wolos, Agnieszka | - |
dc.contributor.author | Angello, Nicholas H. | - |
dc.contributor.author | Rathore, Vandana | - |
dc.contributor.author | Burke, Martin D. | - |
dc.contributor.author | Grzybowski, Bartosz A. | - |
dc.date.accessioned | 2023-12-21T14:22:44Z | - |
dc.date.available | 2023-12-21T14:22:44Z | - |
dc.date.created | 2022-05-03 | - |
dc.date.issued | 2022-03 | - |
dc.description.abstract | Applications of machine learning (ML) to synthetic chemistry rely on the assumption that large numbers ofliterature-reported examples should enable construction of accurate and predictive models of chemical reactivity. This paperdemonstrates that abundance of carefully curated literature data may be insufficient for this purpose. Using an example of Suzuki-Miyaura coupling with heterocyclic building blocks & xe0d5;and a carefully selected database of >10,000 literature examples & xe0d5;we show thatML models cannot offer any meaningful predictions of optimum reaction conditions, even if the search space is restricted to onlysolvents and bases. This result holds irrespective of the ML model applied (from simple feed-forward to state-of-the-art graph-convolution neural networks) or the representation to describe the reaction partners (variousfingerprints, chemical descriptors,latent representations, etc.). In all cases, the ML methods fail to perform significantly better than naive assignments based on thesheer frequency of certain reaction conditions reported in the literature. These unsatisfactory results likely reflect subjectivepreferences of various chemists to use certain protocols, other biasing factors as mundane as availability of certain solvents/reagents,and/or a lack of negative data. Thesefindings highlight the likely importance of systematically generating reliable and standardizeddata sets for algorithm training. | - |
dc.identifier.bibliographicCitation | JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, v.144, no.11, pp.4819 - 4827 | - |
dc.identifier.doi | 10.1021/jacs.1c12005 | - |
dc.identifier.issn | 0002-7863 | - |
dc.identifier.scopusid | 2-s2.0-85126567677 | - |
dc.identifier.uri | https://scholarworks.unist.ac.kr/handle/201301/61157 | - |
dc.identifier.url | https://pubs.acs.org/doi/10.1021/jacs.1c12005 | - |
dc.identifier.wosid | 000777169400018 | - |
dc.language | 영어 | - |
dc.publisher | AMER CHEMICAL SOC | - |
dc.title | Machine Learning May Sometimes Simply Capture LiteraturePopularity Trends: A Case Study of Heterocyclic Suzuki-MiyauraCoupling br | - |
dc.type | Article | - |
dc.description.isOpenAccess | TRUE | - |
dc.relation.journalWebOfScienceCategory | Chemistry, Multidisciplinary | - |
dc.relation.journalResearchArea | Chemistry | - |
dc.type.docType | Article | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.subject.keywordPlus | CROSS-COUPLING REACTIONS | - |
dc.subject.keywordPlus | PREDICTION | - |
dc.subject.keywordPlus | CLASSIFICATION | - |
dc.subject.keywordPlus | ALGORITHM | - |
dc.subject.keywordPlus | COMPUTER | - |
dc.subject.keywordPlus | ALLOWS | - |
dc.subject.keywordPlus | GO | - |
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Tel : 052-217-1404 / Email : scholarworks@unist.ac.kr
Copyright (c) 2023 by UNIST LIBRARY. All rights reserved.
ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.