Machine Learning May Sometimes Simply Capture LiteraturePopularity Trends: A Case Study of Heterocyclic Suzuki-MiyauraCoupling br

Beker, Wiktor; Roszak, Rafal; Wolos, Agnieszka; Angello, Nicholas H.; Rathore, Vandana; Burke, Martin D.; Grzybowski, Bartosz A.

doi:10.1021/jacs.1c12005

Scholarworks@UNIST

UNIST Library

File Download

000777169400018.pdf.pdf

SFX Link

Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Related Researcher

GrzybowskiBartosz Andrzej

Grzybowski, Bartosz A.

Read More

Views & Downloads

Detailed Information

Cited time in webofscience

Cited time in scopus

Metadata Downloads

Full metadata record

DC Field	Value	Language
dc.citation.endPage	4827	-
dc.citation.number	11	-
dc.citation.startPage	4819	-
dc.citation.title	JOURNAL OF THE AMERICAN CHEMICAL SOCIETY	-
dc.citation.volume	144	-
dc.contributor.author	Beker, Wiktor	-
dc.contributor.author	Roszak, Rafal	-
dc.contributor.author	Wolos, Agnieszka	-
dc.contributor.author	Angello, Nicholas H.	-
dc.contributor.author	Rathore, Vandana	-
dc.contributor.author	Burke, Martin D.	-
dc.contributor.author	Grzybowski, Bartosz A.	-
dc.date.accessioned	2023-12-21T14:22:44Z	-
dc.date.available	2023-12-21T14:22:44Z	-
dc.date.created	2022-05-03	-
dc.date.issued	2022-03	-
dc.description.abstract	Applications of machine learning (ML) to synthetic chemistry rely on the assumption that large numbers ofliterature-reported examples should enable construction of accurate and predictive models of chemical reactivity. This paperdemonstrates that abundance of carefully curated literature data may be insufficient for this purpose. Using an example of Suzuki-Miyaura coupling with heterocyclic building blocks & xe0d5;and a carefully selected database of >10,000 literature examples & xe0d5;we show thatML models cannot offer any meaningful predictions of optimum reaction conditions, even if the search space is restricted to onlysolvents and bases. This result holds irrespective of the ML model applied (from simple feed-forward to state-of-the-art graph-convolution neural networks) or the representation to describe the reaction partners (variousfingerprints, chemical descriptors,latent representations, etc.). In all cases, the ML methods fail to perform significantly better than naive assignments based on thesheer frequency of certain reaction conditions reported in the literature. These unsatisfactory results likely reflect subjectivepreferences of various chemists to use certain protocols, other biasing factors as mundane as availability of certain solvents/reagents,and/or a lack of negative data. Thesefindings highlight the likely importance of systematically generating reliable and standardizeddata sets for algorithm training.	-
dc.identifier.bibliographicCitation	JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, v.144, no.11, pp.4819 - 4827	-
dc.identifier.doi	10.1021/jacs.1c12005	-
dc.identifier.issn	0002-7863	-
dc.identifier.scopusid	2-s2.0-85126567677	-
dc.identifier.uri	https://scholarworks.unist.ac.kr/handle/201301/61157	-
dc.identifier.url	https://pubs.acs.org/doi/10.1021/jacs.1c12005	-
dc.identifier.wosid	000777169400018	-
dc.language	영어	-
dc.publisher	AMER CHEMICAL SOC	-
dc.title	Machine Learning May Sometimes Simply Capture LiteraturePopularity Trends: A Case Study of Heterocyclic Suzuki-MiyauraCoupling br	-
dc.type	Article	-
dc.description.isOpenAccess	TRUE	-
dc.relation.journalWebOfScienceCategory	Chemistry, Multidisciplinary	-
dc.relation.journalResearchArea	Chemistry	-
dc.type.docType	Article	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordPlus	CROSS-COUPLING REACTIONS	-
dc.subject.keywordPlus	PREDICTION	-
dc.subject.keywordPlus	CLASSIFICATION	-
dc.subject.keywordPlus	ALGORITHM	-
dc.subject.keywordPlus	COMPUTER	-
dc.subject.keywordPlus	ALLOWS	-
dc.subject.keywordPlus	GO	-

Show Simple Item Record

qrcode

RSS 1.0 RSS 2.0

UNIST | Library

Tel : 052-217-1404 / Email : scholarworks@unist.ac.kr

ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.