BROWSE

Related Researcher

Author

Grzybowski, Bartosz A.
School of Natural Science
Research Interests
  • Nano science

ITEM VIEW & DOWNLOAD

Linguistic measures of chemical diversity and the "keywords" of molecular collections

Cited 0 times inthomson ciCited 0 times inthomson ci
Title
Linguistic measures of chemical diversity and the "keywords" of molecular collections
Author
Wozniak, MichalWolos, AgnieszkaModrzyk, UrszulaGorski, Rafal L.Winkowski, JanBajczyk, MichalSzymkuc, SaraGrzybowski, Bartosz A.Eder, Maciej
Issue Date
201805
Publisher
NATURE PUBLISHING GROUP
Citation
SCIENTIFIC REPORTS, v.8, no., pp.7598 -
Abstract
Computerized linguistic analyses have proven of immense value in comparing and searching through large text collections ("corpora"), including those deposited on the Internet-indeed, it would nowadays be hard to imagine browsing the Web without, for instance, search algorithms extracting most appropriate keywords from documents. This paper describes how such corpus-linguistic concepts can be extended to chemistry based on characteristic "chemical words" that span more than traditional functional groups and, instead, look at common structural fragments molecules share. Using these words, it is possible to quantify the diversity of chemical collections/databases in new ways and to define molecular "keywords" by which such collections are best characterized and annotated.
URI
Go to Link
DOI
http://dx.doi.org/10.1038/s41598-018-25440-6
ISSN
2045-2322
Appears in Collections:
SNS_Journal Papers
Files in This Item:
000432109500001.pdfDownload

find_unist can give you direct access to the published full text of this article. (UNISTARs only)

Show full item record

qr_code

  • mendeley

    citeulike

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

MENU