File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

김영근

Kim, Younggeun
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Full metadata record

DC Field Value Language
dc.citation.endPage 667 -
dc.citation.startPage 661 -
dc.citation.title PATTERN RECOGNITION LETTERS -
dc.citation.volume 125 -
dc.contributor.author Kim, Young-geun -
dc.contributor.author Kwon, Yongchan -
dc.contributor.author Paik, Myunghee Cho -
dc.date.accessioned 2026-03-05T14:33:02Z -
dc.date.available 2026-03-05T14:33:02Z -
dc.date.created 2026-02-27 -
dc.date.issued 2019-07 -
dc.description.abstract An imbalance is one of the problems in machine learning. When data are not balanced, the correct specification rate for the minor class suffers even if accuracy is high. The oversampling method has been used to address the issue without consideration about the sacrifice of accuracy. In addition, an arbitrary oversampling scheme may introduce bias. In this paper, we propose principled methods of handling imbalance under user-specified constraints on the sensitivity and specificity. Our work consists of three elements of contributions. First, we provide an optimized target proportion that minimizes the maximum error rate under user-specified constraints on sensitivity and specificity. Second, we introduce the notion of resampling at random (RAR) under which the limit of the estimator is not altered from the original sample. These two elements are relevant to any classification methods. Third, we derive asymptotic properties of selected classifiers when we apply RAR oversampling with the target proportion. Finally, we implement the proposed method in an image recognition context using the extracted feature from the last layer of deep convolutional neural networks (CNNs). We present an analysis of fundus data to classify diabetic retinopathy using the proposed method. (C) 2019 Elsevier B.V. All rights reserved. -
dc.identifier.bibliographicCitation PATTERN RECOGNITION LETTERS, v.125, pp.661 - 667 -
dc.identifier.doi 10.1016/j.patrec.2019.07.006 -
dc.identifier.issn 0167-8655 -
dc.identifier.scopusid 2-s2.0-85068870984 -
dc.identifier.uri https://scholarworks.unist.ac.kr/handle/201301/90577 -
dc.identifier.url https://www.sciencedirect.com/science/article/pii/S0167865519301965?via%3Dihub -
dc.identifier.wosid 000482374500091 -
dc.language 영어 -
dc.publisher ELSEVIER -
dc.title Valid oversampling schemes to handle imbalance -
dc.type Article -
dc.description.isOpenAccess FALSE -
dc.relation.journalWebOfScienceCategory Computer Science, Artificial Intelligence -
dc.relation.journalResearchArea Computer Science -
dc.type.docType Article -
dc.description.journalRegisteredClass scie -
dc.description.journalRegisteredClass scopus -
dc.subject.keywordAuthor Optimal oversampling target proportion -
dc.subject.keywordAuthor Resampling at random -
dc.subject.keywordAuthor Medical imaging -
dc.subject.keywordAuthor Imbalance -
dc.subject.keywordAuthor Oversampling -
dc.subject.keywordPlus CLASSIFICATION -

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.