Markov blanket-based universal feature selection for classification and regression of mixed-type data

Lee, Junghye; Jeong, Jun-Yong; Jun, Chi-Hyuck

doi:10.1016/j.eswa.2020.113398

Scholarworks@UNIST

UNIST Library

File Download

There are no files associated with this item.

SFX Link

Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Views & Downloads

Detailed Information

Cited time in webofscience

Cited time in scopus

Metadata Downloads

Full metadata record

DC Field	Value	Language
dc.citation.startPage	113398	-
dc.citation.title	EXPERT SYSTEMS WITH APPLICATIONS	-
dc.citation.volume	158	-
dc.contributor.author	Lee, Junghye	-
dc.contributor.author	Jeong, Jun-Yong	-
dc.contributor.author	Jun, Chi-Hyuck	-
dc.date.accessioned	2023-12-21T16:46:06Z	-
dc.date.available	2023-12-21T16:46:06Z	-
dc.date.created	2020-07-02	-
dc.date.issued	2020-11	-
dc.description.abstract	Feature selection has been successfully applied to improve the quality of data analysis in various expert and intelligent systems. However, because most real-world data nowadays come with mixed features, traditional feature selection approaches that are mainly designed to handle single-type data are not suitable for this situation. In addition, most of existing methods are only applicable to a specific problem, either classification or regression. Therefore, it is an urgent need to develop a universal feature selection method that can be applied to classification and regression with mixed-type data. In response to this, our paper presents a new feature selection method based on a Markov blanket (MB) called Mixed-MB. The key idea behind this is to embed a likelihood ratio-based generalized conditional independence test into an efficient MB search algorithm to find the minimal set of features to fully explain the target variable on mixed-type data. This new MB feature selection method eliminates the weakness of existing MB feature selection method that it only can handle single-type data, while maintaining its strengths such as theoretical soundness, simplicity, speed, and versatility. Experimental results on real-world data sets with mixed features demonstrate that the proposed method is effective for improving the accuracy of prediction models in both classification and regression. It is also shown to be able to yield more accurate results with fewer features than other methods. We believe that Mixed-MB will be widely used in expert and intelligent systems that utilize various data to create value since it can be applied to any type of data and problem.	-
dc.identifier.bibliographicCitation	EXPERT SYSTEMS WITH APPLICATIONS, v.158, pp.113398	-
dc.identifier.doi	10.1016/j.eswa.2020.113398	-
dc.identifier.issn	0957-4174	-
dc.identifier.scopusid	2-s2.0-85086708952	-
dc.identifier.uri	https://scholarworks.unist.ac.kr/handle/201301/32966	-
dc.identifier.url	https://www.sciencedirect.com/science/article/pii/S0957417420302220	-
dc.identifier.wosid	000571732700015	-
dc.language	영어	-
dc.publisher	Pergamon Press Ltd.	-
dc.title	Markov blanket-based universal feature selection for classification and regression of mixed-type data	-
dc.type	Article	-
dc.description.isOpenAccess	FALSE	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic; Operations Research & Management Science	-
dc.relation.journalResearchArea	Computer Science; Engineering; Operations Research & Management Science	-
dc.type.docType	Article	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordAuthor	Markov blanket	-
dc.subject.keywordAuthor	Multivariate feature selection	-
dc.subject.keywordAuthor	Conditional independence test	-
dc.subject.keywordAuthor	Likelihood-ratio test	-
dc.subject.keywordAuthor	Classification	-
dc.subject.keywordAuthor	Regression	-

Show Simple Item Record

qrcode

RSS 1.0 RSS 2.0

UNIST | Library

Tel : 052-217-1404 / Email : scholarworks@unist.ac.kr

ScholarWorks@UNIST was established as an OAK Project for the National Library of Korea.