File Download

There are no files associated with this item.

Related Researcher

최재식 (Choi, Jaesik)


Full metadata record

DC Field: Value
dc.citation.conferencePlace: SI
dc.citation.conferencePlace: Singapore
dc.citation.endPage: 696
dc.citation.startPage: 689
dc.citation.title: 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018
dc.contributor.author: Yi, Subin
dc.contributor.author: Choi, Jaesik
dc.date.accessioned: 2024-02-01T01:06:08Z
dc.date.available: 2024-02-01T01:06:08Z
dc.date.created: 2019-04-09
dc.date.issued: 2018-11-17
dc.description.abstract: Much recent deep learning research uses very deep neural networks with huge numbers of parameters. This yields strong expressive power, but it also brings problems such as overfitting to the training data, a growing memory burden, and excessive computation. In this paper, we propose an expectation maximization method that learns the group structure of deep neural networks under a group regularization principle to resolve these issues. Our method clusters the neurons in a layer with a mixture model, based on how they connect to the neurons in the next layer, and assigns each neuron in the next layer to the group in the current layer it is most strongly connected to. The expectation maximization procedure uses a Gaussian mixture model to keep the most salient connections and remove the others, yielding a grouped weight matrix in block diagonal form. We further refine the method to cluster the kernels of convolutional neural networks (CNNs): we define a representative value for each kernel, build a representative matrix, group that matrix, and prune kernels according to its group structure. In experiments, we applied our method to fully connected networks, 1-dimensional CNNs, and 2-dimensional CNNs, and compared them with baseline deep neural networks on the MNIST, CIFAR-10, and United States groundwater datasets with respect to the number of parameters and classification and regression accuracy. We show that our method reduces the number of parameters significantly without loss of accuracy and outperforms the baseline models.
dc.identifier.bibliographicCitation: 18th IEEE International Conference on Data Mining Workshops, ICDMW 2018, pp.689 - 696
dc.identifier.doi: 10.1109/ICDMW.2018.00106
dc.identifier.issn: 2375-9232
dc.identifier.scopusid: 2-s2.0-85062891963
dc.identifier.uri: https://scholarworks.unist.ac.kr/handle/201301/80402
dc.identifier.url: https://ieeexplore.ieee.org/document/8637406
dc.language: English
dc.publisher: IEEE Computer Society
dc.title: Learning the group structure of deep neural networks with an expectation maximization method
dc.type: Conference Paper
dc.date.conferenceDate: 2018-11-17
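
To make the grouping idea in the abstract concrete, below is a minimal sketch of the fully connected case. It assumes a fixed group count, magnitude-based connection strength, and scikit-learn's GaussianMixture; it illustrates the general technique rather than the authors' implementation, and the CNN variant (grouping a matrix of per-kernel representative values) is not shown.

import numpy as np
from sklearn.mixture import GaussianMixture

def group_weight_matrix(W, n_groups=4, seed=0):
    """Group the layer described by W (shape: n_out x n_in) and prune
    cross-group weights, leaving a matrix that is block diagonal up to
    a permutation of the neurons."""
    n_out, n_in = W.shape

    # Cluster input neurons by their outgoing-connection patterns:
    # column j of W says how input neuron j connects to the next layer.
    gmm = GaussianMixture(n_components=n_groups, covariance_type="diag",
                          random_state=seed)
    in_groups = gmm.fit_predict(np.abs(W).T)              # shape (n_in,)

    # Assign each output neuron to the input group it is most strongly
    # connected to, measured here by total weight magnitude.
    strength = np.stack([np.abs(W[:, in_groups == g]).sum(axis=1)
                         for g in range(n_groups)], axis=1)
    out_groups = strength.argmax(axis=1)                  # shape (n_out,)

    # Keep only within-group connections; everything else is pruned.
    mask = out_groups[:, None] == in_groups[None, :]
    return W * mask, in_groups, out_groups

# Toy usage: prune a random 64x128 layer into 4 groups.
rng = np.random.default_rng(0)
W = rng.standard_normal((64, 128))
W_pruned, gi, go = group_weight_matrix(W)
print("fraction of weights kept:", (W_pruned != 0).mean())

Because each output neuron keeps only the weights from its own input group, permuting rows and columns by group label exposes the block diagonal structure the abstract describes, which is what allows the parameter count to drop without rewiring the network.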

