File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

양승준

Yang, Seungjoon
Signal Processing Lab .
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Efficient architecture for deep neural networks with heterogeneous sensitivity

Author(s)
Cho, HyunjoongJang, JinhyeokLee, ChanhyeokYang, Seungjoon
Issued Date
2021-02
DOI
10.1016/j.neunet.2020.10.017
URI
https://scholarworks.unist.ac.kr/handle/201301/50074
Fulltext
https://www.sciencedirect.com/science/article/pii/S0893608020303804?via%3Dihub
Citation
NEURAL NETWORKS, v.134, pp.95 - 106
Abstract
In this study, we present a neural network that consists of nodes with heterogeneous sensitivity. Each node in a network is assigned a variable that determines the sensitivity with which it learns to perform a given task. The network is trained via a constrained optimization that maximizes the sparsity of the sensitivity variables while ensuring optimal network performance. As a result, the network learns to perform a given task using only a few sensitive nodes. Insensitive nodes, which are nodes with zero sensitivity, can be removed from a trained network to obtain a computationally efficient network. Removing zero-sensitivity nodes has no effect on the performance of the network because the network has already been trained to perform the task without them. The regularization parameter used to solve the optimization problem was simultaneously found during the training of the networks. To validate our approach, we designed networks with computationally efficient architectures for various tasks such as autoregression, object recognition, facial expression recognition, and object detection using various datasets. In our experiments, the networks designed by our proposed method provided the same or higher performances but with far less computational complexity. (C) 2020 Elsevier Ltd. All rights reserved.
Publisher
PERGAMON-ELSEVIER SCIENCE LTD
ISSN
0893-6080
Keyword (Author)
Deep neural networksEfficient architectureHeterogeneous sensitivityConstrained optimizationSimultaneous regularization parameter selection
Keyword
L-CURVEREGULARIZATION

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.