Basic Enhancement Strategies When Using Bayesian Optimization for Hyperparameter Tuning of Deep Neural Networks

Author(s)
Cho, Hyunghun; Kim, Yongjin; Lee, Eunjung; Choi, Daeyoung; Lee, Yongjae; Rhee, Wonjong
Issued Date
2020-03
DOI
10.1109/access.2020.2981072
URI
https://scholarworks.unist.ac.kr/handle/201301/31854
Fulltext
https://ieeexplore.ieee.org/abstract/document/9037259
Citation
IEEE ACCESS, v.8, pp.52588 - 52608
Abstract
Compared to traditional machine learning models, deep neural networks (DNNs) are known to be highly sensitive to the choice of hyperparameters. While the time and effort required for manual tuning have been rapidly decreasing for well-developed and commonly used DNN architectures, DNN hyperparameter optimization will undoubtedly remain a major burden whenever a new DNN architecture needs to be designed, a new task needs to be solved, a new dataset needs to be addressed, or an existing DNN needs to be improved further. For hyperparameter optimization of general machine learning problems, numerous automated solutions have been developed, some of the most popular of which are based on Bayesian Optimization (BO). In this work, we analyze four fundamental strategies for enhancing BO when it is used for DNN hyperparameter optimization: diversification, early termination, parallelization, and cost function transformation. Based on the analysis, we provide a simple yet robust algorithm for DNN hyperparameter optimization, DEEP-BO (Diversified, Early-termination-Enabled, and Parallel Bayesian Optimization). When evaluated over six DNN benchmarks, DEEP-BO mostly outperformed well-known solutions including GP-Hedge, BOHB, and the speed-up variants that use the Median Stopping Rule or Learning Curve Extrapolation. DEEP-BO consistently delivered top, or close to top, performance across all the benchmark types we tested, indicating that it is more robust than the existing solutions. The DEEP-BO code is publicly available at https://github.com/snu-adsl/DEEP-BO.
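The abstract describes enhancement strategies rather than a specific API, so the sketch below is only a minimal illustration of Bayesian optimization for hyperparameter tuning, not the authors' DEEP-BO implementation (that code is at the GitHub link above). It assumes a scikit-learn Gaussian process surrogate, a synthetic stand-in for validation loss, alternating Expected Improvement and lower-confidence-bound acquisitions as a rough proxy for the diversification strategy, and a log transform of the loss as a rough proxy for cost function transformation; all function names and parameter ranges here are illustrative assumptions.

```python
# Minimal Bayesian optimization sketch (illustrative only, not DEEP-BO).
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

rng = np.random.default_rng(0)

def validation_loss(lr_exponent):
    """Synthetic 1-D stand-in for validation loss as a function of log10(learning rate)."""
    return (lr_exponent + 3.0) ** 2 * 0.1 + 0.02 * rng.standard_normal() + 0.2

def expected_improvement(mu, sigma, best):
    """Standard Expected Improvement for minimization."""
    sigma = np.maximum(sigma, 1e-9)
    z = (best - mu) / sigma
    return (best - mu) * norm.cdf(z) + sigma * norm.pdf(z)

def lcb_score(mu, sigma, kappa=2.0):
    """Lower-confidence-bound acquisition for minimization, negated so 'higher is better'."""
    return -(mu - kappa * sigma)

bounds = (-6.0, -1.0)                       # search over log10(learning rate)
X = rng.uniform(*bounds, size=(3, 1))       # a few random initial evaluations
y = np.array([validation_loss(x[0]) for x in X])

for it in range(20):
    # Cost function transformation: model log(loss) so the surrogate sees a less skewed target.
    y_t = np.log(np.maximum(y, 1e-3))
    gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), normalize_y=True)
    gp.fit(X, y_t)

    cand = np.linspace(*bounds, 500).reshape(-1, 1)
    mu, sigma = gp.predict(cand, return_std=True)

    # Diversification: alternate between two acquisition functions across iterations.
    acq = expected_improvement(mu, sigma, y_t.min()) if it % 2 == 0 else lcb_score(mu, sigma)

    x_next = cand[np.argmax(acq)]
    y_next = validation_loss(x_next[0])
    X = np.vstack([X, x_next])
    y = np.append(y, y_next)

best = X[np.argmin(y), 0]
print(f"best log10(lr) found: {best:.3f}, loss: {y.min():.4f}")
```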
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
ISSN
2169-3536
Keyword (Author)
Deep neural networks; hyperparameter optimization; Bayesian optimization; diversification; early termination; parallelization; cost function transformation


Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.