Efficient hyper-parameter optimization for NLP applications

Lidan Wang; Minwei Feng; Bowen Zhou; Bing Xiang; Sridhar Mahadevan

doi:10.18653/v1/d15-1253

EMNLP 2015

Conference paper

17 Sep 2015

Efficient hyper-parameter optimization for NLP applications

View publication

Abstract

Hyper-parameter optimization is an important problem in natural language processing (NLP) and machine learning. Recently, a group of studies has focused on using sequential Bayesian Optimization to solve this problem, which aims to reduce the number of iterations and trials required during the optimization process. In this paper, we explore this problem from a different angle, and propose a multi-stage hyper-parameter optimization that breaks the problem into multiple stages with increasingly amounts of data. Early stage provides fast estimates of good candidates which are used to initialize later stages for better performance and speed. We demonstrate the utility of this new algorithm by evaluating its speed and accuracy against state-of-the-art Bayesian Optimization algorithms on classification and prediction tasks.

Conference paper