
mlr3hyperband

Extends the mlr3 package with hyperband tuning.


Installation

Install the package from GitHub by running the following line:

remotes::install_github("mlr-org/mlr3hyperband")

Quickstart

If you are already familiar with mlr3tuning, then the only change compared to other tuners is to tag a numeric hyperparameter with "budget". Afterwards, hyperband can be handled like any other tuner:

library(mlr3hyperband)

# give a hyperparameter the "budget" tag
params = list(
  ParamInt$new("nrounds", lower = 1, upper = 16, tags = "budget"),
  ParamDbl$new("eta",     lower = 0, upper = 1),
  ParamFct$new("booster", levels = c("gbtree", "gblinear", "dart"))
)

# inst = ... here goes the usual mlr3tuning TuningInstance constructor

# initialize hyperband tuner
tuner = TunerHyperband$new(eta = 2L)

# tune the previously defined TuningInstance
# tuner$tune(inst)

For the full working example, please check out the Examples section below.

A short description of hyperband

Hyperband is a budget-oriented procedure that weeds out suboptimally performing configurations early in their training process, thereby increasing tuning efficiency. To this end, several brackets are constructed, each with an associated set of configurations. These configurations are initialized by stochastic, often uniform, sampling. Each bracket is divided into multiple stages, and configurations are evaluated with an increasing budget in each stage. Note that currently all configurations are trained completely from the beginning, so there are no online updates of models.

Different brackets are initialized with a different number of configurations and different budget sizes, following the schedule illustrated by the sketch below.
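The schedule is derived from the bounds of the budget parameter and the halving rate eta. The following minimal sketch illustrates this computation; hyperband_schedule is a hypothetical helper written for this illustration, not part of the package API:

# compute the hyperband schedule for a budget parameter with the given
# bounds and halving rate eta (sketch for illustration, not package code)
hyperband_schedule = function(lower, upper, eta) {
  R = upper / lower           # ratio of maximum to minimum budget
  s_max = floor(log(R, eta))  # index of the most exploratory bracket
  for (s in s_max:0) {
    n = ceiling((s_max + 1) / (s + 1) * eta^s) # initial configurations
    r = lower * eta^(-s) * R                   # initial budget per config
    for (i in 0:s) {
      n_i = floor(n * eta^(-i)) # configurations kept in stage i
      r_i = r * eta^i           # budget per configuration in stage i
      cat(sprintf("bracket %d, stage %d: %2d configs at budget %5.1f\n",
        s, i, n_i, r_i))
    }
  }
}

# with the bounds of nrounds from the XGBoost example below
hyperband_schedule(lower = 1, upper = 16, eta = 2)

For lower = 1, upper = 16 and eta = 2, the most exploratory bracket starts with 16 configurations at budget 1 and halves them at every stage until a single configuration is evaluated with the full budget of 16, while the least exploratory bracket directly evaluates 5 configurations with the full budget.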

To specify the budget used by hyperband, the user has to state explicitly which hyperparameter of the learner controls the budget by tagging exactly one hyperparameter in the parameter set with "budget". An alternative approach using subsampling and pipelines is described further below.

Examples

Originally, hyperband was designed with a "natural" learning parameter as the budget parameter in mind, such as nrounds of the XGBoost learner:

library(mlr3hyperband)
library(mlr3learners)
set.seed(123)

# define hyperparameter and budget parameter for tuning with hyperband
params = list(
  ParamInt$new("nrounds", lower = 1, upper = 16, tags = "budget"),
  ParamDbl$new("eta",     lower = 0, upper = 1),
  ParamFct$new("booster", levels = c("gbtree", "gblinear", "dart"))
)

# initialize TuningInstance as usual
# hyperband terminates on its own, so the terminator acts as an upper bound
inst = TuningInstance$new(
  task = tsk("iris"),
  learner = lrn("classif.xgboost"),
  resampling = rsmp("holdout"),
  measures = msr("classif.ce"),
  ParamSet$new(params),
  term("evals", n_evals = 100000L) # high value to let hyperband finish
)

# initialize Hyperband Tuner and tune
tuner = TunerHyperband$new(eta = 2L)
tuner$tune(inst)

# return best result
inst$best()

Additionally, the framework supports the case where the learner offers no natural fidelity parameter. In this case, mlr3pipelines can be used to define subsampling as a preprocessing step. The frac parameter of subsampling, which defines the fraction of the training data to be used, then acts as the budget parameter:

library(mlr3hyperband)
library(mlr3pipelines)
set.seed(123)

ll = po("subsample") %>>% lrn("classif.rpart")

# define extended hyperparameters with subsampling fraction as budget
# ==> no learner budget is required
params = list(
  ParamDbl$new("classif.rpart.cp", lower = 0.001, upper = 0.1),
  ParamInt$new("classif.rpart.minsplit", lower = 1, upper = 10),
  ParamDbl$new("subsample.frac", lower = 0.1, upper = 1, tags = "budget")
)

# define TuningInstance with the Graph Learner and the extended hyperparams
inst = TuningInstance$new(
  tsk("iris"),
  ll,
  rsmp("holdout"),
  msr("classif.ce"),
  ParamSet$new(params),
  term("evals", n_evals = 100000L) # high value to let hyperband finish
)

tuner = TunerHyperband$new(eta = 4L)
tuner$tune(inst)

# return best result
inst$best()

Documentation

The function reference can be found here. Further documentation lives in the mlr3book.

The original paper introducing the hyperband algorithm can be found here.
