AC
Algorithm Configuration
Let \(\mathcal{I}\) be the set of problem instances, \(\mathcal{D}\) be the probability distribution over \(\mathcal{I}\). Let \(\mathcal{A}\) be the set of algorithms with parameters \(p_1, p_2, \ldots, p_k\). The domain of each parameter \(p_i\) is denoted as \(\Theta_i\), and the parameter space is defined as \(\Theta = \Theta_1 \times \Theta_2 \times \ldots \times \Theta_k\).
The objective of algorithm configuration is to find the optimal parameter configuration \(\theta^* \in \Theta\) that minimizes the expected cost of the algorithm over the distribution of problem instances:
\[ \theta^* \in \arg\min_{\theta \in \Theta} \mathbb{E}_{i \sim \mathcal{D}}[c(\theta, i)] \]
where \(c(\theta, i)\) is the cost of running algorithm \(\mathcal{A}\) with parameter configuration \(\theta\) on problem instance \(i\).
Dynamic Algorithm Configuration
Hyperparameter Optimization in Reinforcement Learning
Eimer et al. (2023) discussed the hyperparameter optimization problem in the context of reinforcement learning (RL).1 They conducted a experimental study on the performance of various hyperparameter optimization methods, including random search, DEHB (Awad et al. 2021),2 and BGT (Wan et al. 2022).3
- Optuna (Akiba et al. 2019)
References
Footnotes
Source code: https://github.com/facebookresearch/how-to-autorl.↩︎
Source code: https://github.com/automl/DEHB.↩︎
Source code: https://github.com/xingchenwan/bgpbt.↩︎