Probabilistic Numerics – Max Planck Institute for Intelligent Systems

Institute Homepage

Institute Homepage Sign In

Back

Research Overview

Other

When Managers Become Robin Hoods: A Mixed Method Investigation

Using Human-in-the-Loop in Hiring

Probabilistic Numerics Members Publications

Members

Probabilistic Numerics, Empirical Inference

Philipp Hennig

Affiliated Researcher

Probabilistic Numerics

Simon Bartels

Doctoral Researcher

Intelligent Control Systems

Alonso Marco Valle

Intelligent Control Systems

Sebastian Trimpe

Publications

Autonomous Motion Probabilistic Numerics Intelligent Control Systems Conference Paper On the Design of LQR Kernels for Efficient Controller Learning Marco, A., Hennig, P., Schaal, S., Trimpe, S. Proceedings of the 56th IEEE Annual Conference on Decision and Control (CDC), 5193-5200, IEEE, IEEE Conference on Decision and Control, December 2017 (Published)

Abstract ›

Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As is shown herein, GPs with a common kernel choice can, however, lead to poor learning outcomes on standard quadratic control problems. For a first-order system, we construct two kernels that specifically leverage the structure of the well-known Linear Quadratic Regulator (LQR), yet retain the flexibility of Bayesian nonparametric learning. Simulations of uncertain linear and nonlinear systems demonstrate that the LQR kernels yield superior learning performance.

arXiv PDF On the Design of LQR Kernels for Efficient Controller Learning - CDC presentation DOI BibTeX

Autonomous Motion Probabilistic Numerics Intelligent Control Systems Conference Paper Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization Marco, A., Berkenkamp, F., Hennig, P., Schoellig, A. P., Krause, A., Schaal, S., Trimpe, S. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 1557-1563, IEEE, Piscataway, NJ, USA, May 2017 (Published) PDF arXiv ICRA 2017 Spotlight presentation Virtual vs. Real - Video explanation DOI BibTeX

Probabilistic Numerics Conference Paper Fast Bayesian Optimization of Machine Learning Hyperparameters on Large Datasets Klein, A., Falkner, S., Bartels, S., Hennig, P., Hutter, F. Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS), 54:528-536, Proceedings of Machine Learning Research, (Editors: Sign, Aarti and Zhu, Jerry), PMLR, April 2017 (Published) pdf URL BibTeX

Autonomous Motion Probabilistic Numerics Intelligent Control Systems Conference Paper Automatic LQR Tuning Based on Gaussian Process Global Optimization Marco, A., Hennig, P., Bohg, J., Schaal, S., Trimpe, S. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 270-277, IEEE, IEEE International Conference on Robotics and Automation, May 2016 (Published)

Abstract ›

This paper proposes an automatic controller tuning framework based on linear optimal control combined with Bayesian optimization. With this framework, an initial set of controller gains is automatically improved according to a pre-defined performance objective evaluated from experimental data. The underlying Bayesian optimization algorithm is Entropy Search, which represents the latent objective as a Gaussian process and constructs an explicit belief over the location of the objective minimum. This is used to maximize the information gain from each experimental evaluation. Thus, this framework shall yield improved controllers with fewer evaluations compared to alternative approaches. A seven-degree- of-freedom robot arm balancing an inverted pole is used as the experimental demonstrator. Results of a two- and four- dimensional tuning problems highlight the method’s potential for automatic controller tuning on robotic platforms.

Video - Automatic LQR Tuning Based on Gaussian Process Global Optimization - ICRA 2016 Video - Automatic Controller Tuning on a Two-legged Robot PDF DOI BibTeX

Probabilistic Numerics Conference Paper Batch Bayesian Optimization via Local Penalization González, J., Dai, Z., Hennig, P., Lawrence, N. Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS), 51:648-657, JMLR Workshop and Conference Proceedings, (Editors: Gretton, A. and Robert, C. C.), May 2016 (Published) URL BibTeX

Autonomous Motion Empirical Inference Probabilistic Numerics Intelligent Control Systems Conference Paper Automatic LQR Tuning Based on Gaussian Process Optimization: Early Experimental Results Marco, A., Hennig, P., Bohg, J., Schaal, S., Trimpe, S. Machine Learning in Planning and Control of Robot Motion Workshop at the IEEE/RSJ International Conference on Intelligent Robots and Systems (iROS), Machine Learning in Planning and Control of Robot Motion Workshop, October 2015 (Published)

Abstract ›

This paper proposes an automatic controller tuning framework based on linear optimal control combined with Bayesian optimization. With this framework, an initial set of controller gains is automatically improved according to a pre-defined performance objective evaluated from experimental data. The underlying Bayesian optimization algorithm is Entropy Search, which represents the latent objective as a Gaussian process and constructs an explicit belief over the location of the objective minimum. This is used to maximize the information gain from each experimental evaluation. Thus, this framework shall yield improved controllers with fewer evaluations compared to alternative approaches. A seven-degree-of-freedom robot arm balancing an inverted pole is used as the experimental demonstrator. Preliminary results of a low-dimensional tuning problem highlight the method’s potential for automatic controller tuning on robotic platforms.

PDF DOI BibTeX

Autonomous Motion Intelligent Control Systems Master Thesis Gaussian Process Optimization for Self-Tuning Control Marco, A. Polytechnic University of Catalonia (BarcelonaTech), October 2015 PDF BibTeX

Empirical Inference Probabilistic Numerics Article Entropy Search for Information-Efficient Global Optimization Hennig, P., Schuler, C. Journal of Machine Learning Research, 13:1809-1837, -, June 2012

Abstract ›

Contemporary global optimization algorithms are based on local measures of utility, rather than a probability measure over location and value of the optimum. They thus attempt to collect low function values, not to learn about the optimum. The reason for the absence of probabilistic global optimizers is that the corresponding inference problem is intractable in several ways. This paper develops desiderata for probabilistic optimization algorithms, then presents a concrete algorithm which addresses each of the computational intractabilities with a sequence of approximations and explicitly adresses the decision problem of maximizing information gain from each evaluation.

PDF Web BibTeX