Publications

DEPARTMENTS

Emperical Interference

Haptic Intelligence

Modern Magnetic Systems

Perceiving Systems

Physical Intelligence

Robotic Materials

Social Foundations of Computation


Research Groups

Autonomous Vision

Autonomous Learning

Bioinspired Autonomous Miniature Robots

Dynamic Locomotion

Embodied Vision

Human Aspects of Machine Learning

Intelligent Control Systems

Learning and Dynamical Systems

Locomotion in Biorobotic and Somatic Systems

Micro, Nano, and Molecular Systems

Movement Generation and Control

Neural Capture and Synthesis

Physics for Inference and Optimization

Organizational Leadership and Diversity

Probabilistic Learning Group


Topics

Robot Learning

Conference Paper

2022

Autonomous Learning

Robotics

AI

Career

Award


Empirical Inference Probabilistic Numerics Conference Paper Convergence Guarantees for Adaptive Bayesian Quadrature Methods Kanagawa, M., Hennig, P. Advances in Neural Information Processing Systems 32 (NeurIPS 2019), 6234-6245, (Editors: H. Wallach and H. Larochelle and A. Beygelzimer and F. d’Alché-Buc and E. Fox and R. Garnett), Curran Associates, Inc., 33rd Annual Conference on Neural Information Processing Systems, December 2019 (Published) URL BibTeX

Empirical Inference Probabilistic Numerics Conference Paper Limitations of the empirical Fisher approximation for natural gradient descent Kunstner, F., Hennig, P., Balles, L. Advances in Neural Information Processing Systems 32 (NeurIPS 2019), 4158-4169, (Editors: H. Wallach and H. Larochelle and A. Beygelzimer and F. d’Alché-Buc and E. Fox and R. Garnett), Curran Associates, Inc., 33rd Annual Conference on Neural Information Processing Systems, December 2019 (Published) URL BibTeX

Probabilistic Numerics Conference Paper Active Multi-Information Source Bayesian Quadrature Gessner, A. G. J. M. M. Proceedings 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 712-721, (Editors: Adams, RP; Gogate, V), UAI, July 2019 (Published) URL BibTeX

Probabilistic Numerics Empirical Inference Conference Paper Active Probabilistic Inference on Matrices for Pre-Conditioning in Stochastic Optimization de Roos, F., Hennig, P. Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS), 89:1448-1457, (Editors: Kamalika Chaudhuri and Masashi Sugiyama), PMLR, April 2019 (Published)
Pre-conditioning is a well-known concept that can significantly improve the convergence of optimization algorithms. For noise-free problems, where good pre-conditioners are not known a priori, iterative linear algebra methods offer one way to efficiently construct them. For the stochastic optimization problems that dominate contemporary machine learning, however, this approach is not readily available. We propose an iterative algorithm inspired by classic iterative linear solvers that uses a probabilistic model to actively infer a pre-conditioner in situations where Hessian-projections can only be constructed with strong Gaussian noise. The algorithm is empirically demonstrated to efficiently construct effective pre-conditioners for stochastic gradient descent and its variants. Experiments on problems of comparably low dimensionality show improved convergence. In very high-dimensional problems, such as those encountered in deep learning, the pre-conditioner effectively becomes an automatic learning-rate adaptation scheme, which we also empirically show to work well.
PDF URL BibTeX

Probabilistic Numerics Empirical Inference Conference Paper Fast and Robust Shortest Paths on Manifolds Learned from Data Arvanitidis, G., Hauberg, S., Hennig, P., Schober, M. Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS), 89:1506-1515, (Editors: Kamalika Chaudhuri and Masashi Sugiyama), PMLR, April 2019 (Published) PDF URL BibTeX

Probabilistic Numerics Conference Paper Kernel Recursive ABC: Point Estimation with Intractable Likelihood Kajihara, T., Kanagawa, M., Yamazaki, K., Fukumizu, K. Proceedings of the 35th International Conference on Machine Learning, 2405-2414, PMLR, July 2018
We propose a novel approach to parameter estimation for simulator-based statistical models with intractable likelihood. Our proposed method involves recursive application of kernel ABC and kernel herding to the same observed data. We provide a theoretical explanation regarding why the approach works, showing (for the population setting) that, under a certain assumption, point estimates obtained with this method converge to the true parameter, as recursion proceeds. We have conducted a variety of numerical experiments, including parameter estimation for a real-world pedestrian flow simulator, and show that in most cases our method outperforms existing approaches.
Paper BibTeX

Empirical Inference Probabilistic Numerics Conference Paper Counterfactual Mean Embedding: A Kernel Method for Nonparametric Causal Inference Muandet, K., Kanagawa, M., Saengkyongam, S., Marukata, S. Workshop on Machine Learning for Causal Inference, Counterfactual Prediction, and Autonomous Action (CausalML) at ICML, July 2018 (Published) BibTeX

Probabilistic Numerics Conference Paper Dissecting Adam: The Sign, Magnitude and Variance of Stochastic Gradients Balles, L., Hennig, P. Proceedings of the 35th International Conference on Machine Learning (ICML), 80:404-413, Proceedings of Machine Learning Research, (Editors: Jennifer Dy and Andreas Krause), PMLR, ICML, July 2018 (Published)
The ADAM optimizer is exceedingly popular in the deep learning community. Often it works very well, sometimes it doesn't. Why? We interpret ADAM as a combination of two aspects: for each weight, the update direction is determined by the sign of stochastic gradients, whereas the update magnitude is determined by an estimate of their relative variance. We disentangle these two aspects and analyze them in isolation, gaining insight into the mechanisms underlying ADAM. This analysis also extends recent results on adverse effects of ADAM on generalization, isolating the sign aspect as the problematic one. Transferring the variance adaptation to SGD gives rise to a novel method, completing the practitioner's toolbox for problems where ADAM fails.
URL BibTeX

Autonomous Motion Probabilistic Numerics Intelligent Control Systems Conference Paper On the Design of LQR Kernels for Efficient Controller Learning Marco, A., Hennig, P., Schaal, S., Trimpe, S. Proceedings of the 56th IEEE Annual Conference on Decision and Control (CDC), 5193-5200, IEEE, IEEE Conference on Decision and Control, December 2017 (Published)
Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As is shown herein, GPs with a common kernel choice can, however, lead to poor learning outcomes on standard quadratic control problems. For a first-order system, we construct two kernels that specifically leverage the structure of the well-known Linear Quadratic Regulator (LQR), yet retain the flexibility of Bayesian nonparametric learning. Simulations of uncertain linear and nonlinear systems demonstrate that the LQR kernels yield superior learning performance.
arXiv PDF On the Design of LQR Kernels for Efficient Controller Learning - CDC presentation DOI BibTeX

Empirical Inference Probabilistic Numerics Conference Paper Dynamic Time-of-Flight Schober, M., Adam, A., Yair, O., Mazor, S., Nowozin, S. Proceedings IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017, 170-179, IEEE, Piscataway, NJ, USA, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017 (Published) DOI BibTeX

Probabilistic Numerics Conference Paper Fast Bayesian Optimization of Machine Learning Hyperparameters on Large Datasets Klein, A., Falkner, S., Bartels, S., Hennig, P., Hutter, F. Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS), 54:528-536, Proceedings of Machine Learning Research, (Editors: Sign, Aarti and Zhu, Jerry), PMLR, April 2017 (Published) pdf URL BibTeX

Probabilistic Numerics Conference Paper Active Uncertainty Calibration in Bayesian ODE Solvers Kersting, H., Hennig, P. Proceedings of the 32nd Conference on Uncertainty in Artificial Intelligence (UAI), 309-318, (Editors: Ihler, Alexander T. and Janzing, Dominik), June 2016 (Published)
There is resurging interest, in statistics and machine learning, in solvers for ordinary differential equations (ODEs) that return probability measures instead of point estimates. Recently, Conrad et al.~introduced a sampling-based class of methods that are `well-calibrated' in a specific sense. But the computational cost of these methods is significantly above that of classic methods. On the other hand, Schober et al.~pointed out a precise connection between classic Runge-Kutta ODE solvers and Gaussian filters, which gives only a rough probabilistic calibration, but at negligible cost overhead. By formulating the solution of ODEs as approximate inference in linear Gaussian SDEs, we investigate a range of probabilistic ODE solvers, that bridge the trade-off between computational cost and probabilistic calibration, and identify the inaccurate gradient measurement as the crucial source of uncertainty. We propose the novel filtering-based method Bayesian Quadrature filtering (BQF) which uses Bayesian quadrature to actively learn the imprecision in the gradient measurement by collecting multiple gradient evaluations.
URL BibTeX

Probabilistic Numerics Conference Paper Batch Bayesian Optimization via Local Penalization González, J., Dai, Z., Hennig, P., Lawrence, N. Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS), 51:648-657, JMLR Workshop and Conference Proceedings, (Editors: Gretton, A. and Robert, C. C.), May 2016 (Published) URL BibTeX

Probabilistic Numerics Conference Paper Probabilistic Approximate Least-Squares Bartels, S., Hennig, P. Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS), 51:676-684, JMLR Workshop and Conference Proceedings, (Editors: Gretton, A. and Robert, C. C. ), May 2016 (Published)
Least-squares and kernel-ridge / Gaussian process regression are among the foundational algorithms of statistics and machine learning. Famously, the worst-case cost of exact nonparametric regression grows cubically with the data-set size; but a growing number of approximations have been developed that estimate good solutions at lower cost. These algorithms typically return point estimators, without measures of uncertainty. Leveraging recent results casting elementary linear algebra operations as probabilistic inference, we propose a new approximate method for nonparametric least-squares that affords a probabilistic uncertainty estimate over the error between the approximate and exact least-squares solution (this is not the same as the posterior variance of the associated Gaussian process regressor). This allows estimating the error of the least-squares solution on a subset of the data relative to the full-data solution. The uncertainty can be used to control the computational effort invested in the approximation. Our algorithm has linear cost in the data-set size, and a simple formal form, so that it can be implemented with a few lines of code in programming languages with linear algebra functionality.
URL BibTeX

Autonomous Motion Empirical Inference Probabilistic Numerics Intelligent Control Systems Conference Paper Automatic LQR Tuning Based on Gaussian Process Optimization: Early Experimental Results Marco, A., Hennig, P., Bohg, J., Schaal, S., Trimpe, S. Machine Learning in Planning and Control of Robot Motion Workshop at the IEEE/RSJ International Conference on Intelligent Robots and Systems (iROS), Machine Learning in Planning and Control of Robot Motion Workshop, October 2015 (Published)
This paper proposes an automatic controller tuning framework based on linear optimal control combined with Bayesian optimization. With this framework, an initial set of controller gains is automatically improved according to a pre-defined performance objective evaluated from experimental data. The underlying Bayesian optimization algorithm is Entropy Search, which represents the latent objective as a Gaussian process and constructs an explicit belief over the location of the objective minimum. This is used to maximize the information gain from each experimental evaluation. Thus, this framework shall yield improved controllers with fewer evaluations compared to alternative approaches. A seven-degree-of-freedom robot arm balancing an inverted pole is used as the experimental demonstrator. Preliminary results of a low-dimensional tuning problem highlight the method’s potential for automatic controller tuning on robotic platforms.
PDF DOI BibTeX