Publications

DEPARTMENTS

Emperical Interference

Haptic Intelligence

Modern Magnetic Systems

Perceiving Systems

Physical Intelligence

Robotic Materials

Social Foundations of Computation


Research Groups

Autonomous Vision

Autonomous Learning

Bioinspired Autonomous Miniature Robots

Dynamic Locomotion

Embodied Vision

Human Aspects of Machine Learning

Intelligent Control Systems

Learning and Dynamical Systems

Locomotion in Biorobotic and Somatic Systems

Micro, Nano, and Molecular Systems

Movement Generation and Control

Neural Capture and Synthesis

Physics for Inference and Optimization

Organizational Leadership and Diversity

Probabilistic Learning Group


Topics

Robot Learning

Conference Paper

2022

Autonomous Learning

Robotics

AI

Career

Award


Autonomous Motion Probabilistic Numerics Intelligent Control Systems Conference Paper On the Design of LQR Kernels for Efficient Controller Learning Marco, A., Hennig, P., Schaal, S., Trimpe, S. Proceedings of the 56th IEEE Annual Conference on Decision and Control (CDC), 5193-5200, IEEE, IEEE Conference on Decision and Control, December 2017 (Published)
Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As is shown herein, GPs with a common kernel choice can, however, lead to poor learning outcomes on standard quadratic control problems. For a first-order system, we construct two kernels that specifically leverage the structure of the well-known Linear Quadratic Regulator (LQR), yet retain the flexibility of Bayesian nonparametric learning. Simulations of uncertain linear and nonlinear systems demonstrate that the LQR kernels yield superior learning performance.
arXiv PDF On the Design of LQR Kernels for Efficient Controller Learning - CDC presentation DOI BibTeX

Autonomous Motion Intelligent Control Systems Conference Paper Optimizing Long-term Predictions for Model-based Policy Search Doerr, A., Daniel, C., Nguyen-Tuong, D., Marco, A., Schaal, S., Toussaint, M., Trimpe, S. Proceedings of 1st Annual Conference on Robot Learning (CoRL), 78:227-238, (Editors: Sergey Levine and Vincent Vanhoucke and Ken Goldberg), 1st Annual Conference on Robot Learning, November 2017 (Published)
We propose a novel long-term optimization criterion to improve the robustness of model-based reinforcement learning in real-world scenarios. Learning a dynamics model to derive a solution promises much greater data-efficiency and reusability compared to model-free alternatives. In practice, however, modelbased RL suffers from various imperfections such as noisy input and output data, delays and unmeasured (latent) states. To achieve higher resilience against such effects, we propose to optimize a generative long-term prediction model directly with respect to the likelihood of observed trajectories as opposed to the common approach of optimizing a dynamics model for one-step-ahead predictions. We evaluate the proposed method on several artificial and real-world benchmark problems and compare it to PILCO, a model-based RL framework, in experiments on a manipulation robot. The results show that the proposed method is competitive compared to state-of-the-art model learning methods. In contrast to these more involved models, our model can directly be employed for policy search and outperforms a baseline method in the robot experiment.
PDF BibTeX

Autonomous Motion Intelligent Control Systems Article Event-based State Estimation: An Emulation-based Approach Trimpe, S. IET Control Theory & Applications, 11(11):1684-1693, July 2017 (Published)
An event-based state estimation approach for reducing communication in a networked control system is proposed. Multiple distributed sensor agents observe a dynamic process and sporadically transmit their measurements to estimator agents over a shared bus network. Local event-triggering protocols ensure that data is transmitted only when necessary to meet a desired estimation accuracy. The event-based design is shown to emulate the performance of a centralised state observer design up to guaranteed bounds, but with reduced communication. The stability results for state estimation are extended to the distributed control system that results when the local estimates are used for feedback control. Results from numerical simulations and hardware experiments illustrate the effectiveness of the proposed approach in reducing network communication.
arXiv Supplementary material PDF DOI BibTeX

Autonomous Motion Intelligent Control Systems Conference Paper Model-Based Policy Search for Automatic Tuning of Multivariate PID Controllers Doerr, A., Nguyen-Tuong, D., Marco, A., Schaal, S., Trimpe, S. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 5295-5301, IEEE, Piscataway, NJ, USA, IEEE International Conference on Robotics and Automation (ICRA), May 2017 (Published) PDF arXiv DOI BibTeX

Autonomous Motion Probabilistic Numerics Intelligent Control Systems Conference Paper Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization Marco, A., Berkenkamp, F., Hennig, P., Schoellig, A. P., Krause, A., Schaal, S., Trimpe, S. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 1557-1563, IEEE, Piscataway, NJ, USA, May 2017 (Published) PDF arXiv ICRA 2017 Spotlight presentation Virtual vs. Real - Video explanation DOI BibTeX