Empirical Inference
Members
Publications
2019 Progress Report
2019 Progress Report
Members
Empirical Inference
Publications
Empirical Inference
Conference Paper
Data-Efficient Hierarchical Reinforcement Learning
Nachum, O., Gu, S., Lee, H., Levine, S.
Advances in Neural Information Processing Systems 31 (NeurIPS 2018), 3307-3317, (Editors: S. Bengio and H. Wallach and H. Larochelle and K. Grauman and N. Cesa-Bianchi and R. Garnett), Curran Associates, Inc., 32nd Annual Conference on Neural Information Processing Systems, December 2018 (Published)
URL
BibTeX
Intelligent Control Systems
Empirical Inference
Conference Paper
Efficient Encoding of Dynamical Systems through Local Approximations
Solowjow, F., Mehrjou, A., Schölkopf, B., Trimpe, S.
In Proceedings of the 57th IEEE International Conference on Decision and Control (CDC), 6073 - 6079 , Miami, Fl, USA, December 2018 (Published)
arXiv
PDF
DOI
BibTeX
Empirical Inference
Conference Paper
Constraint-Space Projection Direct Policy Search
Akrour, R., Peters, J., Neuman, G.
14th European Workshop on Reinforcement Learning (EWRL), October 2018 (Published)
URL
BibTeX
Empirical Inference
Article
Control of Musculoskeletal Systems using Learned Dynamics Models
Büchler, D., Calandra, R., Schölkopf, B., Peters, J.
IEEE Robotics and Automation Letters, 3(4):3161-3168, IEEE, October 2018 (Published)
RAL18final
DOI
URL
BibTeX
Controlling musculoskeletal systems, especially robots actuated by pneumatic artificial muscles, is a challenging task due to nonlinearities, hysteresis effects, massive actuator de- lay and unobservable dependencies such as temperature. Despite such difficulties, muscular systems offer many beneficial prop- erties to achieve human-comparable performance in uncertain and fast-changing tasks. For example, muscles are backdrivable and provide variable stiffness while offering high forces to reach high accelerations. In addition, the embodied intelligence deriving from the compliance might reduce the control demands for specific tasks. In this paper, we address the problem of how to accurately control musculoskeletal robots. To address this issue, we propose to learn probabilistic forward dynamics models using Gaussian processes and, subsequently, to employ these models for control. However, Gaussian processes dynamics models cannot be set-up for our musculoskeletal robot as for traditional motor- driven robots because of unclear state composition etc. We hence empirically study and discuss in detail how to tune these approaches to complex musculoskeletal robots and their specific challenges. Moreover, we show that our model can be used to accurately control an antagonistic pair of pneumatic artificial muscles for a trajectory tracking task while considering only one- step-ahead predictions of the forward model and incorporating model uncertainty.
Empirical Inference
Conference Paper
Domain Randomization for Simulation-Based Policy Optimization with Transferability Assessment
Muratore, F., Treede, F., Gienger, M., Peters, J.
2nd Annual Conference on Robot Learning (CoRL), 87:700-713, Proceedings of Machine Learning Research, PMLR, October 2018 (Published)
URL
BibTeX
Empirical Inference
Conference Paper
Regularizing Reinforcement Learning with State Abstraction
Akrour, R., Veiga, F., Peters, J., Neuman, G.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 534-539, October 2018 (Published)
DOI
URL
BibTeX
Empirical Inference
Conference Paper
Reinforcement Learning of Phase Oscillators for Fast Adaptation to Moving Targets
Maeda, G., Koc, O., Morimoto, J.
Proceedings of The 2nd Conference on Robot Learning (CoRL), 87:630-640, (Editors: Aude Billard, Anca Dragan, Jan Peters, Jun Morimoto ), PMLR, October 2018 (Published)
URL
BibTeX
Empirical Inference
Conference Paper
PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos
Parmas, P., Rasmussen, C., Peters, J., Doya, K.
Proceedings of the 35th International Conference on Machine Learning (ICML), 80:4065-4074, Proceedings of Machine Learning Research, (Editors: Dy, Jennifer and Krause, Andreas), PMLR, July 2018 (Published)
URL
BibTeX
Empirical Inference
Conference Paper
The Mirage of Action-Dependent Baselines in Reinforcement Learning
Tucker, G., Bhupatiraju, S., Gu, S., Turner, R., Ghahramani, Z., Levine, S.
Proceedings of the 35th International Conference on Machine Learning (ICML), 80:5022-5031, Proceedings of Machine Learning Research, (Editors: Dy, Jennifer and Krause, Andreas), PMLR, July 2018 (Published)
PDF
URL
BibTeX
Empirical Inference
Conference Paper
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning
Eysenbach, B., Gu, S., Ibarz, J., Levine, S.
6th International Conference on Learning Representations (ICLR), May 2018 (Published)
Videos
URL
BibTeX
Empirical Inference
Conference Paper
Sample and Feedback Efficient Hierarchical Reinforcement Learning from Human Preferences
Pinsler, R., Akrour, R., Osa, T., Peters, J., Neumann, G.
IEEE International Conference on Robotics and Automation, (ICRA), 596-601, IEEE, May 2018 (Published)
DOI
BibTeX
Empirical Inference
Conference Paper
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Pong*, V., Gu*, S., Dalal, M., Levine, S.
6th International Conference on Learning Representations (ICLR), May 2018, *equal contribution (Published)
URL
BibTeX
Empirical Inference
Article
Approximate Value Iteration Based on Numerical Quadrature
Vinogradska, J., Bischoff, B., Peters, J.
IEEE Robotics and Automation Letters, 3(2):1330-1337, January 2018 (Published)
DOI
BibTeX
Empirical Inference
Article
Intrinsic Motivation and Mental Replay enable Efficient Online Adaptation in Stochastic Recurrent Networks
Tanneberg, D., Peters, J., Rueckert, E.
Neural Networks, 109:67-80, 2018 (Published)
DOI
URL
BibTeX
Empirical Inference
Article
Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling
Šošić, A., Rueckert, E., Peters, J., Zoubir, A., Koeppl, H.
Journal of Machine Learning Research, 19(69):1-45, 2018 (Published)
URL
BibTeX
Empirical Inference
Conference Paper
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning
Gu, S., Lillicrap, T., Turner, R. E., Ghahramani, Z., Schölkopf, B., Levine, S.
Advances in Neural Information Processing Systems 30 (NIPS 2017), 3849-3858, (Editors: Guyon I. and Luxburg U.v. and Bengio S. and Wallach H. and Fergus R. and Vishwanathan S. and Garnett R.), Curran Associates, Inc., 31st Annual Conference on Neural Information Processing Systems, December 2017 (Published)
URL
BibTeX
Empirical Inference
Conference Paper
Efficient Online Adaptation with Stochastic Recurrent Neural Networks
Tanneberg, D., Peters, J., Rueckert, E.
IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids), 198-204, IEEE, November 2017 (Published)
DOI
BibTeX
Empirical Inference
Conference Paper
Online Learning with Stochastic Recurrent Neural Networks using Intrinsic Motivation Signals
Tanneberg, D., Peters, J., Rueckert, E.
Proceedings of the 1st Annual Conference on Robot Learning (CoRL), 167-174, Proceedings of Machine Learning Research, (Editors: Sergey Levine, Vincent Vanhoucke and Ken Goldberg), PMLR, November 2017 (Published)
URL
BibTeX
Empirical Inference
Article
Generalized exploration in policy search
van Hoof, H., Tanneberg, D., Peters, J.
Machine Learning, 106(9-10):1705-1724 , (Editors: Kurt Driessens, Dragi Kocev, Marko Robnik‐Sikonja, and Myra Spiliopoulou), October 2017, Special Issue of the ECML PKDD 2017 Journal Track (Published)
DOI
BibTeX
Empirical Inference
Conference Paper
Goal-driven dimensionality reduction for reinforcement learning
Parisi, S., Ramstedt, S., Peters, J.
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 4634-4639, IEEE, September 2017 (Published)
DOI
BibTeX
Empirical Inference
Conference Paper
Hybrid control trajectory optimization under uncertainty
Pajarinen, J., Kyrki, V., Koval, M., Srinivasa, S., Peters, J., Neumann, G.
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 5694-5701, September 2017 (Published)
DOI
BibTeX
Empirical Inference
Conference Paper
Sequence Tutor: Conservative fine-tuning of sequence generation models with KL-control
Jaques, N., Gu, S., Bahdanau, D., Hernández-Lobato, J. M., Turner, R. E., Eck, D.
Proceedings of the 34th International Conference on Machine Learning (ICML), 70:1645-1654, Proceedings of Machine Learning Research, (Editors: Doina Precup, Yee Whye Teh), PMLR, August 2017 (Published)
Arxiv
URL
BibTeX
Empirical Inference
Conference Paper
State-Regularized Policy Search for Linearized Dynamical Systems
Abdulsamad, H., Arenz, O., Peters, J., Neumann, G.
Proceedings of the Twenty-Seventh International Conference on Automated Planning and Scheduling, (ICAPS), 419-424, (Editors: Laura Barbulescu, Jeremy Frank, Mausam and Stephen F. Smith), AAAI Press, June 2017 (Published)
URL
BibTeX
Empirical Inference
Conference Paper
Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates
Gu*, S., Holly*, E., Lillicrap, T., Levine, S.
Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), IEEE, Piscataway, NJ, USA, May 2017, *equal contribution (Published)
Arxiv
DOI
BibTeX
Empirical Inference
Conference Paper
A Learning-based Shared Control Architecture for Interactive Task Execution
Farraj, F. B., Osa, T., Pedemonte, N., Peters, J., Neumann, G., Giordano, P.
IEEE International Conference on Robotics and Automation (ICRA), 329-335, IEEE, May 2017 (Published)
DOI
BibTeX
Empirical Inference
Conference Paper
Layered direct policy search for learning hierarchical skills
End, F., Akrour, R., Peters, J., Neumann, G.
IEEE International Conference on Robotics and Automation (ICRA), 6442-6448, IEEE, May 2017 (Published)
DOI
BibTeX
Empirical Inference
Conference Paper
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
Gu, S., Lillicrap, T., Ghahramani, Z., Turner, R. E., Levine, S.
Proceedings International Conference on Learning Representations (ICLR), April 2017 (Published)
PDF
URL
BibTeX
Empirical Inference
Conference Paper
Policy Search with High-Dimensional Context Variables
Tangkaratt, V., van Hoof, H., Parisi, S., Neumann, G., Peters, J., Sugiyama, M.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI), 2632-2638, (Editors: Satinder P. Singh and Shaul Markovitch), AAAI Press, February 2017 (Published)
URL
BibTeX
Empirical Inference
Article
Manifold-based multi-objective policy search with sample reuse
Parisi, S., Pirotta, M., Peters, J.
Neurocomputing, 263:3-14, (Editors: Madalina Drugan, Marco Wiering, Peter Vamplew, and Madhu Chetty), 2017, Special Issue on Multi-Objective Reinforcement Learning (Published)
DOI
BibTeX
Empirical Inference
Article
Model-based Contextual Policy Search for Data-Efficient Generalization of Robot Skills
Kupcsik, A., Deisenroth, M., Peters, J., Ai Poh, L., Vadakkepat, V., Neumann, G.
Artificial Intelligence, 247:415-439, 2017, Special Issue on AI and Robotics (Published)
DOI
URL
BibTeX
Empirical Inference
Article
Non-parametric Policy Search with Limited Information Loss
van Hoof, H., Neumann, G., Peters, J.
Journal of Machine Learning Research , 18(73):1-46, 2017 (Published)
URL
BibTeX
Empirical Inference
Book Chapter
Policy Gradient Methods
Peters, J., Bagnell, J.
In Encyclopedia of Machine Learning and Data Mining, 982-985, 2nd, (Editors: Sammut, Claude and Webb, Geoffrey I.), Springer US, 2017 (Published)
URL
BibTeX
Empirical Inference
Article
Stability of Controllers for Gaussian Process Dynamics
Vinogradska, J., Bischoff, B., Nguyen-Tuong, D., Peters, J.
Journal of Machine Learning Research, 18(100):1-37, 2017 (Published)
URL
BibTeX
Empirical Inference
Miscellaneous
f-Divergence constrained policy improvement
Belousov, B., Peters, J.
2017 (Published)
arXiv
BibTeX
Empirical Inference
Conference Paper
Catching heuristics are optimal control policies
Belousov, B., Neumann, G., Rothkopf, C., Peters, J.
Advances in Neural Information Processing Systems 29 (NIPS 2016), 1426-1434, (Editors: D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett), Curran Associates, Inc., 30th Annual Conference on Neural Information Processing Systems, December 2016 (Published)
URL
BibTeX
Empirical Inference
Conference Paper
Stable Reinforcement Learning with Autoencoders for Tactile and Visual Data
van Hoof, H., Chen, N., Karl, M., van der Smagt, P., Peters, J.
Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems (IROS), 3928-3934, IEEE, October 2016 (Published)
DOI
BibTeX
Empirical Inference
Probabilistic Numerics
Conference Paper
Approximate dual control maintaining the value of information with an application to building control
Klenske, E. D., Hennig, P., Schölkopf, B., Zeilinger, M. N.
In European Control Conference (ECC), 800-806, June 2016
PDF
DOI
BibTeX
Empirical Inference
Conference Paper
Continuous Deep Q-Learning with Model-based Acceleration
Gu, S., Lillicrap, T., Sutskever, I., Levine, S.
Proceedings of the 33nd International Conference on Machine Learning (ICML), 48:2829-2838, JMLR Workshop and Conference Proceedings, (Editors: Maria-Florina Balcan and Kilian Q. Weinberger), JMLR.org, June 2016 (Published)
URL
BibTeX
Empirical Inference
Probabilistic Numerics
Article
Dual Control for Approximate Bayesian Reinforcement Learning
Klenske, E. D., Hennig, P.
Journal of Machine Learning Research, 17(127):1-30, 2016 (Published)
PDF
URL
BibTeX
Empirical Inference
Probabilistic Numerics
Article
Gaussian Process-Based Predictive Control for Periodic Error Correction
Klenske, E. D., Zeilinger, M., Schölkopf, B., Hennig, P.
IEEE Transactions on Control Systems Technology , 24(1):110-121, 2016 (Published)
PDF
DOI
BibTeX
Empirical Inference
Article
Hierarchical Relative Entropy Policy Search
Daniel, C., Neumann, G., Kroemer, O., Peters, J.
Journal of Machine Learning Research, 17(93):1-50, 2016 (Published)
URL
BibTeX
Empirical Inference
Autonomous Motion
Article
Probabilistic Inference for Determining Options in Reinforcement Learning
Daniel, C., van Hoof, H., Peters, J., Neumann, G.
Machine Learning, Special Issue, 104(2):337-357, (Editors: Gärtner, T., Nanni, M., Passerini, A. and Robardet, C.), European Conference on Machine Learning im Machine Learning, Journal Track, 2016, Best Student Paper Award of ECML-PKDD 2016
DOI
BibTeX