Empirical Inference – Max Planck Institute for Intelligent Systems

Empirical Inference Conference Paper Data-Efficient Hierarchical Reinforcement Learning Nachum, O., Gu, S., Lee, H., Levine, S. Advances in Neural Information Processing Systems 31 (NeurIPS 2018), 3307-3317, (Editors: S. Bengio and H. Wallach and H. Larochelle and K. Grauman and N. Cesa-Bianchi and R. Garnett), Curran Associates, Inc., 32nd Annual Conference on Neural Information Processing Systems, December 2018 (Published) URL BibTeX

Intelligent Control Systems Empirical Inference Conference Paper Efficient Encoding of Dynamical Systems through Local Approximations Solowjow, F., Mehrjou, A., Schölkopf, B., Trimpe, S. In Proceedings of the 57th IEEE International Conference on Decision and Control (CDC), 6073 - 6079 , Miami, Fl, USA, December 2018 (Published) arXiv PDF DOI BibTeX

Empirical Inference Conference Paper Constraint-Space Projection Direct Policy Search Akrour, R., Peters, J., Neuman, G. 14th European Workshop on Reinforcement Learning (EWRL), October 2018 (Published) URL BibTeX

Empirical Inference Article Control of Musculoskeletal Systems using Learned Dynamics Models Büchler, D., Calandra, R., Schölkopf, B., Peters, J. IEEE Robotics and Automation Letters, 3(4):3161-3168, IEEE, October 2018 (Published)

Abstract ›

Controlling musculoskeletal systems, especially robots actuated by pneumatic artificial muscles, is a challenging task due to nonlinearities, hysteresis effects, massive actuator de- lay and unobservable dependencies such as temperature. Despite such difficulties, muscular systems offer many beneficial prop- erties to achieve human-comparable performance in uncertain and fast-changing tasks. For example, muscles are backdrivable and provide variable stiffness while offering high forces to reach high accelerations. In addition, the embodied intelligence deriving from the compliance might reduce the control demands for specific tasks. In this paper, we address the problem of how to accurately control musculoskeletal robots. To address this issue, we propose to learn probabilistic forward dynamics models using Gaussian processes and, subsequently, to employ these models for control. However, Gaussian processes dynamics models cannot be set-up for our musculoskeletal robot as for traditional motor- driven robots because of unclear state composition etc. We hence empirically study and discuss in detail how to tune these approaches to complex musculoskeletal robots and their specific challenges. Moreover, we show that our model can be used to accurately control an antagonistic pair of pneumatic artificial muscles for a trajectory tracking task while considering only one- step-ahead predictions of the forward model and incorporating model uncertainty.

RAL18final DOI URL BibTeX

Empirical Inference Conference Paper Domain Randomization for Simulation-Based Policy Optimization with Transferability Assessment Muratore, F., Treede, F., Gienger, M., Peters, J. 2nd Annual Conference on Robot Learning (CoRL), 87:700-713, Proceedings of Machine Learning Research, PMLR, October 2018 (Published) URL BibTeX

Empirical Inference Conference Paper Regularizing Reinforcement Learning with State Abstraction Akrour, R., Veiga, F., Peters, J., Neuman, G. Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 534-539, October 2018 (Published) DOI URL BibTeX

Empirical Inference Conference Paper Reinforcement Learning of Phase Oscillators for Fast Adaptation to Moving Targets Maeda, G., Koc, O., Morimoto, J. Proceedings of The 2nd Conference on Robot Learning (CoRL), 87:630-640, (Editors: Aude Billard, Anca Dragan, Jan Peters, Jun Morimoto ), PMLR, October 2018 (Published) URL BibTeX

Empirical Inference Conference Paper PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos Parmas, P., Rasmussen, C., Peters, J., Doya, K. Proceedings of the 35th International Conference on Machine Learning (ICML), 80:4065-4074, Proceedings of Machine Learning Research, (Editors: Dy, Jennifer and Krause, Andreas), PMLR, July 2018 (Published) URL BibTeX

Empirical Inference Conference Paper The Mirage of Action-Dependent Baselines in Reinforcement Learning Tucker, G., Bhupatiraju, S., Gu, S., Turner, R., Ghahramani, Z., Levine, S. Proceedings of the 35th International Conference on Machine Learning (ICML), 80:5022-5031, Proceedings of Machine Learning Research, (Editors: Dy, Jennifer and Krause, Andreas), PMLR, July 2018 (Published) PDF URL BibTeX

Empirical Inference Conference Paper Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning Eysenbach, B., Gu, S., Ibarz, J., Levine, S. 6th International Conference on Learning Representations (ICLR), May 2018 (Published) Videos URL BibTeX

Empirical Inference Conference Paper Sample and Feedback Efficient Hierarchical Reinforcement Learning from Human Preferences Pinsler, R., Akrour, R., Osa, T., Peters, J., Neumann, G. IEEE International Conference on Robotics and Automation, (ICRA), 596-601, IEEE, May 2018 (Published) DOI BibTeX

Empirical Inference Conference Paper Temporal Difference Models: Model-Free Deep RL for Model-Based Control Pong*, V., Gu*, S., Dalal, M., Levine, S. 6th International Conference on Learning Representations (ICLR), May 2018, *equal contribution (Published) URL BibTeX

Empirical Inference Article Approximate Value Iteration Based on Numerical Quadrature Vinogradska, J., Bischoff, B., Peters, J. IEEE Robotics and Automation Letters, 3(2):1330-1337, January 2018 (Published) DOI BibTeX

Empirical Inference Article Intrinsic Motivation and Mental Replay enable Efficient Online Adaptation in Stochastic Recurrent Networks Tanneberg, D., Peters, J., Rueckert, E. Neural Networks, 109:67-80, 2018 (Published) DOI URL BibTeX

Empirical Inference Article Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling Šošić, A., Rueckert, E., Peters, J., Zoubir, A., Koeppl, H. Journal of Machine Learning Research, 19(69):1-45, 2018 (Published) URL BibTeX

Empirical Inference Conference Paper Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning Gu, S., Lillicrap, T., Turner, R. E., Ghahramani, Z., Schölkopf, B., Levine, S. Advances in Neural Information Processing Systems 30 (NIPS 2017), 3849-3858, (Editors: Guyon I. and Luxburg U.v. and Bengio S. and Wallach H. and Fergus R. and Vishwanathan S. and Garnett R.), Curran Associates, Inc., 31st Annual Conference on Neural Information Processing Systems, December 2017 (Published) URL BibTeX

Empirical Inference Conference Paper Efficient Online Adaptation with Stochastic Recurrent Neural Networks Tanneberg, D., Peters, J., Rueckert, E. IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids), 198-204, IEEE, November 2017 (Published) DOI BibTeX

Empirical Inference Conference Paper Online Learning with Stochastic Recurrent Neural Networks using Intrinsic Motivation Signals Tanneberg, D., Peters, J., Rueckert, E. Proceedings of the 1st Annual Conference on Robot Learning (CoRL), 167-174, Proceedings of Machine Learning Research, (Editors: Sergey Levine, Vincent Vanhoucke and Ken Goldberg), PMLR, November 2017 (Published) URL BibTeX

Empirical Inference Article Generalized exploration in policy search van Hoof, H., Tanneberg, D., Peters, J. Machine Learning, 106(9-10):1705-1724 , (Editors: Kurt Driessens, Dragi Kocev, Marko Robnik‐Sikonja, and Myra Spiliopoulou), October 2017, Special Issue of the ECML PKDD 2017 Journal Track (Published) DOI BibTeX

Empirical Inference Conference Paper Goal-driven dimensionality reduction for reinforcement learning Parisi, S., Ramstedt, S., Peters, J. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 4634-4639, IEEE, September 2017 (Published) DOI BibTeX

Empirical Inference Conference Paper Hybrid control trajectory optimization under uncertainty Pajarinen, J., Kyrki, V., Koval, M., Srinivasa, S., Peters, J., Neumann, G. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 5694-5701, September 2017 (Published) DOI BibTeX

Empirical Inference Conference Paper Sequence Tutor: Conservative fine-tuning of sequence generation models with KL-control Jaques, N., Gu, S., Bahdanau, D., Hernández-Lobato, J. M., Turner, R. E., Eck, D. Proceedings of the 34th International Conference on Machine Learning (ICML), 70:1645-1654, Proceedings of Machine Learning Research, (Editors: Doina Precup, Yee Whye Teh), PMLR, August 2017 (Published) Arxiv URL BibTeX

Empirical Inference Conference Paper State-Regularized Policy Search for Linearized Dynamical Systems Abdulsamad, H., Arenz, O., Peters, J., Neumann, G. Proceedings of the Twenty-Seventh International Conference on Automated Planning and Scheduling, (ICAPS), 419-424, (Editors: Laura Barbulescu, Jeremy Frank, Mausam and Stephen F. Smith), AAAI Press, June 2017 (Published) URL BibTeX

Empirical Inference Conference Paper Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates Gu*, S., Holly*, E., Lillicrap, T., Levine, S. Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), IEEE, Piscataway, NJ, USA, May 2017, *equal contribution (Published) Arxiv DOI BibTeX

Empirical Inference Conference Paper A Learning-based Shared Control Architecture for Interactive Task Execution Farraj, F. B., Osa, T., Pedemonte, N., Peters, J., Neumann, G., Giordano, P. IEEE International Conference on Robotics and Automation (ICRA), 329-335, IEEE, May 2017 (Published) DOI BibTeX

Empirical Inference Conference Paper Layered direct policy search for learning hierarchical skills End, F., Akrour, R., Peters, J., Neumann, G. IEEE International Conference on Robotics and Automation (ICRA), 6442-6448, IEEE, May 2017 (Published) DOI BibTeX

Empirical Inference Conference Paper Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic Gu, S., Lillicrap, T., Ghahramani, Z., Turner, R. E., Levine, S. Proceedings International Conference on Learning Representations (ICLR), April 2017 (Published) PDF URL BibTeX

Empirical Inference Conference Paper Policy Search with High-Dimensional Context Variables Tangkaratt, V., van Hoof, H., Parisi, S., Neumann, G., Peters, J., Sugiyama, M. Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI), 2632-2638, (Editors: Satinder P. Singh and Shaul Markovitch), AAAI Press, February 2017 (Published) URL BibTeX

Empirical Inference Article Manifold-based multi-objective policy search with sample reuse Parisi, S., Pirotta, M., Peters, J. Neurocomputing, 263:3-14, (Editors: Madalina Drugan, Marco Wiering, Peter Vamplew, and Madhu Chetty), 2017, Special Issue on Multi-Objective Reinforcement Learning (Published) DOI BibTeX

Empirical Inference Article Model-based Contextual Policy Search for Data-Efficient Generalization of Robot Skills Kupcsik, A., Deisenroth, M., Peters, J., Ai Poh, L., Vadakkepat, V., Neumann, G. Artificial Intelligence, 247:415-439, 2017, Special Issue on AI and Robotics (Published) DOI URL BibTeX

Empirical Inference Article Non-parametric Policy Search with Limited Information Loss van Hoof, H., Neumann, G., Peters, J. Journal of Machine Learning Research , 18(73):1-46, 2017 (Published) URL BibTeX

Empirical Inference Book Chapter Policy Gradient Methods Peters, J., Bagnell, J. In Encyclopedia of Machine Learning and Data Mining, 982-985, 2nd, (Editors: Sammut, Claude and Webb, Geoffrey I.), Springer US, 2017 (Published) URL BibTeX

Empirical Inference Article Stability of Controllers for Gaussian Process Dynamics Vinogradska, J., Bischoff, B., Nguyen-Tuong, D., Peters, J. Journal of Machine Learning Research, 18(100):1-37, 2017 (Published) URL BibTeX

Empirical Inference Miscellaneous f-Divergence constrained policy improvement Belousov, B., Peters, J. 2017 (Published) arXiv BibTeX

Empirical Inference Conference Paper Catching heuristics are optimal control policies Belousov, B., Neumann, G., Rothkopf, C., Peters, J. Advances in Neural Information Processing Systems 29 (NIPS 2016), 1426-1434, (Editors: D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett), Curran Associates, Inc., 30th Annual Conference on Neural Information Processing Systems, December 2016 (Published) URL BibTeX

Empirical Inference Conference Paper Stable Reinforcement Learning with Autoencoders for Tactile and Visual Data van Hoof, H., Chen, N., Karl, M., van der Smagt, P., Peters, J. Proceedings of the IEEE/RSJ Conference on Intelligent Robots and Systems (IROS), 3928-3934, IEEE, October 2016 (Published) DOI BibTeX

Empirical Inference Probabilistic Numerics Conference Paper Approximate dual control maintaining the value of information with an application to building control Klenske, E. D., Hennig, P., Schölkopf, B., Zeilinger, M. N. In European Control Conference (ECC), 800-806, June 2016 PDF DOI BibTeX

Empirical Inference Conference Paper Continuous Deep Q-Learning with Model-based Acceleration Gu, S., Lillicrap, T., Sutskever, I., Levine, S. Proceedings of the 33nd International Conference on Machine Learning (ICML), 48:2829-2838, JMLR Workshop and Conference Proceedings, (Editors: Maria-Florina Balcan and Kilian Q. Weinberger), JMLR.org, June 2016 (Published) URL BibTeX

Empirical Inference Probabilistic Numerics Article Dual Control for Approximate Bayesian Reinforcement Learning Klenske, E. D., Hennig, P. Journal of Machine Learning Research, 17(127):1-30, 2016 (Published) PDF URL BibTeX

Empirical Inference Probabilistic Numerics Article Gaussian Process-Based Predictive Control for Periodic Error Correction Klenske, E. D., Zeilinger, M., Schölkopf, B., Hennig, P. IEEE Transactions on Control Systems Technology , 24(1):110-121, 2016 (Published) PDF DOI BibTeX

Empirical Inference Article Hierarchical Relative Entropy Policy Search Daniel, C., Neumann, G., Kroemer, O., Peters, J. Journal of Machine Learning Research, 17(93):1-50, 2016 (Published) URL BibTeX

Empirical Inference Autonomous Motion Article Probabilistic Inference for Determining Options in Reinforcement Learning Daniel, C., van Hoof, H., Peters, J., Neumann, G. Machine Learning, Special Issue, 104(2):337-357, (Editors: Gärtner, T., Nanni, M., Passerini, A. and Robardet, C.), European Conference on Machine Learning im Machine Learning, Journal Track, 2016, Best Student Paper Award of ECML-PKDD 2016 DOI BibTeX

2019 Progress Report