Empirische Inferenz Members Publications

Reinforcement Learning

000ddd
Ball-in-a-cup was learned with the EM-like policy search approach called PoWER [File Icon].
2016 progress report

Members

Thumb ticker sm 12009745 10103538825457245 7502907506146263960 n
Empirische Inferenz
Research Group Leader
no image
Empirische Inferenz
no image
Empirische Inferenz
Thumb ticker sm hennig lowres cropped
Probabilistic Numerics, Empirische Inferenz
Affiliated Researcher
no image
Empirische Inferenz
no image
Empirische Inferenz
no image
Empirische Inferenz
no image
Empirische Inferenz

Publications

Empirical Inference Article PAC-Bayesian Inequalities for Martingales Seldin, Y., Laviolette, F., Cesa-Bianchi, N., Shawe-Taylor, J., Auer, P. IEEE Transactions on Information Theory, 58(12):7086-7093, June 2012 (Published) PDF Web DOI BibTeX

Empirical Inference Conference Paper Hierarchical Relative Entropy Policy Search Daniel, C., Neumann, G., Peters, J. In Fifteenth International Conference on Artificial Intelligence and Statistics, 22:273-281, JMLR Proceedings, (Editors: Lawrence, N. D. and Girolami, M.), JMLR.org, AISTATS, April 2012 PDF Web BibTeX

Empirical Inference Conference Paper Structured Apprenticeship Learning Boularias, A., Kroemer, O., Peters, J. In European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2012), 2012 PDF Web BibTeX

Empirical Inference Article Policy Search for Motor Primitives in Robotics Kober, J., Peters, J. Machine Learning, 84(1-2):171-203, July 2011 PDF PDF DOI BibTeX

Empirical Inference Conference Paper Relative Entropy Inverse Reinforcement Learning Boularias, A., Kober, J., Peters, J. In JMLR Workshop and Conference Proceedings Volume 15: AISTATS 2011, 182-189, (Editors: Gordon, G. , D. Dunson, M. Dudík ), MIT Press, Cambridge, MA, USA, Fourteenth International Conference on Artificial Intelligence and Statistics, April 2011 PDF Web BibTeX

Empirical Inference Conference Paper A Non-Parametric Approach to Dynamic Programming Kroemer, O., Peters, J. In Advances in Neural Information Processing Systems 24, 1719-1727, (Editors: J Shawe-Taylor and RS Zemel and P Bartlett and F Pereira and KQ Weinberger), Twenty-Fifth Annual Conference on Neural Information Processing Systems (NIPS 2011), 2011 PDF Web BibTeX

Empirical Inference Probabilistic Numerics Conference Paper Optimal Reinforcement Learning for Gaussian Systems Hennig, P. In Advances in Neural Information Processing Systems 24, 325-333, (Editors: J Shawe-Taylor and RS Zemel and P Bartlett and F Pereira and KQ Weinberger), Twenty-Fifth Annual Conference on Neural Information Processing Systems (NIPS 2011), 2011 PDF Web BibTeX

Empirical Inference Conference Paper PAC-Bayesian Analysis of Contextual Bandits Seldin, Y., Auer, P., Laviolette, F., Shawe-Taylor, J., Ortner, R. In Advances in Neural Information Processing Systems 24, 1683-1691, (Editors: J Shawe-Taylor and RS Zemel and P Bartlett and F Pereira and KQ Weinberger), Twenty-Fifth Annual Conference on Neural Information Processing Systems (NIPS 2011), 2011 PDF PDF Web BibTeX

Empirical Inference Conference Paper PILCO: A Model-Based and Data-Efficient Approach to Policy Search Deisenroth, M., Rasmussen, C. In Proceedings of the 28th International Conference on Machine Learning, ICML 2011, 465-472, (Editors: L Getoor and T Scheffer), Omnipress, 2011 Web BibTeX

Empirical Inference Autonomous Motion Conference Paper Relative Entropy Policy Search Peters, J., Mülling, K., Altun, Y. In Proceedings of the Twenty-Fourth National Conference on Artificial Intelligence, 1607-1612, (Editors: Fox, M. , D. Poole), AAAI Press, Menlo Park, CA, USA, Twenty-Fourth National Conference on Artificial Intelligence (AAAI-10), July 2010 PDF Web BibTeX

Empirical Inference Article Gaussian Process Dynamic Programming Deisenroth, M., Rasmussen, C., Peters, J. Neurocomputing, 72(7-9):1508-1524, March 2009 PDF PDF DOI BibTeX