Header logo is


2019


Learning to Explore in Motion and Interaction Tasks
Learning to Explore in Motion and Interaction Tasks

Bogdanovic, M., Righetti, L.

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, November 2019 (conference)

Abstract
Model free reinforcement learning suffers from the high sampling complexity inherent to robotic manipulation or locomotion tasks. Most successful approaches typically use random sampling strategies which leads to slow policy convergence. In this paper we present a novel approach for efficient exploration that leverages previously learned tasks. We exploit the fact that the same system is used across many tasks and build a generative model for exploration based on data from previously solved tasks to improve learning new tasks. The approach also enables continuous learning of improved exploration strategies as novel tasks are learned. Extensive simulations on a robot manipulator performing a variety of motion and contact interaction tasks demonstrate the capabilities of the approach. In particular, our experiments suggest that the exploration strategy can more than double learning speed, especially when rewards are sparse. Moreover, the algorithm is robust to task variations and parameter tuning, making it beneficial for complex robotic problems.

mg

arXiv [BibTex]

2019


arXiv [BibTex]


EM-Fusion: Dynamic Object-Level SLAM With Probabilistic Data Association
EM-Fusion: Dynamic Object-Level SLAM With Probabilistic Data Association

Strecke, M., Stückler, J.

In International Conference on Computer Vision, October 2019, arXiv:1904.11781 (inproceedings)

ev

preprint Project page Poster DOI [BibTex]

preprint Project page Poster DOI [BibTex]


no image
Robust Humanoid Locomotion Using Trajectory Optimization and Sample-Efficient Learning

Yeganegi, M. H., Khadiv, M., Moosavian, S. A. A., Zhu, J., Prete, A. D., Righetti, L.

Proceedings International Conference on Humanoid Robots, IEEE, 2019 IEEE-RAS International Conference on Humanoid Robots, October 2019 (conference)

Abstract
Trajectory optimization (TO) is one of the most powerful tools for generating feasible motions for humanoid robots. However, including uncertainties and stochasticity in the TO problem to generate robust motions can easily lead to intractable problems. Furthermore, since the models used in TO have always some level of abstraction, it can be hard to find a realistic set of uncertainties in the model space. In this paper we leverage a sample-efficient learning technique (Bayesian optimization) to robustify TO for humanoid locomotion. The main idea is to use data from full-body simulations to make the TO stage robust by tuning the cost weights. To this end, we split the TO problem into two phases. The first phase solves a convex optimization problem for generating center of mass (CoM) trajectories based on simplified linear dynamics. The second stage employs iterative Linear-Quadratic Gaussian (iLQG) as a whole-body controller to generate full body control inputs. Then we use Bayesian optimization to find the cost weights to use in the first stage that yields robust performance in the simulation/experiment, in the presence of different disturbance/uncertainties. The results show that the proposed approach is able to generate robust motions for different sets of disturbances and uncertainties.

mg

https://arxiv.org/abs/1907.04616 link (url) [BibTex]

https://arxiv.org/abs/1907.04616 link (url) [BibTex]


no image
Variational Autoencoders Recover PCA Directions (by Accident)

Rolinek, M., Zietlow, D., Martius, G.

In Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) 2019, June 2019 (inproceedings)

Abstract
The Variational Autoencoder (VAE) is a powerful architecture capable of representation learning and generative modeling. When it comes to learning interpretable (disentangled) representations, VAE and its variants show unparalleled performance. However, the reasons for this are unclear, since a very particular alignment of the latent embedding is needed but the design of the VAE does not encourage it in any explicit way. We address this matter and offer the following explanation: the diagonal approximation in the encoder together with the inherent stochasticity force local orthogonality of the decoder. The local behavior of promoting both reconstruction and orthogonality matches closely how the PCA embedding is chosen. Alongside providing an intuitive understanding, we justify the statement with full theoretical analysis as well as with experiments.

al

arXiv link (url) Project Page [BibTex]

arXiv link (url) Project Page [BibTex]


no image
Efficient Humanoid Contact Planning using Learned Centroidal Dynamics Prediction

Lin, Y., Ponton, B., Righetti, L., Berenson, D.

International Conference on Robotics and Automation (ICRA), pages: 5280-5286, IEEE, May 2019 (conference)

mg

DOI [BibTex]

DOI [BibTex]


Leveraging Contact Forces for Learning to Grasp
Leveraging Contact Forces for Learning to Grasp

Merzic, H., Bogdanovic, M., Kappler, D., Righetti, L., Bohg, J.

In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) 2019, IEEE, International Conference on Robotics and Automation, May 2019 (inproceedings)

Abstract
Grasping objects under uncertainty remains an open problem in robotics research. This uncertainty is often due to noisy or partial observations of the object pose or shape. To enable a robot to react appropriately to unforeseen effects, it is crucial that it continuously takes sensor feedback into account. While visual feedback is important for inferring a grasp pose and reaching for an object, contact feedback offers valuable information during manipulation and grasp acquisition. In this paper, we use model-free deep reinforcement learning to synthesize control policies that exploit contact sensing to generate robust grasping under uncertainty. We demonstrate our approach on a multi-fingered hand that exhibits more complex finger coordination than the commonly used two- fingered grippers. We conduct extensive experiments in order to assess the performance of the learned policies, with and without contact sensing. While it is possible to learn grasping policies without contact sensing, our results suggest that contact feedback allows for a significant improvement of grasping robustness under object pose uncertainty and for objects with a complex shape.

am mg

video arXiv [BibTex]

video arXiv [BibTex]


no image
Control What You Can: Intrinsically Motivated Task-Planning Agent

Blaes, S., Vlastelica, M., Zhu, J., Martius, G.

In Advances in Neural Information Processing (NeurIPS’19), pages: 12520-12531, Curran Associates, Inc., NeurIPS'19, 2019 (inproceedings)

Abstract
We present a novel intrinsically motivated agent that learns how to control the environment in the fastest possible manner by optimizing learning progress. It learns what can be controlled, how to allocate time and attention, and the relations between objects using surprise based motivation. The effectiveness of our method is demonstrated in a synthetic as well as a robotic manipulation environment yielding considerably improved performance and smaller sample complexity. In a nutshell, our work combines several task-level planning agent structures (backtracking search on task graph, probabilistic road-maps, allocation of search efforts) with intrinsic motivation to achieve learning from scratch.

al

link (url) Project Page [BibTex]

link (url) Project Page [BibTex]


no image
Falsification of hybrid systems using symbolic reachability and trajectory splicing

Bogomolov, S., Frehse, G., Gurung, A., Li, D., Martius, G., Ray, R.

In International Conference on Hybrid Systems: Computation and Control, pages: 1-10, HSCC’19, ACM, 2019 (inproceedings)

al

DOI [BibTex]

DOI [BibTex]


no image
Learning to Disentangle Latent Physical Factors for Video Prediction

Zhu, D., Munderloh, M., Rosenhahn, B., Stückler, J.

In German Conference on Pattern Recognition (GCPR), 2019, to appear (inproceedings)

ev

dataset & evaluation code video preprint [BibTex]

dataset & evaluation code video preprint [BibTex]


no image
3D Birds-Eye-View Instance Segmentation

Elich, C., Engelmann, F., Kontogianni, T., Leibe, B.

In German Conference on Pattern Recognition (GCPR), 2019, arXiv:1904.02199, to appear (inproceedings)

ev

[BibTex]

[BibTex]

2005


no image
Magnetization reversal behavior of nanogranular CoCrPt alloy thin films studied with magnetic transmission X-ray microscopy

Fischer, P., Im, M., Eimüller, T., Schütz, G., Shin, S.

In 286, pages: 311-314, Boulder, CO, USA, 2005 (inproceedings)

mms

[BibTex]

2005


[BibTex]


no image
A dynamical systems approach to learning: a frequency-adaptive hopper robot

Buchli, J., Righetti, L., Ijspeert, A.

In Proceedings of the VIIIth European Conference on Artificial Life ECAL 2005, pages: 210-220, Springer Verlag, 2005 (inproceedings)

mg

[BibTex]

[BibTex]


no image
From Dynamic Hebbian Learning for Oscillators to Adaptive Central Pattern Generators

Righetti, L., Buchli, J., Ijspeert, A.

In Proceedings of 3rd International Symposium on Adaptive Motion in Animals and Machines – AMAM 2005, Verlag ISLE, Ilmenau, 2005 (inproceedings)

mg

[BibTex]

[BibTex]


no image
Defects distribution of Pr2Fe14B hard magnetic magnet from amorphous to nanostructures characterized by positron annihilation spectroscopy

Wu, Y. C., Sprengel, W., Reimann, K., Reichle, K. J., Goll, D., Würschum, R., Schaefer, H. E.

In PRICM 5. Proceedings of the Fifth Pacific RIM International Conference on Advanced Materials and Processing, 475-479, pages: 2123-2126, Materials Science Forum, Trans Tech, Beijing, China, 2005 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Implementing sub-ns time resolution into magnetic X-ray microscopies

Puzic, A., Stoll, H., Fischer, P., Van Waeyenberge, B., Raabe, J., Denbeaux, G., Haug, T., Weiss, D., Schütz, G.

In T115, pages: 1029-1031, Malmö/Lund, Sweden, 2005 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Learning to Feel the Physics of a Body

Der, R., Hesse, F., Martius, G.

In Computational Intelligence for Modelling, Control and Automation, CIMCA 2005 , 2, pages: 252-257, Washington, DC, USA, 2005 (inproceedings)

Abstract
Despite the tremendous progress in robotic hardware and in both sensorial and computing efficiencies the performance of contemporary autonomous robots is still far below that of simple animals. This has triggered an intensive search for alternative approaches to the control of robots. The present paper exemplifies a general approach to the self-organization of behavior which has been developed and tested in various examples in recent years. We apply this approach to an underactuated snake like artifact with a complex physical behavior which is not known to the controller. Due to the weak forces available, the controller so to say has to develop a kind of feeling for the body which is seen to emerge from our approach in a natural way with meandering and rotational collective modes being observed in computer simulation experiments.

al

[BibTex]

[BibTex]

2003


no image
Grain boundary phase transitions in the Al-Mg system and their influence on high-strain rate superplasticity

Straumal, B. B., Lopez, G. A., Mittemeijer, E. J., Gust, W., Zhilyaev, A. P.

In 216-217, pages: 307-312, Moscow, Russia, 2003 (inproceedings)

mms

[BibTex]

2003


[BibTex]


no image
Influence of grain boundary phase transitions on the diffusion-related properties

Straumal, B., Baretzky, B.

In Proceedings of the International Conference on Diffusion, Segregation and Stresses in Materials, pages: 53-64, Defect and Diffusion Forum, Scitec Publications Ltd., Moscow, Russia, 2003 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Are carbon nanostructures an efficient hydrogen storage medium?

Hirscher, M., Becher, M., Haluska, M., von Zeppelin, F., Chen, X., Dettlaff-Weglikowska, U., Roth, S.

In 356-357, pages: 433-437, Annecy, France, 2003 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Grain boundary faceting phase transition and thermal grooving in Cu

Straumal, B. B., Polyakov, S. A., Bischoff, E., Mittemeijer, E. J., Gust, W.

In Proceedings of the International Conference on Diffusion, Segregation and Stresses in Materials, 216/217, pages: 93-100, Diffusion and Defect Data, Pt. A, Defect and Diffusion Forum, Scitec Publ., Moscow, 2003 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Evolution of Fault-tolerant Self-replicating Structures

Righetti, L., Shokur, S., Capcarre, M.

In Advances in Artificial Life, pages: 278-288, Lecture Notes in Computer Science, Springer Berlin Heidelberg, 2003 (inproceedings)

Abstract
Designed and evolved self-replicating structures in cellular automata have been extensively studied in the past as models of Artificial Life. However, CAs, unlike their biological counterpart, are very brittle: any faulty cell usually leads to the complete destruction of any emerging structures, let alone self-replicating structures. A way to design fault-tolerant structures based on error-correcting-code has been presented recently [1], but it required a cumbersome work to be put into practice. In this paper, we get back to the original inspiration for these works, nature, and propose a way to evolve self-replicating structures, faults here being only an idiosyncracy of the environment.

mg

link (url) DOI [BibTex]

link (url) DOI [BibTex]


no image
Grain boundary faceting phase transition and thermal grooving in Cu

Straumal, B. B., Polyakov, S. A., Bischoff, E., Mittemeijer, E. J., Gust, W.

In Proceedings of the International Conference on Diffusion, Segregation and Stresses in Materials, 216/217, pages: 93-100, Diffusion and Defect Data, Pt. A, Defect and Diffusion Forum, Scitec Publ., Moscow, 2003 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Coercivity mechanism in nanocrystalline and bonded magnets

Goll, D., Kronmüller, H.

In Bonded Magnets. Proceedings of the NATO Advanced Research Workshop on Science and Technology of Bonded Magnets, 118, pages: 115-127, NATO Science Series: Series 2, Mathematics, Physics and Chemistry, Kluwer Acad. Publ., Newark, USA, 2003 (inproceedings)

mms

[BibTex]

[BibTex]


no image
Investigation of Electromigration in Copper Interconnects by Noise Measurements

Emelianov, V., Ganesan, G., Puzic, A., Schulz, S., Eizenberg, M., Habermeier, H., Stoll, H.

In Noise as a Tool for Studying Materials, pages: 271-281, Proceedings of SPIE, Santa Fe, New Mexico, 2003 (inproceedings)

mms

[BibTex]

[BibTex]