Back

Autonomous Learning Members Publications

Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints

Dual force
The paper presents an offline algorithm that leverages Van der Waals force, Successor Features, and pre-trained Functional Reward Encoding to maximize skill diversity under imitation constraints, eliminate the need for a skill discriminator, handle non-stationary rewards, and enable stable, efficient training with zero-shot recall of all trained skills.

Members

no image
Autonomous Learning
no image
Autonomous Learning
Thumb ticker sm georg 2018 crop small
Empirische Inferenz, Autonomous Learning
Senior Research Scientist

Publications

Autonomous Learning Conference Paper Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints Kolev, P., Vlastelica, M., Martius, G. In Seventeenth European Workshop on Reinforcement Learning, August 2024 (Accepted) URL BibTeX