Back
Capsule networks (see e.g. Hinton et al., 2018) aim to encode knowledge of and reason about the relationship between an object and its parts. Here we specify a generative model for such data, and derive a variational algorithm for inferring the transformation of each model object in a scene, and the assignments of observed parts to the objects. We derive a learning algorithm for the object models, based on variational expectation maximization (Jordan et al., 1999). We also study an alternative inference algorithm based on the RANSAC method of Fischler and Bolles (1981). We apply these inference methods to (i) data generated from multiple geometric objects like squares and triangles (“constellations”), and (ii) data from a parts-based model of faces. Recent work by Kosiorek et al. (2019) has used amortized inference via stacked capsule autoencoders (SCAEs) to tackle this problem—our results show that we significantly outperform them where we can make comparisons (on the constellations data). Joint work with Alfredo Nazabal and Nikolaos Tsagkas. If there is time, I will also make some comments on Structured Generative Models for Scene Understanding (arXiv 2302.03531).
Prof. Chris Williams (University of Edinburgh)
Chris Williams is Professor of Machine Learning in the School of Informatics, University of Edinburgh. He is interested in a wide range of theoretical and practical issues in machine learning, statistical pattern recognition, probabilistic graphical models and computer vision. This includes theoretical foundations, the development of new models and algorithms, and applications. His main areas of research are in visual object recognition and image understanding, models for understanding time-series, AI for data analytics, unsupervised learning, and Gaussian processes. He obtained his MSc (1990) and PhD (1994) at the University of Toronto, under the supervision of Geoff Hinton. He was a member of the Neural Computing Research Group at Aston University from 1994 to 1998, and has been at the University of Edinburgh since 1998. He was elected a Fellow of the Royal Society of Edinburgh in 2021, is a Fellow of the European Laboratory for Learning and Intelligent Systems (ELLIS), a Turing Fellow at the Alan Turing Institute (UK), and was program co-chair of the NeurIPS conference in 2009.
More information