Groups and Crowds

Figure: The top row illustrates the hierarchical correlation clustering formulation for multi-person tracking. A dotted line indicates that an edge is cut. The detection graph is first partitioned into 7 components, indicating 7 people (top left); the global clustering then associates these components, resulting in 4 persons (top right). The middle row shows qualitative tracking and segmentation results on the MOT16 benchmark. The solid line under each bounding box indicates the lifetime of the track. The bottom row illustrates the DeepCut model for multi-person pose estimation. Initial detections (bottom left) and the pairwise terms between all detections are jointly clustered, and each detection is labeled with its body part class. The bottom right shows the predicted pose sticks.

People are often a central element of visual scenes. A long-standing goal in computer vision is to develop computational models that enable machines to detect crowds of people, analyze their motion and poses, infer their actions, and reason about the consequences. Our research addresses a wide range of challenges in the visual understanding of people in real-world crowded scenes. These include multi-person tracking, multi-person pose estimation, segmentation, and person re-identification.

For multi-target tracking, our work proposed to link, cluster, and track targets jointly across space and time. We defined a novel mathematical abstraction for tracking in the form of a minimum cost multicut problem. To prevent distinct but similar-looking targets from being assigned to the same track, we then formulated tracking as a minimum cost lifted multicut problem.
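To make the multicut abstraction concrete, the following is a minimal sketch, not the solver used in the publications. It brute-forces the minimum cost multicut of a tiny, fully connected detection graph by enumerating every partition of the nodes. The function names, the toy costs, and the sign convention (a cost is paid when an edge is cut, so positive costs favour joining two detections and negative costs favour separating them) are assumptions chosen for illustration. The lifted variant, which adds long-range edges that affect the cost without adding connectivity, is not shown.

```python
def partitions(n):
    """Yield every partition of n nodes as a canonical label tuple."""
    def extend(labels, next_label):
        if len(labels) == n:
            yield tuple(labels)
            return
        # A node may join any existing cluster or open one new cluster.
        for lab in range(next_label + 1):
            yield from extend(labels + [lab], max(next_label, lab + 1))
    yield from extend([], 0)


def min_cost_multicut(num_nodes, costs):
    """Return (cost, labels) of the cheapest partition.

    costs maps an edge (u, v) to the cost paid when that edge is cut,
    i.e. when u and v end up in different clusters. The graph is
    assumed complete, so every partition is a valid decomposition.
    """
    best_cost, best_labels = float("inf"), None
    for labels in partitions(num_nodes):
        cost = sum(c for (u, v), c in costs.items() if labels[u] != labels[v])
        if cost < best_cost:
            best_cost, best_labels = cost, labels
    return best_cost, best_labels


# Hypothetical toy costs: detections 0 and 1 look alike, as do 2 and 3.
toy_costs = {
    (0, 1): 2.0, (2, 3): 1.5,      # cutting these is expensive
    (0, 2): -1.0, (0, 3): -2.0,    # cutting these is rewarded
    (1, 2): -1.5, (1, 3): -0.5,
}
cost, labels = min_cost_multicut(4, toy_costs)
print(cost, labels)
```

The number of clusters is not fixed in advance; it falls out of the optimisation, which is what lets a single formulation decide both how many people there are and which detections belong to each. Exhaustive enumeration is only feasible for a handful of nodes, so realistic instances need heuristic or LP-based solvers.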

Our work on person re-identification presented a novel method for matching people across different images, in which second-order (bilinear) pooling fuses the feature maps from the pose estimator and the appearance estimator. The method significantly advanced the state of the art on many challenging public benchmarks.
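The fusion step can be illustrated with a short sketch of bilinear pooling. This is an illustration under stated assumptions rather than the published implementation: the function name `bilinear_pool`, the average over spatial locations, and the final L2 normalisation are choices made here for the example. At each location the appearance vector and the part (pose) vector are combined by an outer product, and the results are pooled into one descriptor.

```python
import math


def bilinear_pool(appearance, parts):
    """Fuse two feature maps by bilinear (second-order) pooling.

    appearance and parts are lists with one feature vector per spatial
    location. Returns a flattened, L2-normalised descriptor of length
    len(appearance[0]) * len(parts[0]).
    """
    if not appearance or len(appearance) != len(parts):
        raise ValueError("feature maps must be non-empty and aligned")
    a_dim, p_dim = len(appearance[0]), len(parts[0])
    pooled = [0.0] * (a_dim * p_dim)
    for a_vec, p_vec in zip(appearance, parts):
        # Outer product of the two vectors at this location.
        for i, a in enumerate(a_vec):
            for j, p in enumerate(p_vec):
                pooled[i * p_dim + j] += a * p
    # Average over locations, then L2-normalise (assumed here).
    pooled = [v / len(appearance) for v in pooled]
    norm = math.sqrt(sum(v * v for v in pooled))
    return [v / norm for v in pooled] if norm > 0 else pooled


# Two locations, 2-D appearance features, 2-D part features (toy values).
descriptor = bilinear_pool([[1.0, 0.0], [0.0, 2.0]], [[1.0, 0.0], [0.0, 1.0]])
print(descriptor)
```

Because each appearance channel is weighted by each part channel at the same location, the pooled descriptor records which appearance occurs on which body part, so two images can be compared part by part even when the poses differ.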

This work forms a foundation for our ongoing work on estimating detailed 3D motions of people in crowded scenes.

Members

Siyu Tang, Guest Scientist, Perceiving Systems

Michael Black, Director, Perceiving Systems

Peter Vincent Gehler, Research Group Leader, Perceiving Systems

Publications

Ma, L., Tang, S., Black, M. J., Van Gool, L. Customized Multi-Person Tracker. In Computer Vision – ACCV 2018, Asian Conference on Computer Vision, Springer International Publishing, December 2018.

Suh, Y., Wang, J., Tang, S., Mei, T., Lee, K. M. Part-Aligned Bilinear Representations for Person Re-identification. In European Conference on Computer Vision (ECCV), 11218:418-437, Springer, Cham, September 2018.

Keuper, M., Tang, S., Andres, B., Brox, T., Schiele, B. Motion Segmentation & Multiple Object Tracking by Correlation Co-Clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018.

Tang, S., Andriluka, M., Andres, B., Schiele, B. Multiple People Tracking by Lifted Multicut and Person Re-identification. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 3701-3710, IEEE Computer Society, Washington, DC, USA, July 2017.

Pishchulin, L., Insafutdinov, E., Tang, S., Andres, B., Andriluka, M., Gehler, P., Schiele, B. DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4929-4937, IEEE, June 2016.