Towards Accurate Marker-less Human Shape and Pose Estimation over Time

Institute Homepage

Institute Homepage DE Sign In

Back

Perceiving Systems Conference Paper 2017

Code

Perceiving Systems

Yinghao Huang

Guest Scientist

Perceiving Systems

Federica Bogo

Perceiving Systems

Christoph Lassner

Affiliated Researcher

Perceiving Systems

Angjoo Kanazawa

Perceiving Systems

Peter Vincent Gehler

Research Group Leader

Perceiving Systems

Ijaz Akhter

Perceiving Systems

Michael Black

Director

Perceiving Systems

Javier Romero

Affiliated Researcher

Existing markerless motion capture methods often assume known backgrounds, static cameras, and sequence specific motion priors, limiting their application scenarios. Here we present a fully automatic method that, given multiview videos, estimates 3D human pose and body shape. We take the recently proposed SMPLify method [12] as the base method and extend it in several ways. First we fit a 3D human body model to 2D features detected in multi-view images. Second, we use a CNN method to segment the person in each image and fit the 3D body model to the contours, further improving accuracy. Third we utilize a generic and robust DCT temporal prior to handle the left and right side swapping issue sometimes introduced by the 2D pose estimator. Validation on standard benchmarks shows our results are comparable to the state of the art and also provide a realistic 3D shape avatar. We also demonstrate accurate results on HumanEva and on challenging monocular sequences of dancing from YouTube.

Author(s):	Yinghao Huang and Federica Bogo and Christoph Lassner and Angjoo Kanazawa and Peter V. Gehler and Javier Romero and Ijaz Akhter and Michael J. Black
Links:	Code
Book Title:	International Conference on 3D Vision (3DV)
Pages:	421--430
Year:	2017

Project(s):	Optimizing Human Pose and Shape Humans from Video
Bibtex Type:	Conference Paper (inproceedings)

DOI:	10.1109/3DV.2017.00055

Electronic Archiving:	grant_archive
Attachments:	pdf

BibTex

@inproceedings{MuVS:3DV:2017,
  title = {Towards Accurate Marker-less Human Shape and Pose Estimation over Time},
  booktitle = {International Conference on 3D Vision (3DV)},
  abstract = {Existing markerless motion capture methods often assume known backgrounds, static cameras, and sequence specific motion priors, limiting their application scenarios. Here we present a fully automatic method that, given multiview videos, estimates 3D human pose and body shape. We take the recently proposed SMPLify method [12] as the base method and extend it in several ways. First we fit a 3D human body model to 2D features detected in multi-view images. Second, we use a CNN method to segment the person in each image and fit the 3D body model to the contours, further improving accuracy. Third we utilize a generic and robust DCT temporal prior to handle the left and right side swapping issue sometimes introduced by the 2D pose estimator. Validation on standard benchmarks shows our results are comparable to the state of the art and also provide a realistic 3D shape avatar. We also demonstrate accurate results on HumanEva and on challenging monocular sequences of dancing from YouTube.},
  pages = {421--430},
  year = {2017},
  slug = {muvs-3dv-2017},
  author = {Huang, Yinghao and Bogo, Federica and Lassner, Christoph and Kanazawa, Angjoo and Gehler, Peter V. and Romero, Javier and Akhter, Ijaz and Black, Michael J.}
}

Research

Departments

Research Groups

People

Contact

Our Institute

Our History

Career

Doctoral Programs

Training

Service Units

Central Scientific Facilities

Workshops

Campus Services

Impact

Cooperation

Partners and Initiatives

Research

Departments

Research Groups

People

Contact

Our Institute

Our History

Career

Doctoral Programs

Training

Service Units

Central Scientific Facilities

Workshops

Campus Services

Impact

Cooperation

Partners and Initiatives

BibTex