Perceiving Systems Talk Biography
17 February 2026 at 16:00 - 17:00 | virtual

SAM3D Body

ORGANIZERS
Thumb ticker sm mmexport1647962767824 3
Perceiving Systems
  • Research Scientist
Thumb ticker xxl qsums2og 400x400

SAM 3D Body (3DB) is a promptable model for single-image full-body 3D human mesh recovery (HMR) that demonstrates state-of-the-art performance, with strong generalization and consistent accuracy in diverse in-the-wild conditions. 3DB estimates the human pose of the body, feet, and hands. It is the first model to use a new parametric mesh representation, Momentum Human Rig (MHR), which decouples skeletal structure and surface shape. 3DB employs an encoder–decoder architecture and supports auxiliary prompts, including 2D keypoints and masks, enabling userguided inference similar to the SAM family of models. We derive high-quality annotations from a multi-stage annotation pipeline that uses various combinations of manual keypoint annotation, differentiable optimization, multi-view geometry, and dense keypoint detection. Our data engine efficiently selects and processes data to ensure data diversity, collecting unusual poses and rare imaging conditions. We present a new evaluation dataset organized by pose and appearance categories, enabling nuanced analysis of model behavior. Our experiments demonstrate superior generalization and substantial improvements over prior methods in both qualitative user preference studies and traditional quantitative analysis. Both 3DB and MHR are open-source.

Speaker Biography

Nicolas Ugrinovic Kehdy (Carnegie Mellon University)

Postdoctoral Researcher

Nicolas is a Postdoctoral Researcher at Carnegie Mellon University, working with Kris Kitani. Previously, he worked at Meta (FAIR) as part of the SAM3D team. He obtained his PhD from IRI-UPC under the supervision of Francesc Moreno-Noguer and Alberto Sanfeliu. During this time, he interned at Amazon Science in Tubingen. He was also a visitor at Stanford University in Leonidas Guibas Lab and Naver Labs Europe with Gregory Rogez. His research is in areas of computer vision and deep learning, specifically on 3D human pose/motion estimation, human motion generation, and reconstruction of 3D bodies.