OctNet: Learning Deep 3D Representations at High Resolutions


Autonomous Vision Perceiving Systems Conference Paper 2017


Autonomous Vision

Gernot Riegler

Perceiving Systems, Autonomous Vision

Osman Ulusoy

Autonomous Vision, Perceiving Systems

Andreas Geiger

Guest Scientist

We present OctNet, a representation for deep learning with sparse 3D data. In contrast to existing models, our representation enables 3D convolutional networks that are both deep and high-resolution. To this end, we exploit the sparsity of the input data to hierarchically partition the space using a set of unbalanced octrees in which each leaf node stores a pooled feature representation. This focuses memory allocation and computation on the relevant dense regions and enables deeper networks without compromising resolution. We demonstrate the utility of the OctNet representation by analyzing the impact of resolution on several 3D tasks, including 3D object classification, orientation estimation, and point cloud labeling.
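The partitioning idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation or their data layout. It is a minimal recursive octree over a dense NumPy grid, written under stated assumptions: the grid is cubic with a power-of-two side, a cell is split only while it mixes occupied and empty voxels, and each leaf stores the mean of its voxel features as its pooled feature. The names `build_octree` and `count_leaves` are hypothetical.

```python
import numpy as np


def build_octree(features, occupancy, origin=(0, 0, 0), size=None):
    """Recursively partition a cubic grid into an unbalanced octree.

    features:  (D, D, D, C) float array of per-voxel features
    occupancy: (D, D, D) bool array, True where data is present
    D is assumed to be a power of two (illustrative assumption).
    """
    if size is None:
        size = occupancy.shape[0]
    x, y, z = origin
    occ = occupancy[x:x + size, y:y + size, z:z + size]

    # Stop at a single voxel or at a homogeneous cell (all empty or all full).
    if size == 1 or occ.all() or not occ.any():
        cell = features[x:x + size, y:y + size, z:z + size]
        # Assumed pooling: average the features of all voxels in the leaf.
        pooled = cell.reshape(-1, cell.shape[-1]).mean(axis=0)
        return {"origin": origin, "size": size, "feature": pooled}

    # Mixed cell: split into eight octants and recurse.
    half = size // 2
    children = [
        build_octree(features, occupancy, (x + dx, y + dy, z + dz), half)
        for dx in (0, half)
        for dy in (0, half)
        for dz in (0, half)
    ]
    return {"origin": origin, "size": size, "children": children}


def count_leaves(node):
    """Count the leaf cells of the tree returned by build_octree."""
    if "children" not in node:
        return 1
    return sum(count_leaves(child) for child in node["children"])


# Usage: an 8x8x8 grid with a single occupied voxel.
occupancy = np.zeros((8, 8, 8), dtype=bool)
occupancy[0, 0, 0] = True
features = occupancy[..., None].astype(np.float32)  # one feature channel

tree = build_octree(features, occupancy)
print(count_leaves(tree), "leaves instead of", occupancy.size, "voxels")
# prints "22 leaves instead of 512 voxels"
```

In this toy case the tree refines only around the occupied voxel, so large empty regions collapse into a few coarse leaves. That is the sense in which memory and computation concentrate where the data is.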

Author(s):	Gernot Riegler and Osman Ulusoy and Andreas Geiger
Links:	pdf suppmat Project Page Video
Book Title:	Proceedings IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
Pages:	6620-6629
Year:	2017
Month:	July
Day:	21-26
Publisher:	IEEE

Project(s):	Learning Deep Representations of 3D; Efficient volumetric inference with OctNet; Efficient and Scalable Inference
Bibtex Type:	Conference Paper (inproceedings)

Address:	Piscataway, NJ, USA
Event Name:	IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Event Place:	Honolulu, HI, USA

Electronic Archiving:	grant_archive
ISBN:	978-1-5386-0457-1
ISSN:	1063-6919

BibTeX

@inproceedings{Riegler2017CVPR,
  title = {OctNet: Learning Deep 3D Representations at High Resolutions},
  booktitle = {Proceedings IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017},
  abstract = {We present OctNet, a representation for deep learning with sparse 3D data. In contrast to existing models, our representation enables 3D convolutional networks which are both deep and high resolution. Towards this goal, we exploit the sparsity in the input data to hierarchically partition the space using a set of unbalanced octrees where each leaf node stores a pooled feature representation. This allows to focus memory allocation and computation to the relevant dense regions and enables deeper networks without compromising resolution. We demonstrate the utility of our OctNet representation by analyzing the impact of resolution on several 3D tasks including 3D object classification, orientation estimation and point cloud labeling.},
  pages = {6620--6629},
  publisher = {IEEE},
  address = {Piscataway, NJ, USA},
  month = jul,
  year = {2017},
  slug = {riegler2016arxiv},
  author = {Riegler, Gernot and Ulusoy, Osman and Geiger, Andreas},
  month_numeric = {7}
}
