MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction

  • Zhao Wang
  • , Jiachen Liu
  • , Sheng Zhang
  • , Yishu Li
  • , Sili Chen
  • , Sharon X. Huang
  • , Yong Jin Liu
  • , Hengkai Guo

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

This paper presents a generalizable 3D plane detection and reconstruction framework named MonoPlane. Unlike previous robust estimator-based works (which require multiple images or RGB-D input) and learning-based works (which suffer from domain shift), MonoPlane combines the best of two worlds and establishes a plane reconstruction pipeline based on monocular geometric cues, resulting in accurate, robust and scalable 3D plane detection and reconstruction in the wild. Specifically, we first leverage large-scale pre-trained neural networks to obtain the depth and surface normals from a single image. These monocular geometric cues are then incorporated into a proximity-guided RANSAC framework to sequentially fit each plane instance. We exploit effective 3D point proximity and model such proximity via a graph within RANSAC to guide the plane fitting from noisy monocular depths, followed by image-level multi-plane joint optimization to improve the consistency among all plane instances. We further design a simple but effective pipeline to extend this single-view solution to sparse-view 3D plane reconstruction. Extensive experiments on a list of datasets demonstrate our superior zero-shot generalizability over baselines, achieving state-of-the-art plane reconstruction performance in a transferring setting. Our code is available at https://github.com/thuzhaowang/MonoPlane.

Original languageEnglish (US)
Title of host publication2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages8481-8488
Number of pages8
ISBN (Electronic)9798350377705
DOIs
StatePublished - 2024
Event2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 - Abu Dhabi, United Arab Emirates
Duration: Oct 14 2024Oct 18 2024

Publication series

NameIEEE International Conference on Intelligent Robots and Systems
ISSN (Print)2153-0858
ISSN (Electronic)2153-0866

Conference

Conference2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period10/14/2410/18/24

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Software
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction'. Together they form a unique fingerprint.

Cite this