Indoor segmentation and support inference from RGBD images

Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Robert Fergus

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present an approach to interpret the major surfaces, objects, and support relations of an indoor scene from an RGBD image. Most existing work ignores physical interactions or is applied only to tidy rooms and hallways. Our goal is to parse typical, often messy, indoor scenes into floor, walls, supporting surfaces, and object regions, and to recover support relationships. One of our main interests is to better understand how 3D cues can best inform a structured 3D interpretation. We also contribute a novel integer programming formulation to infer physical support relations. We offer a new dataset of 1449 RGBD images, capturing 464 diverse indoor scenes, with detailed annotations. Our experiments demonstrate our ability to infer support relations in complex scenes and verify that our 3D scene cues and inferred support lead to better object segmentation.

Original languageEnglish (US)
Title of host publicationComputer Vision, ECCV 2012 - 12th European Conference on Computer Vision, Proceedings
Pages746-760
Number of pages15
Volume7576 LNCS
EditionPART 5
DOIs
StatePublished - 2012
Event12th European Conference on Computer Vision, ECCV 2012 - Florence, Italy
Duration: Oct 7 2012Oct 13 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 5
Volume7576 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other12th European Conference on Computer Vision, ECCV 2012
CountryItaly
CityFlorence
Period10/7/1210/13/12

Fingerprint

Segmentation
Integer programming
Integer Programming
Annotation
Verify
Formulation
Experiments
Interaction
Demonstrate
Experiment
Object
Relationships
Interpretation

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Silberman, N., Hoiem, D., Kohli, P., & Fergus, R. (2012). Indoor segmentation and support inference from RGBD images. In Computer Vision, ECCV 2012 - 12th European Conference on Computer Vision, Proceedings (PART 5 ed., Vol. 7576 LNCS, pp. 746-760). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7576 LNCS, No. PART 5). https://doi.org/10.1007/978-3-642-33715-4_54

Indoor segmentation and support inference from RGBD images. / Silberman, Nathan; Hoiem, Derek; Kohli, Pushmeet; Fergus, Robert.

Computer Vision, ECCV 2012 - 12th European Conference on Computer Vision, Proceedings. Vol. 7576 LNCS PART 5. ed. 2012. p. 746-760 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7576 LNCS, No. PART 5).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Silberman, N, Hoiem, D, Kohli, P & Fergus, R 2012, Indoor segmentation and support inference from RGBD images. in Computer Vision, ECCV 2012 - 12th European Conference on Computer Vision, Proceedings. PART 5 edn, vol. 7576 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 5, vol. 7576 LNCS, pp. 746-760, 12th European Conference on Computer Vision, ECCV 2012, Florence, Italy, 10/7/12. https://doi.org/10.1007/978-3-642-33715-4_54
Silberman N, Hoiem D, Kohli P, Fergus R. Indoor segmentation and support inference from RGBD images. In Computer Vision, ECCV 2012 - 12th European Conference on Computer Vision, Proceedings. PART 5 ed. Vol. 7576 LNCS. 2012. p. 746-760. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 5). https://doi.org/10.1007/978-3-642-33715-4_54
Silberman, Nathan ; Hoiem, Derek ; Kohli, Pushmeet ; Fergus, Robert. / Indoor segmentation and support inference from RGBD images. Computer Vision, ECCV 2012 - 12th European Conference on Computer Vision, Proceedings. Vol. 7576 LNCS PART 5. ed. 2012. pp. 746-760 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 5).
@inproceedings{330504d97c14436aa988c5dca1b4c2dc,
title = "Indoor segmentation and support inference from RGBD images",
abstract = "We present an approach to interpret the major surfaces, objects, and support relations of an indoor scene from an RGBD image. Most existing work ignores physical interactions or is applied only to tidy rooms and hallways. Our goal is to parse typical, often messy, indoor scenes into floor, walls, supporting surfaces, and object regions, and to recover support relationships. One of our main interests is to better understand how 3D cues can best inform a structured 3D interpretation. We also contribute a novel integer programming formulation to infer physical support relations. We offer a new dataset of 1449 RGBD images, capturing 464 diverse indoor scenes, with detailed annotations. Our experiments demonstrate our ability to infer support relations in complex scenes and verify that our 3D scene cues and inferred support lead to better object segmentation.",
author = "Nathan Silberman and Derek Hoiem and Pushmeet Kohli and Robert Fergus",
year = "2012",
doi = "10.1007/978-3-642-33715-4_54",
language = "English (US)",
isbn = "9783642337147",
volume = "7576 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
number = "PART 5",
pages = "746--760",
booktitle = "Computer Vision, ECCV 2012 - 12th European Conference on Computer Vision, Proceedings",
edition = "PART 5",

}

TY - GEN

T1 - Indoor segmentation and support inference from RGBD images

AU - Silberman, Nathan

AU - Hoiem, Derek

AU - Kohli, Pushmeet

AU - Fergus, Robert

PY - 2012

Y1 - 2012

N2 - We present an approach to interpret the major surfaces, objects, and support relations of an indoor scene from an RGBD image. Most existing work ignores physical interactions or is applied only to tidy rooms and hallways. Our goal is to parse typical, often messy, indoor scenes into floor, walls, supporting surfaces, and object regions, and to recover support relationships. One of our main interests is to better understand how 3D cues can best inform a structured 3D interpretation. We also contribute a novel integer programming formulation to infer physical support relations. We offer a new dataset of 1449 RGBD images, capturing 464 diverse indoor scenes, with detailed annotations. Our experiments demonstrate our ability to infer support relations in complex scenes and verify that our 3D scene cues and inferred support lead to better object segmentation.

AB - We present an approach to interpret the major surfaces, objects, and support relations of an indoor scene from an RGBD image. Most existing work ignores physical interactions or is applied only to tidy rooms and hallways. Our goal is to parse typical, often messy, indoor scenes into floor, walls, supporting surfaces, and object regions, and to recover support relationships. One of our main interests is to better understand how 3D cues can best inform a structured 3D interpretation. We also contribute a novel integer programming formulation to infer physical support relations. We offer a new dataset of 1449 RGBD images, capturing 464 diverse indoor scenes, with detailed annotations. Our experiments demonstrate our ability to infer support relations in complex scenes and verify that our 3D scene cues and inferred support lead to better object segmentation.

UR - http://www.scopus.com/inward/record.url?scp=84867713871&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84867713871&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-33715-4_54

DO - 10.1007/978-3-642-33715-4_54

M3 - Conference contribution

AN - SCOPUS:84867713871

SN - 9783642337147

VL - 7576 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 746

EP - 760

BT - Computer Vision, ECCV 2012 - 12th European Conference on Computer Vision, Proceedings

ER -