Structure Discovery in Multi-modal Data: a Region-based Approach

Collet, Alvaro; Srinivasa, Siddhartha S.; Hebert, Martial

doi:10.1184/R1/6561008.v1

Structure Discovery in Multi-modal Data: a Region-based Approach

journal contribution

posted on 2011-05-01, 00:00 authored by Alvaro Collet, Siddhartha S. Srinivasa, Martial Hebert

The ability of a perception system to discern what is important in a scene and what is not is an invaluable asset, with multiple applications in object recognition, people detection and SLAM, among others. In this paper, we aim to analyze all sensory data available to separate a scene into a few physically meaningful parts, which we term structure, while discarding background clutter. In particular, we consider the combination of image and range data, and base our decision in both appearance and 3D shape. Our main contribution is the development of a framework to perform scene segmentation that preserves physical objects using multi-modal data. We combine image and range data using a novel mid-level fusion technique based on the concept of regions that avoids any pixel-level correspondences between data sources. We associate groups of pixels with 3D points into multi-modal regions that we term regionlets, and measure the structure-ness of each regionlet using simple, bottom-up cues from image and range features. We show that the highest-ranked regionlets correspond to the most prominent objects in the scene. We verify the validity of our approach on 105 scenes of household environments.

History

Date

2011-05-01

Usage metrics

Keywords

Cameras Computational modeling Humans Image color analysis Image segmentation Shape Three dimensional displays Adaptive Agents and Intelligent Robotics

Licence

In Copyright

Structure Discovery in Multi-modal Data: a Region-based Approach

History

Date

Usage metrics

Categories

Keywords

Licence

Exports