posted on 2012-08-01, 00:00authored byDavid F. Fouhey, Vincent Delaitre, Abhinav GuptaAbhinav Gupta, Alexei Efros, Ivan Laptev, Josef Sivic
<p>We present an approach which exploits the coupling between human actions and scene geometry. We investigate the use of human pose as a cue for single-view 3D scene understanding. Our method builds upon recent advances in still-image pose estimation to extract functional and geometric constraints about the scene. These constraints are then used to improve state-of-the-art single-view 3D scene understanding approaches. The proposed method is validated on a collection of monocular time lapse sequences collected from YouTube and a dataset of still images of indoor scenes. We demonstrate that observing people performing different actions can significantly improve estimates of 3D scene geometry.</p>