Variational Stereo Vision with Sharp Discontinuities and Occlusion Handling

00:00 02-07-2009
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; DIRECTION: ltr; LINE-HEIGHT: 150%; unicode-bidi: embed; TEXT-ALIGN: justify"><font face="Arial" size="2">In binocular stereo vision, we have a pair of images captured by two cameras at different locations. The objective of a <i>dense</i> stereo vision method is to recover the depth of each pixel in one of the captured images (the reference image). The central task is to establish the <i>correspondence</i> function, which maps each pixel in the reference image to the point in the counterpart image that is projected from the same location <span class="625485419-27062009">in </span>the scene. The correspondence map is commonly represented by the <i>disparity</i> function, which gives the shift in location between corresponding pixels.</font></p> <p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; DIRECTION: ltr; LINE-HEIGHT: 150%; unicode-bidi: embed; TEXT-ALIGN: justify"><font face="Arial"><font size="2">I'll start my talk with an introduction to stereo vision and <span class="625485419-27062009">the </span>correspondence problem. Then I'll address the problem of establishing correspondence in binocular stereo vision. We suggest a novel variational approach that accounts for both discontinuities and half-occlusions. Our algorithm is defined in a spatially continuous setting, providing inherent sub-pixel evaluation of the disparity function. Depth discontinuities are preserved by using the celebrated <i>Mumford-Shah</i> framework in a novel formulation. The proposed method concurrently evaluates a <i>dense</i> disparity map, the half-occlusion map, and a discontinuity function revealing the location of the disparity boundaries. We evaluate our method on real data sets from the Middlebury site, showing superior performance in comparison to the state-of-the-art variational method. 
Performance assessment with the Middlebury measures ranks our method among the top three stereo matching algorithms<span class="625485419-27062009"> (as of February 2009)</span><span class="625485419-27062009">,</span> <span class="625485419-27062009">considering sub-pixel accuracy. (</span></font></font><font face="Arial" size="2">Joint work with Prof. Nir Sochen, Tel-Aviv University)</font></p> <p class="MsoNormal" dir="ltr" style="DIRECTION: ltr; unicode-bidi: embed; TEXT-ALIGN: left" align="left"><span style="FONT-SIZE: 14px; FONT-FAMILY: Arial">Light refreshments will be served at 10:45 am. Everybody is welcome.</span></p>