Download Part 3

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
TANGENT
ALERT!
What happens when no correspondence is possible?
Highly mismatched stereo-pairs lead to ‘binocular rivalry’
Open question:
Can rivalry and fusion coexist?
Computational theories for solving the correspondence problem:
Given the underconstrained matching problem (100! Possible pairings in an RDS with
100 dots), what assumptions can we bring to bear?
Assumption 1: Epipolar constraint
Marr-Poggio’s network-based formulation of the problem:
Assumptions:
1.
2.
3.
Surface opacity
/ match uniqueness
Surface continuity
Match compatibility
Sample result of Marr-Poggio’s network:
Enhancing the Marr-Poggio’s model:
Edge-based matching rather than pixel matching.
Advantages:
1. Edge orientation and polarity provide additional matching constraints
2. Greater consistency with known physiology (matching begins in V1)
Disadvantages:
Enhancing the Marr-Poggio’s model:
Edge-based matching rather than pixel matching.
Advantages:
1. Edge orientation and polarity provide additional matching constraints
2. Greater consistency with known physiology (matching begins in V1)
Disadvantages:
1. Depth information is sparse; an additional process of interpolation is
is needed.
Enhancing the Marr-Poggio’s model:
Edge-based matching rather than pixel matching.
Advantages:
1. Edge orientation and polarity provide additional matching constraints
2. Greater consistency with known physiology (matching begins in V1)
Disadvantages:
1. Depth information is sparse; an additional process of interpolation is
is needed.
Open problems:
1. How to match stereo pairs where assumptions are violated?
2. How to make use of monocular shape cues?
Physiological mechanisms of stereopsis:
Hubel and Wiesel (1962):
Binocular cells in V1 not sensitive to disparity (in cats)
Barlow et al (1967):
V1 cells sensitive to disparity
Hubel and Wiesel (1970):
V1 cells not sensitive but V2 cells are (monkeys)
Poggio and Fischer (1977):
V1 cells sensitive to small disparities and V2 cells
sensitive to large disparities (awake fixating monkeys)
Cue integration:
Processing Framework Proposed by Marr
Recognition
3D structure; motion characteristics; surface properties
Shape
From
stereo
Motion
flow
Shape
From
motion
Color
estimation
Edge extraction
Image
Shape
From
contour
Shape
From
shading
Shape
From
texture
Motion Perception:
-Detecting motion and motion boundaries
-Extracting 2D motion fields
-Recovering 3D structure from motion
Motion as space-time orientation:
Computational models of motion detectors:
Delay and compare networks
Other ways of constructing movement detectors:
Psychophysical support from Anstis’ experiment (1990)
TANGENT
ALERT!
Accounting for eye-motion
Q. When do we see an object move?
A. When its image moves on the retina.
Is this really true?
TANGENT
ALERT!
Accounting for eye-motion (contd.)
The corollary discharge model (Teuber, 1960)
Predictions: 1. Pushing on the eyeball would cause the world to -------2. A stabilized after-image would appear to ------- when the eye is
moved voluntarily
3. If your eye was paralyzed with curare and you then attempted to
move it, you would see the world --------
From local motion estimates to global ones:
Local motion estimates are ambiguous due to the ‘Aperture Problem’
Subjective plaids video
From local motion estimates to global ones (contd):
Theoretically, the ‘Aperture Problem’ can be overcome by pooling
information across multiple contours.