Download Multiresolution Methods for Image Processing

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Multiresolution Methods for Image Processing
Chiara Olivieri
Department of Computer and Information Science
[email protected]
Matematica, Forme, Immagini, 19/03/2010
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Outline
1
Human Vision Model
2
Multiresolution Representations
3
The Steerable Pyramid Transform
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
A Model for Human Vision System
The center of the human vision is the
primary visual cortex.
It is located in the posterior pole of the
occipital cortex;
It is highly specialized for processing
information about static and moving objects
and is excellent in pattern recognition.
The neurons of the primary visual cortex (V1
neurons) can discriminate small changes
in visual orientations, spatial frequencies
and colors.
Seems to be that early responses of V1
neurons act as sets of selective
spatiotemporal filters.
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
A Top/Down Approach in Human Vision
Let’s think for a moment we are
looking at a beautiful sight out of the
windows...
we always look first at the scene
in its totality;
after we focus on the details of
the scene.
This is the intuitive approach of a
multiresolution transform.
A multiresolution transform can be
seen as a simplification and
restriction of the complex and high
non-linear functioning of V1 neurons.
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
A Multiresolution Framework
A multiresolution representation bases its main
concepts on the system of the sight of human being.
It provides a simple hierarchical framework for interpreting the
image information.
At different resolutions, the details of an image generally
characterize different physical structures of the scene.
At coarse resolution, the details correspond to larger structures.
At high resolution, we obtain the finer details.
The basis functions of these representations are well localized
in spatial position, orientation, and scale.
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
A Sparse Representation
The basis functions subdivide the information of the original
image in a set of subband of coefficients.
The structural information of the image is then subdivided in few
large-magnitude coefficients spread in several subbands.
The intuitive explanation is that images contain smooth areas
interspersed with occasional sharp transitions (e.g., edges).
The smooth regions produce small-amplitude coefficients, and
the transitions produce sparse large-amplitude coefficients.
As a rule of thumb...
Large-magnitude coefficients tend to lie along ridges with
orientation matching that of the subband.
Large-magnitude coefficients also tend to occur at the same
relative spatial locations in subbands at adjacent scales, and
orientations.
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Histogram of the Coefficients
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
General Model for Decomposition
Given a starting image I, a typical multiresolution algorithm for
decomposition has the following steps:
Algorithm
1
filter I with a set of band-pass filters;
every band-pass filter should select a specific orientation;
2
filter I with a low pass filter;
obtaining a new image of reducted resolution;
3
4
since now we have more information than needed,
downsample I, obtaining an new image I 0 of half size;
repeat the first step on the downsampled image I 0 .
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Example: the Steerable Pyramid Transform
First level of resolution
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Example: the Steerable Pyramid Transform
Second level of resolution
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Example: the Steerable Pyramid Transform
Third level of resolution
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
A Completely Adaptive Representation
The steerable pyramid transform is a multiresolution
representation that is a little different from classical wavelet.
A filter is called ”steerable” if it can be written as a linear sum of
rotated versions of itself.
The steerable pyramid can be more formally described with the
frame theory.
The formulation of the filters of the steerable pyramid allows a
complete control on the number of orientations we want to
analyze.
The maximum number of scales is fixed by the size of the image.
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Good and Not so Good Properties...
The steerable filters provide an invertible and aliasing-free
reconstruction and can be easily adapted to create sets of odd
and even filters.
Problems
Advantages
invariant for translation;
invariant for rotation;
perfect-reconstruction with
Fourier domain
implementation.
the space-domain
implementation is not
perfect-reconstruction;
overcompleteness by a factor
of 4K /3, where K is the
number of orientation bands;
tricky 3D extension.
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Pyramidal Decomposition
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Set of Basis Filters
In order to preserve the invertibility of the reconstruction we
must set some constraints on the set of steerable filters.
Requisites
|H0 (r )|2 + |L0 (r )|2 = 1
PK −1
|H0 (r )|2 + |L0 (r )|2 [|L1 (r )|2 + k =0 Bk (r )2 ] =
1
r L
=
2r P
L
|L(r )|2 + K −1 |Bk (r , φ)|2
k =0
2
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Steerable Filters
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Denoising with Steerable Pyramid
A denoising algorithm with the Steerable Pyramid Transform
(and in general with every wavelet transform) consists of three
main stages:
Denoising
1
decomposition of the starting image in a set of
coefficient images;
2
estimation of the noise-free coefficients from the noisy
ones;
3
reconstruction of the image from the updated
coefficients.
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Denoising through Thresholding
In the wavelet domain, the noise is uniformly spread
throughout the coefficients, while most of the image
information is concentrated in the few largest ones.
The most straightforward way to distinguish noise from
information consists of the thresholding of the wavelet
coefficients.
Thresholding functions
hard thresholding f (x) = 0 if |x| ≤ T ,
x elsewhere
soft thresholding
f (x) = 0 if |x| ≤ T ,
x + T if x < −T
x − T if x > T ,
”our” thresholding f (x) = x/(1 + exp(−S(|x| − T )))
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Thresholding Functions
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Results
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Results
C. Olivieri
Multiresolution Methods
Human Vision Model
Multiresolution Representations
The Steerable Pyramid Transform
Results
C. Olivieri
Multiresolution Methods
Appendix
For Further Reading
For Further Reading I
S. Mallat.
A Wavelet Tour of Signal Processing, Third Edition: The
Sparse Way.
Academic Press, 2008.
J. Portilla, V. Strela, M.J. Wainwright, E.P. Simoncelli.
Image denoising using scale mixtures of Gaussians in the
wavelet domain.
IEEE Transactions on Image Processing, 2003.
C. Olivieri
Multiresolution Methods
Related documents