Download On Maxent

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
On Maxent
Jorge Soberon
University of Kansas
The idea
• There is an unknown
probability
distribution, denoted
by p.
• The probabilities are
defined over the grid of
cells G
• Probability of what?
Probability of pixel g
being suitable for the
species
#
S
#
S
#
S
#
S
#
S
#
S
S
##
S
#
S
#S
#
#SS
#S
##S
S
#
S
#
S
#
S
#
S
#
S
#
S
#
S
#
S
• The values of p(g) are
probabilities, therefore,
they add to 1, and since,
in general, |G| is normally
large, for example, 105 to
107, then the probabilities
tend to be small.
• We wish to estimate p.
Our estimate is called
pˆ
p (g)  0
 p (g)  1
gG
The “features”
• Maxent assumes that for
each cell g in G, there
are “features” that give
a continuous value per
cell:
• f1(g),f2(g),… fn(g)
• Features are average
temperature, minimum
temperature, total
precipitation, elevation,
and so on…
6000
Z
4000
2000
0
• We also have a number
of data points, meaning
the observations.
• Those datapoints define
the mean value of the
features
• That is, we take the mean
value of each feature,
taken over the values in
the cells where the
species was observed
8000
And the data points…
2
1
y
0
Y
p [ y]
4
-1
-2
2
X
0
p [ x]
x
– 1) Have the same means
of features as the
observed means
– 2) It is as flat as possible
(maximizes entropy)
6000
Z
4000
2000
0
• The core of idea of
maxent is:
• Find the probability
distribution that:
8000
The guts of Maxent I.
2
1
y
0
Y
p [ y]
4
-1
-2
2
X
0
p [ x]
x
The guts of Maxent II.
• Mathematical arguments
shows that the
Maximum Entropy
distribution will be a
Gibbs distribution
• To prevent
“overfitting”, there are a
regularization factors
• Minimize
• Subject to
pˆ 
e
λ f ( x)
Z
 f ( x)
ei1 i i
v

Z
pˆ[ fi ]  p [ fi ]   j

 v
1 m
ln  Z    λf ( x j )    i i
m j 1

 i 1
q [ f1 ]  p [ f1 ]
.
.
q [ f m ]  p [ f m ]
The output of Maxent
• The estimated probabilities
of suitability for every patch
• An accumulated value
which increases the
numbers.
• Means that if we randomly
sample pixels, t% of them
will have
• A(x)  t
pˆ ( x )
A( x)  100
p
pˆ ( x )  ˆ ( x )
pˆ ( x)
300
300
4000
6000
8000
-100
GARP
10000
-200
2000
0
Temperature
0
100
200
200
100
0
Temperature
-100
E-SPACE
Precipitation
0
2000
4000
6000
8000
10000
100
0
Temperature
100
0
-100
MAXENT
-200
-100
BIOCLIM
-200
Temperature
200
200
300
300
Precipitation
0
2000
4000
6000
Precipitation
8000
10000
0
2000
4000
6000
Precipitation
8000
10000