* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Treshold partitioning …
Linear least squares (mathematics) wikipedia , lookup
Rotation matrix wikipedia , lookup
Eigenvalues and eigenvectors wikipedia , lookup
Determinant wikipedia , lookup
Principal component analysis wikipedia , lookup
Matrix (mathematics) wikipedia , lookup
Four-vector wikipedia , lookup
Jordan normal form wikipedia , lookup
System of linear equations wikipedia , lookup
Orthogonal matrix wikipedia , lookup
Singular-value decomposition wikipedia , lookup
Cayley–Hamilton theorem wikipedia , lookup
Non-negative matrix factorization wikipedia , lookup
Matrix calculus wikipedia , lookup
Gaussian elimination wikipedia , lookup
Threshold partitioning
for iterative aggregation – disaggregation method
Ivana Pultarova
Czech Technical University in Prague, Czech Republic
ILAS 2004
1
We consider column stochastic irreducible matrix B of type N × N.
The Problem is to find stationary probability vector xp, || xp || = 1,
Bx p x p .
We explore the iterative aggregation-disaggregation method (IAD).
Notation:
Spectral decomposition of B, B = P + Z, P 2 = P, ZP = PZ = 0, r(Z) < 1
(spectral radius).
Number of aggregation groups n, n < N.
Restriction matrix R of type n × N. The elements are 0 or 1, all column sums
are 1.
Prolongation N × n matrix S(x) for any positive vector x :
(S(x))ij := xi iff (R)ji = 1, then divide all elements in each column with the
sum of the column.
Projection N × N matrix P(x) = S(x) R.
|| . || denote 1-norm.
ILAS 2004
2
Iterative aggregation disaggregation algorithm:
step 1. Take the first approximation x0 RN, x0 > 0, and set k = 0.
s
step 2. Solve RB S(xk) zk+1 = zk+1, zk+1 Rn, || zk+1 || = 1, for
(appropriate) integer s, (solution on the coarse level).
step 3. Disaggregate xk+1,1 = S(xk) zk+1.
t
step 4. Compute xk+1 = B xk+1,1 for appropriate integer t,
(smoothing on the fine level).
step 5. Test whether || xk+1 – xk|| is less then a prescribed tolerance. If not,
increase k and go to step 2. If yes, consider xk+1 be the solution of
the problem.
ILAS 2004
3
Propositon 1.
If s = t then the computed approximations xk, k = 1,2,…, follow the formulae
s
a) B P(xk) xk+1 = xk+1,
s
b) xk+1 = (I – Z P(xk))-1 xp,
c) xk+1 – xp = J(xk) (xk – xp),
where J(x) = Bs(I – P(x) Zs)-1(I – P(x))
and also J(x) = Bs(I – P(x) + P(x) J(x)).
Proposition 2.
s
Let V be a global core matrix associated with B . Then
J(x) = V(I – P(x) V)-1(I – P(x)) and J(x) = V(I – P(x) + P(x) J(x)).
ILAS 2004
4
Note.
s
The global core matrix V is here ηP + Z . Using Z k→ 0 for k → ∞,
s
we have V = ηP + Z ≥ 0 for a given η and for a sufficiently large s.
s
s
This is equivalent to B = P + Z ≥ (1- η) P.
ILAS 2004
5
Local convergence.
It is known that for arbitrary integers t ≥ 1 and s ≥ 1 there exists
a neighborhood Ω of xp such, that if xk Ω then xr Ω, r = k +1, k + 2,…,
and that
k
|| xk+1 - xp|| ≤ c α || xk - xp||,
where c R and α ≤ min{|| Vloc ||μ, ||(I-P(xp))Z(I-P(xp))||μ}, where ||.||μ is
some special norm in RN.
Here, Vloc is a local core matrix associated with B.
Thus, the local convergence rate of IAD algorithm is the same or better
comparing with the Jacobi iteration of the original matrix B.
ILAS 2004
6
Global convergence.
From Proposition 2 we have
||J(xk)|| ≤ ||V|| ||I – P(xk)|| + ||V|| ||P(xk)|| ||J(xk)||,
i.e.
||J(xk)|| (1 – η) < 2η.
So that the sufficient condition for the global convergence of IAD is
η < 1/3, i.e. the relation
B s > (2/3) P
is the sufficient condition for the global convergence of IAD method.
(It also means r(Z s) ≤ 1/3. B s ≥ (2/3) P is equivalent to P/3 + Z s ≥ 0.
Then P + 3Z s ≥ 0 is a spectral decomposition of an irreducible column
stochastic matrix and then r(Z s) ≤ 1/3.)
ILAS 2004
7
In practical computation of large problems we cannot verify the validity of
s
relation B ≥ η P > 0 to estimate the value of s.
k
But, we can predict the constant k for which B > 0. The value is known to
be less than or equal to N 2- 2 N + 2.
ILAS 2004
8
We propose a new method for achieving B s ≥ ηP > 0 with some η > 0.
Let I – B = M – W be a regular splitting, M -1 ≥ 0, W ≥ 0. Then the solution
of Problem is identical with solution of
(M – W) x = 0.
Denoting Mx = y and setting y := y / || y ||, we have
(I – WM -1) y = 0,
where WM -1 is column stochastic matrix.
Thus, the solution of the Problem is transformed to the solution of
WM -1 y = y,
|| y || = 1,
for any regular splitting M, W of the matrix I – B.
ILAS 2004
9
The good choice of M, W.
According to IAD algorithm we will use a block diagonal matrix M which is
composed of blocks M1, … Mn , each of them invertible.
To achieve (WM -1) s > 0 for low s, we need
Mi-1> 0, i =1,…, n,
nnz (WM -1) >> nnz (B), (number of nonzeros).
ILAS 2004
10
Algorithm of a good partitioning.
step 1. For an apropriate threshold τ, 0 < τ < 1, use Tarjan’s parametrized
algorithm to find the irreducible diagonal blocks Bi, i = 1,…,n,
of the permuted matrix B, (we now suppose “B := permuted B”).
step 2. Compose the block diagonal matrix BTar from the blocks Bi,
i = 1,…,n, and set
M = I – BTar / 2 and W = M – (I – B).
Properties of WM -1 .
WM -1 is irreducible.
Diagonal blocks of WM -1 are positive.
(WM -1) s is positive for s ≤ n2 - 2n + 3, n is the number of aggregation
groups. (n = 3 → s = 2)
The second largest eigenvalue of the aggregated n × n matrix is
approximately the same as that of WM -1.
ILAS 2004
11
Example 1.
B1
B
...
0
0
B1 ... 0
.... ... ...
... B1
T
...
0 0...
1 0... 0
B1
... ...
...
0 0... 1
1
0
...
0
0... ...
... ... ...
0... ... 0
Matrix B is composed from n × n blocks of size m. We set ε = 0.01,
δ = 0.01. Then B is normalized.
ILAS 2004
12
Example 1.
a) IAD method for WM -1
and threshold Tarjan’s
block matrix M, s = 1,
r(ZWM) = 0.9996. (Exact
solution – red, the last of
approximations - black circles).
b) Power iterations for
WM -1 and the same M as
in a), s = 1, r(ZWM) =
0.9996. (Exact solution – red,
the last of 500 approximations black circles. No local
convergence effect.).
c) Rates of convergence
of a) and b).
ILAS 2004
13
Example 2.
B1
B
...
0
0
B1 ... 0
.... ... ...
... B1
T
...
0 0...
1 0... 0
B1
... ...
...
0 0... 1
1
0
...
0
0... ...
... ... ...
0... ... 0
Matrix B is composed from n × n blocks of size m. We set ε = 0.01,
δ = 0.01. Then B := B + C (10% of C are 0.1) and normalized.
ILAS 2004
14
Example 2.
IAD for B and
WM -1.
Power method
for B and
WM -1.
Convergence
rate for IAD
and power
method.
ILAS 2004
15
Example 2.
Another
random entries.
a)
IAD for B and
WM -1.
b)
Power method
for B and
WM -1.
c)
Convergence
rate for IAD
and power
method.
ILAS 2004
16
I. Marek and P. Mayer
Convergence analysis of an aggregation/disaggregation iterative method for computation
stationary probability vectors
Numerical Linear Algebra With Applications, 5, pp. 253-274, 1998
I. Marek and P. Mayer
Convergence theory of some classes of iterative aggregation-disaggregation methods for
computing stationary probability vectors of stochastic matrices
Linear Algebra and Its Applications, 363, pp. 177-200, 2003
G. W. Stewart
Introduction to the numerical solutions of Markov chains, 1994
A. Berman, R. J. Plemmons
Nonnegative matrices in the mathematical sciences, 1979
G. H. Golub, C. F. Van Loan
Matrix Computations, 1996
ETC.
ILAS 2004
17