Download Control of Dynamic Discrete-Event Systems

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Hendrik Wade Bode wikipedia , lookup

Control (management) wikipedia , lookup

Control theory wikipedia , lookup

Perceptual control theory wikipedia , lookup

Control system wikipedia , lookup

Resilient control systems wikipedia , lookup

Transcript
Control of Dynamic
Discrete-Event Systems
Lenko Grigorov
Master’s Thesis, QU
supervisor: Dr. Karen Rudie
Discrete-Event Systems
(background)

Discrete-Event Systems are systems where
events (changes of state) occur:




spontaneously
logically ordered relative to each other
not tied to a continuous global time
Common representation of DESs:


Finite-state machines
G=(,Q,,q0,Qf)
(Cassandras and Lafortune, Introduction to Discrete Event Systems, 1999)
Dec. 2003
Lenko Grigorov, QU
2
Motivation

Control of a specific class of DESs






dynamic (change with time)
relatively large
with continuous lifecycle
with requirements with different levels of
stringency
Such systems are common in real life
Classical DES control methods are not
suitable
Dec. 2003
Lenko Grigorov, QU
3
Outline
1.
2.
3.
4.
Dec. 2003
Definition of Dynamic Discrete-Event
Systems
Redundancy for Modular Architecture
Optimal DDES control
Experiment
Lenko Grigorov, QU
4
DES Modular
Architecture
(background)


Separate small DES
modules
Combined using a
synchronous composition


an event can happen in a
module  it can happen in
the system
if modules have common
events, these events
happen simultaneously
(Cassandras and Lafortune, Introduction to Discrete Event Systems, 1999)
Dec. 2003
Lenko Grigorov, QU
5
Dynamic DES Model

Time



Sets of modules



Mi = {M1i, M2i, …, Mni}, i{0, 1, 2, …}
||Mi = M1i || M2i || … || Mni
DDES G={(||Mi, i) | i{0, 1, 2, …})


discrete
increases by one after every event
at time i, Gi = ||Mi
No restrictions on the sets Mi
Dec. 2003
Lenko Grigorov, QU
6
Redundancy for
Modular Architecture

Mi  Mi+1  


Given operation 



thus some part of ||Mi may be reused to compute
||Mi+1
commutative
associative
How to compute A = A1  A2  …  An so
that recomputing A after a structure change
is least expensive?

Dec. 2003
redundant storage of intermediate results
Lenko Grigorov, QU
7
Redundancy Structures

Stack structure – simple



Tree structure – robust to random changes


use when: the result of the operation does not
increase exponentially, older modules are stable
disadvantages: large size when used with
synchronous composition
use when: the oldest elements have highest
chance to change
Hybrid structure – small footprint


Dec. 2003
use when: there is small storage space
disadvantages: may demand more computations,
while savings in space are insignificant
Lenko Grigorov, QU
8
Complexity of
Redundancy Structures
Dec. 2003
Lenko Grigorov, QU
9
Standard Online Control
(background)




Construct a limited-depth tree of the possible
future behavior of the system
For each node, determine if the string leading
to it is acceptable
Propagate the information back to the root
Disable events leading to “unsafe” states


where we cannot prevent the execution of an
unacceptable string
Repeat this after each execution of an event
(Chung, Lafortune, and Lin, “Limited lookahead policies in supervisory
control of discrete event systems”, 1992)
Dec. 2003
Lenko Grigorov, QU
10
Valuation of Event Strings

Two functions are defined by the user

The value function gives the “value” of event
strings, according to some criteria




The goal function indicates which strings
accomplish a task



Dec. 2003
v(s)  R, s  L(G)
greater v(s)  string is more desirable
v(s) = -  string is unacceptable
g(s)  {0,1}, s  L(G)
no need to investigate the look-ahead tree further
similar to final (marked) states, but works on strings
Lenko Grigorov, QU
11
Optimal DDES Control
Algorithm

Online control using the value and goal
functions

exploration of a branch in the tree is carried until





Dec. 2003
a goal is generated,
an unacceptable string is generated, or
the depth limit is reached
the value function is used to obtain the benefit of
the different paths (event strings)
the controller selects the path which may yield the
greatest benefit
Lenko Grigorov, QU
12
Advantages (1)

Attempts to guide the system to the
maximal benefit for the user


quality depends on the way the system
evolves
The use of the value function renders
the control process more robust to
failures

Dec. 2003
it does the best possible with the available
resources
Lenko Grigorov, QU
13
Advantages (2)

The algorithm adapts automatically to
different types of dynamics in the system

structural changes


changes of goals


the constituent modules change
changes in the requirements for the system behavior
changes in the evaluation of events

events have varying costs


Dec. 2003
depending on the event string
depending on time
Lenko Grigorov, QU
14
Advantages (3)

Requirements on the system behavior
can have many levels of stringency


The method does not need access to
the complete system model


not only acceptable/unacceptable
can work with large systems
The algorithm can be implemented as
modular software
Dec. 2003
Lenko Grigorov, QU
15
Issues in
Optimal DDES Control

The control may not be optimal

if the tree depth is too limited


if the tree depth is too big


the controller cannot observe far enough along the event
strings to compute relevant costs and payoffs
the controller bases its decisions on the current system,
while the system may change in the future
The complexity of the algorithm is affected by
the particular value and goal functions used


Dec. 2003
may be significantly greater than the complexity of
standard online control
O(kNv(s)g(s))
Lenko Grigorov, QU
16
Experiment


Different number of trains enter or leave a
system of railroads
A set of requirements on the system behavior


no trains can be at the same section of a track,
etc.
Comparison between optimal DDES control
and simple online control
(Experiment based on: Chung, Lafortune, and Lin, “Supervisory control
using variable lookahead policies”, 1994)
Dec. 2003
Lenko Grigorov, QU
17
Experiment Results

The overall system behavior is much closer to
the requirements




strings have higher value
more trains arrive at train stations per unit time
more balanced use of the resources
Disadvantages


Dec. 2003
significant increase in the time to make a decision
can work only with a much smaller tree-depth
Lenko Grigorov, QU
18
Conclusion and Contributions

The use of redundancy structures can
reduce the number of computations
needed to rebuild the model of a
system after it changes


different types of redundancy structures
available for different applications
can be used in other areas

Dec. 2003
not limited to the synchronous composition of
modules
Lenko Grigorov, QU
19
Conclusion and Contributions

The proposed control method can be used
successfully to supervise dynamic discreteevent systems





Dec. 2003
achieves near-optimal control
adapts automatically to dynamics in the system
allows a very flexible definition of requirements
is more robust to system failures
is easily implementable as modular software
Lenko Grigorov, QU
20
Selected References



C. G. Cassandras and S. Lafortune. Introduction to
Discrete Event Systems. Kluwer Academic Publishers,
Norwell, Massachusetts, USA, 1999,
Sheng-Luen Chung, Stéphane Lafortune, and Feng
Lin. Limited lookahead policies in supervisory control
of discrete event systems. IEEE Transactions on
Automatic Control, 37(12):1921–1935, 1992,
Sheng-Luen Chung, Stéphane Lafortune, and Feng
Lin. Supervisory control using variable lookahead
policies. Discrete Event Dynamic Systems: Theory
and Applications, 4:237–268, 1994.
Dec. 2003
Lenko Grigorov, QU
21