Download Graphic Profiling of User Group Activity

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Profiling User Group Activity With SAS® Software
H. Pat Artis
Morino Associates
Vienna, VA 22180
the user's daily activities. The reader should note that this same
technique could be used to quantify activity at other levels of
granularity like hourly, weekly, or monthly observations,
1.0 Introduction
One of the most common requests made to CPE analysts respon~
sible for performance reporting by their management is to develop
report formats that highlight or easily identify significant changes
As a measure of a significant difference, the standard deviation
provides an adaptive measure that allows us to quantify change.
To accomplish this objective, one need only calculate the Zstatistic for the observation to be evaluated. Simply described, a
Z-statistic is a measure of how many important differences a
measure is larger or smaller than the mean of the population form
which it was selected, The Z-statistic is calculated:
that occur in an environment. For the performance analyst to
understand tlie managers motivation for asking for such reports, it
is important to remember that the one of the most basic concepts
of management is the by control and understanding changes.
Unfortunately, the development of such reports is not a trivial task.
To quantify and identify changes, the analyst must first determine
what magnitude of variance of a given measurement is a sign of a
significant change, and second present the information in a form
that is easily understood by the recipient of the report,
XCi)
s
One other results of the Z-transformation is that the range of
Z-statistics Over which Z can be expected to vary is well understood. For most cases (i.e., 99%+ of all observations), the resulting Z~statistics can be expected to range between -3.5 and + 3.5.
Negative Z-statistics correspond to values smaller than the mean
and positive Z-statistics correspond to observation largerthan the
mean. Thus, the sign of the Z~statistic provides an easily understood indicator of the nature of the change that occurred. For the
daily observations of transaction volume introduced above, Friday's observation is 0.97 standard deviations greater than the five
day mean. While not an overwhelming difference, it is certainly a
change in the nature of the system's workload worthy of note
In this paper, we will examine a graphical reporting tool that is
based on the use of the Z-statistic to address these management
reporting requirements. The quantification of the activity of TSO
user groups will be used as an example for this discussion.
2.0 Quantifying Changes
To quantify changes, the analyst must select or identify threshold
values which indicate significant changes in the selected
measurements. For any given measurement, the experienced
analyst can likely select a threshold value to evaluate changes.
Unfortunately, when a large number of measurements are to be
included in a single report or when a given report format is to be
produced for a large community of users, the maintenance of
tables of such threshold values becomes an unacceptable task.
Failure to maintain such tables of values has historically been one
of the most common reasons that complex exception reporting
schemes have failed to realize their potential.
One additional benefit of the Z-statistiCs is that they do not require
the recipient of the report to have any knowledge of the prior values
that have defined the measure in question, Rather, the Z-statistic
provides a simple one number indicator of change that quantifies
its direction and magnitude relative to the past history for the user
in question.
Rather than attempting to maintain a table of threshold values for
every measurement or group of users, it is desirable to develop an
adaptive means of identifying significant changes on a user group
andlor measurement basis. Fortunately, simple central tendency
statistics provides us with an adaptive means of identifying such
changes calted the standard deviation. Consider the following
daily transaction counts for a given user:
Monday
Tuesday
Wednesday
Thursday
Friday
3.0 Graphical Presentation
Using the Z-statistics discussed in the prior section, simplegraphical reports can be developed to highlight the changes in selected
workloads. A sample report employing this notion is shown in
Figure 1. The development ofthe report is discussed in the following paragraphs.
2,313
2,065
1,926
2,053
2,290
For the sample study, the activity of groups of TSO users at a
weekly level was selected for study using data collected with
TSO/MON, User groups were identified in this study by the first
three characters of the TSO LOGON ID recorded by the measurement tool. Three metrics were selected to profile the users in this
case study. They were:
To identify if Friday represented a significant change in the users
activity, we must first determine what is normal for that user on a
daily basis, Calculating the mean and standard deviation we find:
X
2,219
s
166
- X
Z(i)
average T30 short response time,
total CPU time, and
total TSO transactions.
Where the mean provides us a measure of the central tendency for
the user and the standard deviation provides us an indication of
"what a significant difference" is for the observations that quantify
The selection of these variables was a matter of convenience and
do not represent a suggested set that should be considered for
194
any group of T80 users.
While this case study concentrated on T80 measurements, the
same reporting structure could include any other workload or
service measurements that can be summarized on the basis of
user group. For example, batch job statistics, network service
levels, and DA8D consumption could easily be computed and
included in a single page report for each group of users.
To calculate the mean and standard deviation values, the activity
for the T80 user groups was summarized at a weekly level. The
most recent ten observations (i.e., last ten weeks) for each user
group was maintained in a 8AS dataset. The selection of the
number of past observations to maintain is a key decision since the
volatility (and hence the ability to quickly identify change) of the
mean and standard deviation values tends to decrease as the the
number of observations grows larger.
4.0 Remarks
A simple method for quantifying and graphically profiling the
changes in User group and/or workload activities has been presented. Using this technique, the analyst can implement an adaptive reporting system that quantifies and identifies changes in
computer workloads.
Once the mean and standard deviations are calculated for the
measurements describing a given user, the Z-statistic value can
be calculated for the most recent observation. As shown in the
figure, this Z-statistic was then used to develop a simple box plot.
(Z-statistic values greater than 3.0 or less than -3.0 are truncated to
3.0 and -3.0 respectively in this report.) In addition to the box plot,
the most recent five weekly observations for the measurements are
also shown in the report.
~ T~~~:-~~::~~:f:::~-~~:~-::~::::f:~:~;-;~:~~:::i
0.0
.~<
0.39>>=·
*\
•
:: l::--:::::::-:::::::::-:::j-:::::::-::::~:~~:--:::::J:::::::::-:::::::::--::::::.
29AtJG86
22AUG86
15.'.0086
OBAUG86
o~,
0.41
0.34
0.53
0.37
0.33
3: 03. 5-4
2: 58, 25
4:50.95
7:34.63
3: 21.15
Figure 1. Sample Report for TSO Users
195
430.0
258.0
254.0
883.0
488.0