Download Document

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Characterizing and Exploiting
Task-Load Variability and Correlation
for Energy Management
in Multi-Core Systems
Soner Yaldiz , Alper Demir, Serdar Tasiran
Koç University, Istanbul, Turkey
Paolo Ienne, Yusuf Leblebici
Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland
ESTIMedia 2005
Multi-Core Soft Real-Time Systems
• Chip-level multiprocessing for massive performance
– Energy management problem
• Real-time multimedia applications
– Audio, video processing
• Soft real-time systems
– Tolerance to deadline misses
processors
T1
task graph
T2
start
end
T3
MPEG2 video frames
t
ESTIMedia 2005
T4
time
t + TDEADLINE
2
Variability and Correlation
• This work: First approach to consider variability and
correlations for multiprocessor energy management
variability
Task1
workload
workload
Task2
• Capture by Stochastic Models
• Exploit for Energy Management
Voltage
Taski
positive
correlation
workload
probability
V1
V2
– Dynamic Voltage Scaling (DVS)
deadline
ESTIMedia 2005
Time
3
Motivating Example
• Application composed of two tasks on a single processor
T1
start
T2
end
TDEADLINE = 2 sec
• Task loads low (2) or high (10) with equal probability
probability
T1,T2
50%
2
50%
10
instructions
• Processor Operating Modes
– Slow Mode -> 6 instructions-per-second
– Fast Mode -> 10 instructions-per-second
ESTIMedia 2005
4
Task Load Combinations
probability
T1
start
T2
T1,T2
end
TDEADLINE = 2 sec
50%
2
50%
10
instructions
Probabilities for task load combinations:
Independent
T1
T2
2
10
25% 25%
10 25% 25%
2
Positively Correlated
T1
T2
2
10
2
50% 0
10
0 50%
ESTIMedia 2005
Negatively Correlated
T1
T2
2
10
0 50%
10 50%
0
2
5
Motivating Example
Independent
Application
–
2 tasks
Processor modes
–
–
Slow 6 inst/sec
Fast 10 inst/sec
Deadline
–
2 sec
Target
Positively
Correlated
T1
T2
2
2
10
25% 25%
10 25% 25%
1.0
0.75
Positive Correlation
Target
100%
Assumption Independent
Reality
2
10
T1
T2
2
10
2
50%
0
2
0
50%
10
0
50%
10 50%
0
never happens !
Slow mode -> 12 instructions in 2 sec
Misses desired performance
Assumption Independent
Reality
T1
T2
0.50
75%
Negatively
Correlated
Negative Correlation
Fast mode -> 20 instructions in 2 sec
Suboptimal energy
ESTIMedia 2005
6
OUTLINE
• Stochastic Modeling
• Energy Management Scheme
– OFFLINE Optimization
– ONLINE Adjustments
• Experimental Results
• Conclusions
ESTIMedia 2005
7
Stochastic Modeling Flow
start
T1
T2
OBSERVATIONS
Task1
Task2
1
2
10
2
5
5
3
2
5
4
10
2
5
5
5
6
2
10
7
2
5
8
5
5
end
• Computational Demand (CD) of a task
– Number of CPU cycles for execution
• Demands are represented by dist
– Quantized for manageability
• dist is obtained from a set of traces
• Demand of tasks constitutes an
‘observation’
– (T1,T2) = ( 5, 5 ) observed 3 out of 8.
– dist ( 5,5 ) = 3/8
T1
T2
dist
5
10
2
2/8
2/8
5
3/8
10
ESTIMedia 2005
2
1/8
8
Case Study: MPEG2
•
MPEG2 video decoding
•
Slice-based task decomposition(Olukotun et.al,1998)
– Widely-used and computationally intensive
– VLD ( Variable-length decoding)
– MC ( Motion compensation )
VLD0, MC0
VLD1, MC1
VLD2, MC2
...
..
.
Data Precedence
Experimental Data:
–
–
–
–
Task Assignment
10 movie segments
19 slices, 38 tasks
24 frames-per-second
~ 14000 frames per movie
Processor Precedence
ESTIMedia 2005
9
Variability of MPEG2 Task Loads
2- Longaggregate
Tails
aggregate
Worst-Case causes overdesign
1- Similarity
Traning set predicts workload
for others
one movie
ESTIMedia 2005
one movie
10
Correlation among MPEG2 Task Loads
High Correlation
aggregate
statistics
one movie
...
...
...
ESTIMedia 2005
...
11
Critical Path
•
•
•
Summation of worst-case task loads
: 64 million cycles
Observed worst-case total load
: 28 million cycles
Ignoring correlations lead to far from optimal
ESTIMedia 2005
12
OUTLINE
• Stochastic Modeling
• Energy Management Scheme
– OFFLINE Optimization
– ONLINE Adjustments
• Experimental Results
• Conclusions
ESTIMedia 2005
13
OFFLINE: Optimization Formulation
• Each task i has fixed voltage Vi for all periods
• GOAL: Determine optimal Vi’s
minimize
average energy consumption
subject to
completion probability
• Nonlinear constrained optimization problem with 38 variables
– One voltage per task
• Stochastic programming formulation
– Based on stochastic application model
• Optimized voltages stored for run-time look-up
ESTIMedia 2005
14
ONLINE Adjustments
• When low load is detected, lower the task voltage
– Preserving probabilistic performance
• Very small run-time expense
– Few comparisons and arithmetic operations
Load lower than expected
Slow down further
ESTIMedia 2005
15
OUTLINE
• Stochastic Modeling
• Energy Management Scheme
– OFFLINE Optimization
– ONLINE Adjustments
• Experimental Results
• Conclusions
ESTIMedia 2005
16
Experimental Setup
•
Compared with approaches for multiprocessor systems:
– I (Zhang et. al, DAC2002 )
• Ignores variability, correlations
• 100% completion
• Worst-case task load
– II ( Hua et. al, EMSOFT2003 )
• Ignores correlations
• Completion Probability
• Marginal load distribution
•
Training set: 8 movie segments out of 10
•
Test set has 2 movies not included in training set.
•
Three completion probabilities PCON
– 0.90, 0.95, 0.99
•
Two deadlines
– Normal , Tight
ESTIMedia 2005
17
Experiment I : Normal Deadline
Movie #
PCON=0.90
I
Avg E
Avg Pr
II
860 154
PCON=0.95
PCON=0.99
OFLN
ONLN
I
II
OFLN
ONLN
I
II
100
98
833
147
100
97
764
129
0.9026
1. Significant energy savings
0.9511
OFLN ONLN
100
91
0.9899
2. Desired completion probability achieved
ESTIMedia 2005
18
Experiment II : Tight Deadline
• II (Hua2003) fails with
tight deadline
– Ignores correlations
• ONLN improves more
• Accurate stochastic model
Avg E
Avg Pr
100
95
0.9030
100
91
0.9515
100
70
0.9898
ESTIMedia 2005
19
Experiment III: Comparison with GOD
Single Movie
OFFLINE
ONLINE
GOD
PCON = 0.99
100
66
52
PCON = 0.95
100
86
72
PCON = 0.90
100
92
76
• GOD
– Ideal, Unrealizable, Non-causal
– For every individual frame
• Knows load of each task
• Computes optimal voltages
• There is still room for future work
– “application state” structure
ESTIMedia 2005
20
Conclusions
• Demonstrated significant variability and
correlations among workloads of MPEG2 tasks
• Our stochastic models capture essential
characteristics of applications
– Accurately predict performance
• Novel energy management scheme based on
stochastic models
– Significant energy savings
ESTIMedia 2005
21
- Questions ?
Characterizing and Exploiting
Task-Load Variability and Correlation
for Energy Management
in Multi-Core Systems
Soner Yaldiz , Alper Demir, Serdar Tasiran
Koç University, Istanbul, Turkey
Paolo Ienne, Yusuf Leblebici
Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland
ESTIMedia 2005
Related documents