Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
CPU Scheduling Chapter 5 CPU Scheduling • • • • • • Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor Scheduling Thread Scheduling UNIX example Basic Concepts • Maximum CPU utilization obtained with multiprogramming • CPU–I/O Burst Cycle – Process execution consists of a cycle of CPU execution and I/O wait • CPU times are generally much shorter than I/O times. CPU-I/O Burst Cycle Process B Process A Histogram of CPU-burst Times Schedulers • Process migrates among several queues – Device queue, job queue, ready queue • Scheduler selects a process to run from these queues • Long-term scheduler: – load a job in memory – Runs infrequently • Short-term scheduler: – Select ready process to run on CPU – Should be fast • Middle-term scheduler – Reduce multiprogramming or memory consumption CPU Scheduler • CPU scheduling decisions may take place when a process: 1. Switches from running to waiting state (by sleep). 2. Switches from running to ready state (by yield). 3. Switches from waiting to ready (by an interrupt). 4. Terminates (by exit). • • Scheduling under 1 and 4 is nonpreemptive. All other scheduling is preemptive. Dispatcher • Dispatcher module gives control of the CPU to the process selected by the shortterm scheduler; this involves: – switching context – switching to user mode – jumping to the proper location in the user program to restart that program • Dispatch latency – time it takes for the dispatcher to stop one process and start another running Scheduling Criteria • CPU utilization – keep the CPU as busy as possible • Throughput – # of processes that complete their execution per time unit • Turnaround time (TAT) – amount of time to execute a particular process • Waiting time – amount of time a process has been waiting in the ready queue • Response time – amount of time it takes from when a request was submitted until the first response is produced, not output (for time-sharing environment) “The perfect CPU scheduler” • Minimize latency: response or job completion time • Maximize throughput: Maximize jobs / time. • Maximize utilization: keep I/O devices busy. – Recurring theme with OS scheduling • Fairness: everyone makes progress, no one starves Scheduling Algorithms FCFS • First-come First-served (FCFS) (FIFO) – Jobs are scheduled in order of arrival – Non-preemptive • Problem: – Average waiting time depends on arrival order – Troublesome for time-sharing systems • Convoy effect short process behind long process • Advantage: really simple! First Come First Served Scheduling • Example: Process Burst Time P1 24 P2 3 P3 3 • Suppose that the processes arrive in the order: P1 , P2 , P3 P1 0 24 Waiting time for P1 = 0; P2 = 24; P3 = 27 P3 P2 27 30 Average waiting time: (0 + 24 + 27)/3 = 17 • Suppose that the processes arrive in the order: P2 , P3 , P1 . Waiting time for P1 = 6; P2 = 0; P3 = 3 P2 0 P3 3 P1 6 Average waiting time: (6 + 0 + 3)/3 = 3 30 Shortest-Job-First (SJR) Scheduling • Associate with each process the length of its next CPU burst. Use these lengths to schedule the process with the shortest time • Two schemes: – nonpreemptive – once CPU given to the process it cannot be preempted until completes its CPU burst – preemptive – if a new process arrives with CPU burst length less than remaining time of current executing process, preempt. This scheme is know as the Shortest-Remaining-Time-First (SRTF) • SJF is optimal – gives minimum average waiting time for a given set of processes Shortest Job First Scheduling • • Example: Process P1 P2 P3 P4 Non preemptive SJF P1 0 Average waiting time = (0 + 6 + 3 + 7)/4 = 4 P3 4 5 2 Arrival Time Burst Time 0 7 2 4 4 1 5 4 P1(7) 7 P2 8 P4 12 16 P1‘s wating time = 0 P2‘s wating time = 6 P2(4) P3(1) P4(4) P3‘s wating time = 3 P4‘s wating time = 7 Shortest Job First Scheduling Cont’d • • Example: Process Arrival Time Burst Time P1 0 7 P2 2 4 P3 4 1 P4 5 4 Average waiting time = (9 + 1 + 0 +2)/4 = 3 Preemptive SJF P1 P2 P3 P2 P4 P1 P1(7) 0 2 P1(5) 4 P2(4) P2(2) P3(1) P4(4) 5 7 11 16 P1‘s wating time = 9 P2‘s wating time = 1 P3‘s wating time = 0 P4‘s wating time = 2 Shortest Job First Scheduling Cont’d • Optimal scheduling • However, there are no accurate estimations to know the length of the next CPU burst Shortest Job First Scheduling Cont’d • • Optimal for minimizing queueing time, but impossible to implement. Tries to predict the process to schedule based on previous history. Predicting the time the process will use on its next schedule: t( n+1 ) = Here: w * t( n ) + ( 1 - w ) * T( n ) t(n+1) is time of next burst. t(n) is time of current burst. T(n) is average of all previous bursts . W is a weighting factor emphasizing current or previous bursts. Priority Scheduling • A priority number (integer) is associated with each process • The CPU is allocated to the process with the highest priority (smallest integer highest priority in Unix but lowest in Java). – Preemptive – Non-preemptive • SJF is a priority scheduling where priority is the predicted next CPU burst time. • Problem Starvation – low priority processes may never execute. • Solution Aging – as time progresses increase the priority of the process. Round Robin (RR) • Each process gets a small unit of CPU time (time quantum), usually 10-100 milliseconds. After this time has elapsed, the process is preempted and added to the end of the ready queue. • If there are n processes in the ready queue and the time quantum is q, then each process gets 1/n of the CPU time in chunks of at most q time units at once. No process waits more than (n1)q time units. Round Robin Scheduling • time quantum = 20 Process P1 P2 P3 P4 Burst Time 53 17 68 24 Wait Time 57 +24 = 81 20 37 + 40 + 17= 94 57 + 40 = 97 P1 P2 P3 P4 P1 P3 P4 P1 P3 P3 0 20 37 57 77 97 117 121 134 154 162 P1(33) 24P1(13) P1(53) 57 P2(17) 20 P3(68) P4(24) 37 57 P3(48) 40 40 17 P3(28) P3(8) P4(4) Average wait time = (81+20+94+97)/4 = 73 Round Robin Scheduling • Typically, higher average turnaround than SJF, but better response. • Performance – q large FCFS – q small q must be large with respect to context switch, otherwise overhead is too high. Turnaround Time Varies With The Time Quantum TAT can be improved if most process finish their next CPU burst in a single time quantum. Multilevel Queue • Ready queue is partitioned into separate queues: EX: • foreground (interactive) background (batch) • Each queue has its own scheduling algorithm EX – foreground – RR – background – FCFS • Scheduling must be done between the queues – Fixed priority scheduling; (i.e., serve all from foreground then from background). Possibility of starvation. – Time slice – each queue gets a certain amount of CPU time which it can schedule amongst its processes; EX 80% to foreground in RR 20% to background in FCFS Multilevel Queue Scheduling Multi-level Feedback Queues Implement multiple ready queues – Different queues may be scheduled using different algorithms – Just like multilevel queue scheduling, but assignments are not static • Jobs move from queue to queue based on feedback – Feedback = The behavior of the job, • EX does it require the full quantum for computation, or • does it perform frequent I/O ? • Need to select parameters for: – Number of queues – Scheduling algorithm within each queue – When to upgrade and downgrade a job Example of Multilevel Feedback Queue • Three queues: – Q0 – RR with time quantum 8 milliseconds – Q1 – RR time quantum 16 milliseconds – Q2 – FCFS • Scheduling – A new job enters queue Q0 which is served FCFS. When it gains CPU, job receives 8 milliseconds (RR). If it does not finish in 8 milliseconds, job is moved to queue Q1. – At Q1 job is again served FCFS and receives 16 additional milliseconds (RR). If it still does not complete, it is preempted and moved to queue Q2. – AT Q2 job is served FCFS Multilevel Feedback Queues Multiple-Processor Scheduling • CPU scheduling more complex when multiple CPUs are available Different rules for homogeneous or heterogeneous processors. • Load sharing in the distribution of work, such that all processors have an equal amount to do. • Asymmetric multiprocessing – only one processor accesses the system data structures, alleviating the need for data sharing • Symmetric multiprocessing (SMP) – each processor is self-scheduling Each processor can schedule from a common ready queue OR each one can use a separate ready queue. Thread Scheduling • On operating system that support threads the kernel-threads (not processes) that are being scheduled by the operating system. • Local Scheduling (process-contention-scope PCS )– How the threads library decides which thread to put onto an available LWP • PTHREAD_SCOPE_PROCESS • Global Scheduling (system-contention-scope SCS )– How the kernel decides which kernel thread to run next • PTHREAD_SCOPE_PROCESS Linux Scheduling • Two algorithms: time-sharing and real-time • Time-sharing – Prioritized credit-based – process with most credits is scheduled next – Credit subtracted when timer interrupt occurs – When credit = 0, another process chosen – When all processes have credit = 0, recrediting occurs • Based on factors including priority and history • Real-time – Defined by Posix.1b – Real time Tasks assigned static priorities. All other tasks have dynamic (changeable) priorities. The Relationship Between Priorities and Time-slice length List of Tasks Indexed According to Prorities Conclusion • We’ve looked at a number of different scheduling algorithms. • Which one works the best is application dependent. – General purpose OS will use priority based, round robin, preemptive – Real Time OS will use priority, no preemption. References • Some slides from – Text book slides – Kelvin Sung - University of Washington, Bothell – Jerry Breecher - WPI – Einar Vollset - Cornell University