Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
CS6703 GRID AND CLOUD COMPUTING Unit 3 Courtesy : Dr GnanasekaranThangavel UNIT III VIRTUALIZATION Cloud deployment models: public, private, hybrid, community – Categories of cloud computing: Everything as a service: Infrastructure, platform, software - Pros and Cons of cloud computing – Implementation levels of virtualization – virtualization structure – virtualization of CPU, Memory and I/O devices – virtual clusters and Resource Management – Virtualization for data center automation. 2 This unit mainly deals of following cloud computing concepts 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 3 Three service models Four deployment models including community cloud Five major characteristics Five virtualization levels Six design objectives Seven layers Eight other common characteristics Virtual clusters Resource Management Virtualization for data center automation Definition of Cloud Computing The practice of using a network of remote servers hosted on the Internet to store, manage, and process data, rather than a local server or a personal computer. Cloud computing is a model for enabling ubiquitous, convenient, ondemand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services) that can be rapidly provisioned and released with minimal management effort or service provider interaction. This cloud model is composed of five essential characteristics, three service models, and four deployment models. 4 The following image shows that cloud computing is composed of five essential characteristics, three deployment models, and four service models as shown in the following figure: 5 Cloud Ecosystem and Enabling Technologies 6 Cost Model 7 Cloud Design Objectives 8 1. Shifting computing from desktops to data centers 2. Service provisioning and cloud economics 3. Scalability in performance 4. Data privacy protection 5. High quality of cloud services 6. New standards and interfaces Essential Characteristics: On-demand self-service. A consumer can unilaterally provision computing capabilities, such as server time and network storage, as needed automatically without requiring human interaction with each service provider. Broad network access. Capabilities are available over the network and accessed through standard mechanisms that promote use by heterogeneous thin or thick client platforms (e.g., mobile phones, tablets, laptops, and workstations). Resource pooling. The provider’s computing resources are pooled to serve multiple consumers using a multi-tenant model, with different physical and virtual resources dynamically assigned and reassigned according to consumer demand. There is a sense of location independence in that the customer generally has no control or knowledge over the exact location of the provided resources but may be able to specify location at a higher level of abstraction (e.g., country, state, or data center). Examples of resources include storage, processing, memory, and network bandwidth. 9 Rapid elasticity. Capabilities can be elastically provisioned and released, in some cases automatically, to scale rapidly outward and inward commensurate with demand. To the consumer, the capabilities available for provisioning often appear to be unlimited and can be appropriated in any quantity at any time. Measured service. Cloud systems automatically control and optimize resource use by leveraging a metering capability1 at some level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth, and active user accounts). Resource usage can be monitored, controlled, and reported, providing transparency for both the provider and consumer of the utilized service. 10 Service Models Software as a Service (SaaS). The capability provided to the consumer is to use the provider’s applications running on a cloud infrastructure. The applications are accessible from various client devices through either a thin client interface, such as a web browser (e.g., web-based email), or a program interface. The consumer does not manage or control the underlying cloud infrastructure including network, servers, operating systems, storage, or even individual application capabilities, with the possible exception of limited user-specific application configuration settings. SaaS Examples: Google Apps, Salesforce, Workday, Concur, Citrix GoToMeeting, CiscoWebEx Common SaaS Use-Case: Replaces traditional on-device software 11 Platform as a Service (PaaS). The capability provided to the consumer is to deploy onto the cloud infrastructure consumer- created or acquired applications created using programming languages, libraries, services, and tools supported by the provider. The consumer does not manage or control the underlying cloud infrastructure including network, servers, operating systems, or storage, but has control over the deployed applications and possibly configuration settings for the application-hosting environment. AWS Elastic Beanstalk, Windows Azure, Heroku, Force.com, Google App Engine, Apache Stratos. Common PaaS Use-Case: Increases developer productivity and utilization rates while also decreasing an application’s time-to-market 12 Infrastructure as a Service (IaaS). The capability provided to the consumer is to provision processing, storage, networks, and other fundamental computing resources where the consumer is able to deploy and run arbitrary software, which can include operating systems and applications. The consumer does not manage or control the underlying cloud infrastructure but has control over operating systems, storage, and deployed applications; and possibly limited control of select networking components (e.g., host firewalls). IaaS Examples: Amazon Web Services (AWS), Cisco Metapod, Google Compute Engine (GCE), Joyent Common IaaS Use-Case: Extends current data center infrastructure for temporary workloads (e.g. increased Christmas holiday site traffic) 13 Cloud service models offer customers varying levels of control over assets and services, which presents performance visibility challenges. 14 The platform and ecosystem views of cloud computing represent a new paradigm, and promote a new way of computing. 15 More granular classification on the basis of service provided are listed below: Storage-as-a-service Database-as-a-service Information-as-a-service Process-as-a-service Application-as-a-service Platform-as-a-service Integration-as-a-service Security-as-a-service Management/Governance-as-a-service Testing-as-a-service Infrastructure-as-a-service 16 Deployment Models Private cloud. The cloud infrastructure is provisioned for exclusive use by a single organization comprising multiple consumers (e.g., business units). It may be owned, managed, and operated by the organization, a third party, or some combination of them, and it may exist on or off premises. Public cloud. The cloud infrastructure is provisioned for open use by the general public. It may be owned, managed, and operated by a business, academic, or government organization, or some combination of them. It exists on the premises of the cloud provider. Amazon Elastic Compute Cloud (EC2), IBM’s Blue Cloud, Sun Cloud, Google AppEngine and Windows Azure Services Platform. 17 Community cloud. The cloud infrastructure is provisioned for exclusive use by a specific community of consumers from organizations that have shared concerns (e.g., mission, security requirements, policy, and compliance considerations). It may be owned, managed, and operated by one or more of the organizations in the community, a third party, or some combination of them, and it may exist on or off premises. Hybrid cloud. The cloud infrastructure is a composition of two or more distinct cloud infrastructures (private, community, or public) that remain unique entities, but are bound together by standardized or proprietary technology that enables data and application portability (e.g., cloud bursting for load balancing between clouds). 18 19 Cloud deployment models The concept of cloud computing has evolved from cluster, grid and utility computing. Cluster and grid computing leverage the use of many computers in parallel to solve problems of 20 any size. Utility and Software as a Service (SaaS) provide computing resource as a service with notation of pay per use. Cloud computing is a high throughput computing (HTC) paradigm whereby the infrastructure provides the service through a large data centre or server farms. The cloud computing model enables users to share to resources from anywhere at any time through their connected devices. All computations in cloud applications are distributed to servers in a data centre, cloud platforms are systems distributed through virtualization. Cloud deployment models The major cloud deployment models are 1. Public Cloud 2. Private Cloud, 3. Hybrid Cloud 4. Community Cloud 21 Advantages of Cloud Computing Cost Savings : Perhaps, the most significant cloud computing benefit is in terms of IT cost savings. Businesses, no matter what their type or size, exist to earn money while keeping capital and operational expenses to a minimum. With cloud computing, you can save substantial capital costs with zero in-house server storage and application requirements. The lack of on-premises infrastructure also removes their associated operational costs in the form of power, air conditioning and administration costs. You pay for what is used and disengage whenever you like - there is no invested IT capital to worry about. It’s a common misconception that only large businesses can afford to use the cloud, when in fact, cloud services are extremely affordable for smaller businesses. Reliability: With a managed service platform, cloud computing is much more reliable and consistent than in-house IT infrastructure. Most providers offer a Service Level Agreement which guarantees 24/7/365 and 99.99% availability. Your organization can benefit from a massive pool of redundant IT resources, as well as quick failover mechanism - if a server fails, hosted applications and services can easily be transited to any of the available servers. 22 Manageability :Cloud computing provides enhanced and simplified IT management and maintenance capabilities through central administration of resources, vendor managed infrastructure and SLA backed agreements. IT infrastructure updates and maintenance are eliminated, as all resources are maintained by the service provider. You enjoy a simple webbased user interface for accessing software, applications and services – without the need for installation - and an SLA ensures the timely and guaranteed delivery, management and maintenance of your IT services. Strategic Edge: Ever-increasing computing resources give you a competitive edge over competitors, as the time you require for IT procurement is virtually nil. Your company can deploy mission critical applications that deliver significant business benefits, without any upfront costs and minimal provisioning time. Cloud computing allows you to forget about technology and focus on your key business activities and objectives. It can also help you to reduce the time needed to market newer applications and services. 23 Lower computer costs: You do not need a high-powered and high-priced computer to run cloud computing's web-based applications. Since applications run in the cloud, not on the desktop PC, your desktop PC does not need the processing power or hard disk space demanded by traditional desktop software. When you are using web-based applications, your PC can be less expensive, with a smaller hard disk, less memory, more efficient processor... In fact, your PC in this scenario does not even need a CD or DVD drive, as no software programs have to be loaded and no document files need to be saved. Improved performance: With few large programs hogging your computer's memory, you will see better performance from your PC. Computers in a cloud computing system boot and run faster because they have fewer programs and processes loaded into memory… Reduced software costs: Instead of purchasing expensive software applications, you can get most of what you need for free-ish! most cloud computing applications today, such as the Google Docs suite. better than paying for similar commercial software which alone may be justification for switching to cloud applications. 24 Instant software updates: Another advantage to cloud computing is that you are no longer faced with choosing between obsolete software and high upgrade costs. When the application is web-based, updates happen automatically available the next time you log into the cloud. When you access a web-based application, you get the latest version without needing to pay for or download an upgrade. Improved document format compatibility. You do not have to worry about the documents you create on your machine being compatible with other users' applications or OSes There are potentially no format incompatibilities when everyone is sharing documents and applications in the cloud. Unlimited storage capacity: Cloud computing offers virtually limitless storage. Your computer's current 1 Tbyte hard drive is small compared to the hundreds of Pbytes available 25 in the cloud. Increased data reliability: Unlike desktop computing, in which if a hard disk crashes and destroy all your valuable data, a computer crashing in the cloud should not affect the storage of your data. if your personal computer crashes, all your data is still out there in the cloud, still accessible In a world where few individual desktop PC users back up their data on a regular basis, cloud computing is a data-safe computing platform! Universal document access: That is not a problem with cloud computing, because you do not take your 26 documents with you. Instead, they stay in the cloud, and you can access them whenever you have a computer and an Internet connection Documents are instantly available from wherever you are Latest version availability: When you edit a document at home, that edited version is what you see when you access the document at work. The cloud always hosts the latest version of your documents as long as you are connected, you are not in danger of having an outdated version Easier group collaboration: Sharing documents leads directly to better collaboration. Many users do this as it is an important advantages of cloud computing multiple users can collaborate easily on documents and projects Device independence. You are no longer tethered to a single computer or network. Changes to computers, applications and documents follow you through the cloud. Move to a portable device, and your applications and documents are still available. 27 Disadvantages of Cloud Computing Downtime : As cloud service providers take care of a number of clients each day, they can become overwhelmed and may even come up against technical outages. This can lead to your business processes being temporarily suspended. Additionally, if your internet connection is offline, you will not be able to access any of your applications, server or data from the cloud. Security :Although cloud service providers implement the best security standards and industry certifications, storing data and important files on external service providers always opens up risks. Using cloud-powered technologies means you need to provide your service provider with access to important business data. Meanwhile, being a public service opens up cloud service providers to security challenges on a routine basis. The ease in procuring and accessing cloud services can also give nefarious users the ability to scan, identify and exploit loopholes and vulnerabilities within a system. For instance, in a multi-tenant cloud architecture where multiple users are hosted on the same server, a hacker might try to break into the data of other users hosted and stored on the same server. However, such exploits and loopholes are not likely to surface, and the likelihood of a compromise is not great. 28 Vendor Lock-In: Although cloud service providers promise that the cloud will be flexible to use and integrate, switching cloud services is something that hasn’t yet completely evolved. Organizations may find it difficult to migrate their services from one vendor to another. Hosting and integrating current cloud applications on another platform may throw up interoperability and support issues. For instance, applications developed on Microsoft Development Framework (.Net) might not work properly on the Linux platform. Limited Control :Since the cloud infrastructure is entirely owned, managed and monitored by the service provider, it transfers minimal control over to the customer. The customer can only control and manage the applications, data and services operated on top of that, not the backend infrastructure itself. Key administrative tasks such as server shell access, updating and firmware management may not be passed to the customer or end user. 29 Requires a constant Internet connection: Cloud computing is impossible if you cannot connect to the Internet. Since you use the Internet to connect to both your applications and documents, if you do not have an Internet connection you cannot access anything, even your own documents. A dead Internet connection means no work and in areas where Internet connections are few or inherently unreliable, this could be a deal-breaker. Can be slow: Even with a fast connection, web-based applications can sometimes be slower than accessing a similar software program on your desktop PC. Everything about the program, from the interface to the current document, has to be sent back and forth from your computer to the computers in the cloud. If the cloud servers happen to be backed up at that moment, or if the Internet is having a slow day, you would not get the instantaneous access you might expect from desktop applications. 30 Does not work well with low-speed connections: Similarly, a low-speed Internet connection, such as that found with dial-up services, makes cloud computing painful at best and often impossible. Web-based applications require a lot of bandwidth to download, as do large documents. Features might be limited: This situation is bound to change, but today many web-based applications simply are not as full-featured as their desktop-based applications. For example, you can do a lot more with Microsoft PowerPoint than with Google Presentation's web-based offering Stored data might not be secure: With cloud computing, all your data is stored on the cloud. The questions is How secure is the cloud? 31 Can unauthorized users gain access to your confidential data? Stored data can be lost: Theoretically, data stored in the cloud is safe, replicated across multiple machines. But on the off chance that your data goes missing, you have no physical or local backup. Put simply, relying on the cloud puts you at risk if the cloud lets you down. HPC Systems: Not clear that you can run compute-intensive HPC applications that use MPI/OpenMP! Scheduling is important with this type of application as you want all theVM to be co-located to minimize communication latency! General Concerns: Each cloud systems uses different protocols and different APIs may not be possible to run applications between cloud based systems Amazon has created its own DB system (not SQL 92), and workflow system (many popular workflow systems out there) so your normal applications will have to be adapted to execute on these platforms. 32 33 Implementation Levels of Virtualization Virtualization technology benefits the computer and IT industries by enabling users to share expensive hardware resources by multiplexing VMs on the same set of hardware hosts. Virtual workspaces: An abstraction of an execution environment that can be made dynamically available to authorized clients by using well-defined protocols, Resource quota (e.g. CPU, memory share), Software configuration (e.g. O/S, provided services). Implement on Virtual Machines (VMs): Abstraction of a physical host machine, Hypervisor intercepts and emulates instructions from VMs, and allows management of VMs, VMWare, Xen, etc. Provide infrastructure API: Plug-ins to hardware/support structures 34 App App App OS OS OS Hypervisor Hardware Virtualized Stack Virtual Machines VM technology allows multiple virtual machines to run on a single physical machine. App App App App App Guest OS (Linux) Guest OS (NetBSD) Guest OS (Windows) VM VM VM Virtual Machine Monitor (VMM) / Hypervisor Hardware Xen VMWare UML Denali etc. Berkeley Software Distribution (BSD) is a Unix operating system derivative developed and distributed by the Computer Systems Research Group (CSRG) 35 Performance: Para-virtualization (e.g. Xen) is very close to raw physical performance! 35 DENALI offers converged infrastructure that is industry recognized and experienced in delivering enterprise desktop replacement initiatives. Robust solution design to support budget minded or more full-featured needs Extensive solution delivery Meets challenging demand of clinical environments User-mode Linux (UML) is the only prominent example of a virtualization technology which uses the capabilities of a stock Linux kernel. Xen Project is a hypervisor using a microkernel design, providing services that allow multiple computer operating systems to execute on the same computer hardware concurrently. 36 Virtualization in General Advantages of virtual machines: Run operating systems where the physical hardware is unavailable, Easier to create new machines, backup machines, etc., Software testing using “clean” installs of operating systems and software, Emulate more machines than are physically available, Timeshare lightly loaded systems on one host, Debug problems (suspend and resume the problem machine), Easy migration of virtual machines (shutdown needed or not). Run legacy systems! 37 What is the purpose and benefits? Cloud computing enables companies and applications, which are system infrastructure dependent, to be infrastructure-less. By using the Cloud infrastructure on “pay as used and on demand”, all of us can save in capital and operational investment! Clients can: Put their data on the platform instead of on their own desktop PCs and/or on their own servers. They can put their applications on the cloud and use the servers within the cloud to do processing and data manipulations etc. In computing, virtualization means to create a virtual version of a device or resource, such as a server, storage device, network or even an operating system where the framework divides the resource into one or more execution environments. 38 Difference between Traditional and Virtual machines A traditional computer runs with a host operating system specially tailored for its hardware architecture After virtualization, different user applications managed by their own operating systems (guest OS) can run on the same hardware, independent of the host OS. The Virtualization layer is the middleware between the underlying hardware and virtual machines represented in the system, also known as virtual machine monitor (VMM) or hypervisor. With sufficient storage, any computer platform can be installed in another host computer, even if they use processors with different instruction sets and run with distinct operating systems on the same hardware. 39 Virtualization Layers The virtualization software creates the abstraction of VMs by interposing (place or insert between one thing and another) a virtualization layer at various levels of a computer system. Common virtualization layers include 1. the instruction set architecture (ISA) level, 2. hardware level, 3. operating system level, 4. library support level, and 5. application level 40 Virtualization Ranging from Hardware to Applications in Five Abstraction Levels Wine is a free and open source compatibility layer software application that aims to allow applications designed for Microsoft Windows to run on Unix-like operating systems 41 Common Language Runtime (CLR), the virtual machine component of Microsoft's .NET framework, manages the execution of .NET programs. Operating-system-level virtualization is a server virtualization method in which the kernel of an operating system allows the existence of multiple isolated userspace instances, instead of just one. Such instances, which are sometimes called containers, software containers,[1] virtualization engines (VEs) or jails (FreeBSD jail or chroot jail), may look and feel like a real server from the point of view of its owners and users. Bochs (pronounced "box") is a portable IA-32 and x86-64 IBM PC compatible emulator and debugger mostly written in C++ and distributed as free software under the GNU Lesser General Public License. It supports emulation of the processor(s) (including protected mode), memory, disks, display, Ethernet, BIOS and common hardware peripherals of PCs. 42 1.Virtualization at Instruction Set Architecture (ISA) level: At the ISA level, virtualization is performed by emulating a given ISA by the ISA of the host machine. 43 Instruction set emulation leads to virtual ISAs created on any hardware machine. e.g, MIPS binary code can run on an x-86-based host machine with the help of ISA emulation. With this approach, it is possible to run a large amount of legacy binary code written for various processors on any given new hardware host machine. code interpretation – dynamic binary translation - virtual instruction set architecture (V-ISA) Advantage: • It can run a large amount of legacy binary codes written for various processors on any given new hardware host machines • best application flexibility Shortcoming & limitation: • One source instruction may require tens or hundreds of native target instructions to perform its function, which is relatively slow. • V-ISA requires adding a processor-specific software translation layer in the complier. 44 2.Virtualization at Hardware Abstraction level: Hardware-level virtualization is performed right on top of the bare hardware. On the one hand, this approach generates a virtual hardware environment for aVM. On the other hand, the process manages the underlying hardware through virtualization. The idea is to virtualize a computer’s resources, such as its processors, memory, and I/O devices. The intention is to upgrade the hardware utilization rate by multiple users concurrently. Advantage: • Has higher performance and good application isolation Shortcoming & limitation: • Very expensive to implement (complexity) 45 3.Virtualization at Operating System (OS) level: OS-level virtualization creates isolated containers on a single physical server and the OS instances to utilize the hardware and software in data centers. The containers behave like real servers. OS-level virtualization is commonly used in creating virtual hosting environments to allocate hardware resources among a large number of mutually distrusting users. Advantage: • Has minimal startup/shutdown cost, low resource requirement, and high scalability; synchronize VM and host state changes. Shortcoming & limitation: • All VMs at the operating system level must have the same kind of guest OS • Poor application flexibility and isolation. 46 Virtualization at OS Level 47 Advantages of OS Extension for Virtualization 1. VMs at OS level has minimum startup/shutdown costs 2. OS-level VM can easily synchronize with its environment Disadvantage of OS Extension for Virtualization All VMs in the same OS container must have the same or similar guest OS, which restrict application flexibility of different VMs on the same physical machine. 48 4.Library Support level: Since most systems provide well-documented APIs, such an interface becomes another candidate for virtualization. Virtualization with library interfaces is possible by controlling the communication link between applications and the rest of a system through API hooks. The software tool WINE has implemented this approach to support Windows applications on top of UNIX hosts. Another example is the vCUDA which allows applications executing within VMs to leverage GPU hardware acceleration. Advantage: • It has very low implementation effort Shortcoming & limitation: • poor application flexibility and isolation 49 5.User-Application Level Virtualization at the application level virtualizes an application as a VM. On a traditional OS, an application often runs as a process. Therefore, application-level virtualization is also known as process-level virtualization. The most popular approach is to deploy high level language (HLL) VMs. In this scenario, the virtualization layer sits as an application program on top of the operating system, and the layer exports an abstraction of a VM that can run programs written and compiled to a particular abstract machine definition. Other forms of application-level virtualization are known as application isolation, application sandboxing, or application streaming. Advantage: • has the best application isolation Shortcoming & limitation: • low performance, low application flexibility and high implementation complexity. 50 Virtualization Structures/Tools and Mechanisms In general, there are three typical classes of VM architecture. Figure showed the architectures of a 51 machine before and after virtualization. Before virtualization, the operating system manages the hardware. After virtualization, a virtualization layer is inserted between the hardware and the operating system. In such a case, the virtualization layer is responsible for converting portions of the real hardware into virtual hardware. Therefore, different operating systems such as Linux and Windows can run on the same physical machine, simultaneously. Depending on the position of the virtualization layer, there are several classes of VM architectures, namely the hypervisor architecture, para-virtualization, and host-based virtualization. The hypervisor is also known as the VMM (Virtual Machine Monitor). They both perform the same virtualization operations. Hypervisor A hypervisor is a hardware virtualization technique allowing multiple operating systems, called guests to run on a host machine.This is also called the Virtual Machine Monitor (VMM). Type 1: bare metal hypervisor • sits on the bare metal computer hardware like the CPU, memory, etc. • All guest operating systems are a layer above the hypervisor. • The original CP/CMS (Control Program/Cambridge Monitor System) hypervisor developed by IBM was of this kind. Type 2: hosted hypervisor • Run over a host operating system. • Hypervisor is the second layer over the hardware. • Guest operating systems run a layer over the hypervisor. • The OS is usually unaware of the virtualization 52 The XEN Architecture Xen is an open source hypervisor program developed by Cambridge University. Xen is a micro-kernel hypervisor, which separates the policy from the mechanism. Xen does not include any device drivers natively . It just provides a mechanism by which a guest OS can have direct access to the physical devices. As a result, the size of the Xen hypervisor is kept rather small. Xen provides a virtual environment located between the hardware and the OS. 53 Binary Translation with Full Virtualization Depending on implementation technologies, hardware virtualization can be classified into two categories: full virtualization and host-based virtualization. Full virtualization does not need to modify the host OS. It relies on binary translation to trap and to virtualizes the execution of certain sensitive, non virtualizable instructions. The guest OSes and their applications consist of noncritical and critical instructions. In a host-based system, both a host OS and a guest OS are used. A virtualization software layer is built between the host OS and guest OS. These two classes ofVM architecture are introduced next. 54 55 Binary Translation of Guest OS Requests Using a VMM This approach was implemented by VMware and many other software companies. VMware puts the VMM at Ring 0 and the guest OS at Ring 1. The VMM scans the instruction stream and identified the privileged, control- and behavior sensitive instructions. When these instructions are identified, they are trapped into the VMM, which emulates the behavior of these instructions. The method used in this emulation is called binary translation. Therefore, full virtualization combines binary translation and direct execution. 56 Trap: a device or enclosure designed to catch and retain animals, typically by allowing entry but not exit or by catching hold of a part of the body. Host-Based Virtualization An alternative VM architecture is to install a virtualization layer on top of the host OS. This host OS is still responsible for managing the hardware. This host-based architecture has some distinct advantages. First, the user can install this VM architecture without modifying the host OS. The virtualizing software can rely on the host OS to provide device drivers and other low-level services. This will simplify the VM design and ease its deployment. Second, the host-based approach appeals to many host machine configurations. Compared to the hypervisor/VMM architecture, the performance of the host-based architecture may also be low 57 Para -virtualization Para -virtualization needs to modify the guest operating systems. A para-virtualized VM provides special APIs requiring substantial OS modifications in user applications. Performance degradation is a critical issue of a virtualized system. 58 Full Virtualization vs. Para-Virtualization Full virtualization Does not need to modify guest OS, and critical instructions are emulated by software through the use of binary translation. VMware Workstation applies full virtualization, which uses binary translation to automatically modify x86 software on-the-fly to replace critical instructions. Advantage: no need to modify OS. Disadvantage: binary translation slows down the performance. Para virtualization Reduces the overhead, but cost of maintaining a paravirtualized OS is high. The improvement depends on the workload. Para virtualization must modify guest OS, non-virtualizable instructions are replaced by hyper calls that communicate directly with the hypervisor orVMM. Para virtualization is supported by Xen, Denali andVMware ESX. 59 60 Virtualization of CPU, Memory, and I/O Devices CPU Virtualization A VM is a duplicate of an existing computer system in which a majority of the VM instructions are 61 executed on the host processor in native mode. Thus, unprivileged instructions of VMs run directly on the host machine for higher efficiency. Other critical instructions should be handled carefully for correctness and stability. The critical instructions are divided into three categories: privileged instructions, control –sensitive instructions, and behavior-sensitive instructions. Privileged instructions execute in a privileged mode and will be trapped if executed outside this mode. Control-sensitive instructions attempt to change the configuration of resources used. Behavior-sensitive instructions have different behaviors depending on the configuration of resources, including the load and store operations over the virtual memory. A CPU architecture is virtualizable if it supports the ability to run the VM’s privileged and unprivileged instructions in the CPU’s user mode while theVMM runs in supervisor mode. When the privileged instructions including control- and behavior-sensitive instructions of a VM are executed, they are trapped in the VMM. In this case, the VMM acts as a unified mediator for hardware access from different VMs to guarantee the correctness and stability of the whole system. However, not all CPU architectures are virtualizable. RISC CPU architectures can be naturally virtualized because all control and behavior- sensitive instructions are privileged instructions. On the contrary, x86 CPU architectures are not primarily designed to support virtualization. 62 Memory Virtualization Virtual memory virtualization is similar to the virtual memory support provided by modern operating systems. In a traditional execution environment, the operating system maintains mappings of virtual memory to ma chine memory using page tables, which is a one-stage mapping from virtual memory to machine memory. However, in a virtual execution environment, virtual memory virtualization involves sharing the physical system memory in RAM and dynamically allocating it to the physical memory of theVMs. That means a two-stage mapping process should be maintained by the guest OS and the VMM, respectively: virtual memory to physical memory and physical memory to machine memory. 63 I/O Virtualization there are three ways to implement I /O virtualization: full device emulation, para- virtualization, and direct I /O. I/O virtualization. Generally, this approach emulates well-known, real-world devices. All the functions of a device or bus infrastructure, such as device enumeration, identification, interrupts, and DMA, are replicated in software. This software is located in the VMM and acts as a virtual device. The para-virtualization method of I /O virtualization is typically used in Xen. It is also known as the split driver model consisting of a frontend driver and a backend driver. It achieves beer device performance than full device emulation, it comes with a higher CPU overhead Direct I /O virtualization lets the VM access devices directly. It can achieve close-to native performance without high CPU costs. 64 Virtual Clusters and Resource Management A physical cluster is a collection of servers (physical machines) interconnected by a physical network such as a LAN Virtual clusters are built with VMs installed at distributed servers from one or more physical clusters. The VMs in a virtual cluster are interconnected logically by a virtual network across several physical networks. Figure illustrates the concepts of virtual clusters and physical clusters. Each virtual cluster is formed with physical machines or a VM hosted by multiple physical clusters. The virtual cluster boundaries are shown as distinct boundaries. 65 The provisioning of VMs to a virtual cluster is done dynamically to have the following interesting properties : • The virtual cluster nodes can be either physical or virtual machines. Multiple VMs running with different OSes can be deployed on the same physical node. • A VM runs with a guest OS, which is often different from the host OS, that manages the resources in the physical machine, where the VM is implemented . • The purpose of using VMs is to consolidate multiple functionalities on the same server. This will greatly enhance server utilization and application flexibility . • VMs can be colonized (replicated) in multiple servers for the purpose of promoting distributed parallelism, fault tolerance, and disaster recovery. The size (number of nodes) of a virtual cluster can grow or shrink dynamically, similar to the way an overlay network varies in size in a peer-to-peer (P2P) network. The failure of any physical nodes may disable some VMs installed on the failing nodes. But the failure of VMs will not pull down the host system. 66 Virtualization for Data-Center Automation Data-center automation means that huge volumes of hardware, software, and database resources in these data centers can be allocated dynamically to millions of Internet users simultaneously, with guaranteed QoS and cost-effectiveness. Google, Yahoo!, Amazon, Microso, HP, Apple, and I BM companies have invested billions of dollars in data-center construction and automation. 67 Server Consolidation in Data Centers In data centers, a large number of heterogeneous workloads can run on servers at various times. These heterogeneous workloads can be roughly divided into two categories: Chay workloads and 2. Noninteractive workloads. Chay workloads may burst at some point and return to a silent state at some other point. A web video service is an example of this, whereby a lot of people use it at night and few people use it during the day. Noninteractive workloads do not require people’s efforts to make progress after they are submitted. High-performance computing is a typical example of this. At various stages, the requirements for resources of these workloads are dramatically different. 1. 68 Server Consolidation in Data Centers conti .. It is common that most servers in data centers are underutilized. A large amount of 69 hardware, space, power, and management cost of these servers is wasted. Server consolidation is an approach to improve the low utility ratio of hardware resources by reducing the number of physical servers. Among several server consolidation techniques such as centralized and physical Consolidation, virtualization-based server consolidation is the most powerful. Consolidation enhances hardware utilization. Many underutilized servers are consolidated into fewer servers to enhance resource utilization. Consolidation also facilitates backup services and disaster recovery. This approach enables more agile provisioning and deployment of resources. In a virtual environment, the images of the guest OSes and their applications are readily cloned and reused. The total cost of ownership is reduced. In this sense, server virtualization causes deferred purchases of new servers, a smaller data-center footprint, lower maintenance costs, and lower power, cooling, and cabling requirements. This approach improves availability and business continuity. The crash of a guest OS has no effect on the host OS or any other guest OS. It becomes easier to transfer a VM from one server to another, because virtual servers are unaware of the underlying hardware. 70 Virtual Storage Management In system virtualization, virtual storage includes the storage managed by VMMs and guest OSes. Generally, the data stored in this environment can be classified into two categories: 1. 2. VM images and Application data. The VM images are special to the virtual environment, The application data includes all other data which is the same as the data in traditional OS environments. The most important aspects of system virtualization are encapsulation and isolation. Traditional operating systems and applications running on them can be encapsulated in VMs. Only one operating system runs in a virtualization while many applications run in the operating system. System virtualization allows multipleVMs to run on a physical machine and theVMs are completely isolated. 71 To achieve encapsulation and isolation both the system software and the hardware platform, such as CPUs and chipsets, are rapidly updated. However, storage is lagging. The storage systems become the main bottleneck of VM deployment. Parallax is a distributed storage system customized for virtualization environments. Content Addressable Storage (CAS) is a solution to reduce the total size of VM images, and therefore supports a large set of VM based systems in data centers. 72 Cloud OS for Virtualized Data Centers Data centers must be virtualized to serve as cloud providers. The table summarizes four virtual lnfra structure (VI) managers and OSes. These VI managers and OSes are specially tailored for virtualizing data centers which own a large number of servers in clusters. Nimbus, Eucalyptus, and Open Nebula are all open source software available to the general public. Only vSphere 4 is a proprietary OS for cloud resource virtualization and management over data centers. 73 Trust Management in Virtualized Data Centers A VMM changes the computer architecture. It provides a layer of software between the operating systems and system hardware to create one or more VMs on a single physical platform. VMM can provide secure isolation and a VM accesses hardware resources through the control of the VMM, so the VMM is the base of the security of a virtual system. Normally, one VM is taken as a management VM to have some privileges such as creating, suspending, resuming, or deleting a VM. Once a hacker successfully enters the VMM or management VM, the whole system is in danger. 74 VM-Based Intrusion Detection Intrusion detection is used to recognize the unauthorized access. An intrusion detection system (I DS) is built on operating systems, and is based on the characteristics of intrusion actions. A typical I DS can be classified as a host-based IDS (HIDS) or a network based IDS (NIDS), depending on the data source. A HI DS can be implemented on the monitored system. When the monitored system is haacked by hackers, the HI DS also faces the risk of being haacked. A NI DS is based on the flow of network traffic which can’t detect fake actions. Virtualization-based intrusion detection can isolate guest VMs on the same hardware platform. Even someVMs can be invaded successfully; they never influence other VMs 75 References 1. 2. 3. 4. 5. 6. 7. 8. 9. 76 Kai Hwang, Geoffery C. Fox and Jack J. Dongarra, “Distributed and Cloud Computing: Clusters, Grids, Clouds and the Future of Internet”, http://klucloudseminar.weebly.com/ http://cloudcomputingnet.com/cloud-computing-deployment-models/ http://gcauble.com/products/cloud-computing/ http://www.levelcloud.net/why-levelcloud/cloud-education-center/advantages-anddisadvantages-of-cloud-computing/ www.cse.unr.edu/~mgunes/cpe401/cpe401sp12/lect15_cloud.ppt http://slideplayer.com/slide/5862268/ https://msdn.microsoft.com/en-us/magazine/dn520239.aspx http://www.thoughtsoncloud.com/2014/01/cloud-computing-defined-characteristics-servicelevels/