* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Chapter 1
Extensible Storage Engine wikipedia , lookup
Entity–attribute–value model wikipedia , lookup
Oracle Database wikipedia , lookup
Microsoft Jet Database Engine wikipedia , lookup
Open Database Connectivity wikipedia , lookup
Concurrency control wikipedia , lookup
Functional Database Model wikipedia , lookup
Relational model wikipedia , lookup
Clusterpoint wikipedia , lookup
Chapter 1 THE WORLDS OF DATABASE SYSTEMS Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 1 CS157A (1) DBMS is Oracle Class Demo 8/29 and 9/5 Be in Team 8/29; Oracle installation ==== skip========== (3) Data Clustering means Semantically related data should be stored in nearby secondary storage (4) File is data structure on 2ndary storage Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 2 The Worlds of Database Systems 1.1 The Evolution of Database Systems 1.2 Overview of a Database Management System 1.3 Outline of Database-System Studies 1.4 References for Chapter 1 Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 3 The Worlds of Database Systems 1) Databases are involved with almost every business in the world. 2) Almost any website has a database behind the scene that serving up the information you request. Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 4 The Worlds of Database Systems 3) Corporations maintain all their important records in databases. 4) The power of databases comes from a powerful software that has developed over several decades and is called a Database Management System or DBMS. Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 5 Section 1.1 THE EVOLUTION OF DATABASE SYSTEMS Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 6 1.1 The Evolution of Database Systems What is a database? Database is a collection of information that exists over a long period of time, even many years. (TYLIN: when it starts, it never ends until dies) The term database refers to a collection of data that is managed by a DBMS. What the DBMS's do? Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 7 1.1 The Evolution of Database Systems (cont'd) A DBMS is expected to: 1. Allow users to create new databases by declaring the logical structure of the data (schema) using a specialized language called Data Definition Language (DDL). 2. Give users the ability to query (a question about the data) the data, modify the data using a specialized language called Data Manipulation Language (DML). 3. Support the storage of huge amount of data using very efficient access methods. Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 8 1.1 The Evolution of Database Systems (cont'd) A DBMS is expected to: (cont'd) 4. Enable durability, the recovery of the data in the case of failures. 5. Control access to data from many users concurrently without any unexpected interactions (called isolation) Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 9 1.1 The Evolution of Database Systems 1.1.1 Early Database Management Systems 1.1.2 Relational Database Systems 1.1.3 Smaller and Smaller Systems 1.1.4 Bigger and Bigger Systems 1.1.5 Information Integration Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 10 1.1.1 Early Database Management Systems The first DBMS's appeared in the late 1960's. These systems evolved from file systems that could just store large amount of data over a long period of time. They did not support the requirements we counted in previous slides. Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 11 1.1.1 Early Database Management Systems (cont'd) The first important applications of DBMS's are: Banking systems Airline reservation systems Corporate record keeping The early DBMS's used several different data models like 'hierarchical' or tree-based model and 'network' or graph-based model. These early DBMS's did not support a highlevel query language. Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 12 1.1.2 Relational Database Systems Relational Model (RM) was born in 1970 by a famous paper written by Ted Codd. TYLin: David Hsiao Column based RM Codd proposed a new two dimensional (table) organization of data, which in pure mathematics is called relation. Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 13 1.1.2 Relational Database Systems In this new model, the programmers were not involved with the storage structure. Queries could be expressed in a very high-level language. By 1990, relational database systems were the norm. Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 14 1.1.3 Smaller and Smaller Systems Originally, DBMS's were large, expensive software running on large computers. The size was necessary because storing a gigabyte of data required a large computer. But today, hundreds of gigabytes fit on a single disk and we can put it on a laptop! Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 15 1.1.3 Smaller and Smaller Systems Another important trend (may not be there any more) is the use of documents using XML (eXtensible Modeling Language) . (In CS267) Large collections of small documents can serve as a database, and methods of querying and manipulating them are different. Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 16 1.1.4 Bigger and Bigger Systems A gigabyte is not much data anymore! Corporate routinely use terabytes(1012 bytes) and petabytes (1015 bytes) of data storage. Here are some examples: Google holds petabytes of data for its crawler of the Web. Satellites send down petabytes of information. Amazon keeps millions of products' picture and info. YouTube keeps millions of movies. And so forth ...! Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 17 1.1.5 Information Integration Consider a large company with many divisions. Each division has its own database for its products and employees independent of other divisions. How can we integrate the information? Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 18 1.1.5 Information Integration One popular approach is the creation of data warehousing where information from many legacy databases is copied periodically. Another approach is the implementation of a middleware to integrate and translate data. Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 19 Section 1.2 OVERVIEW OF A DATABASE MANAGEMENT SYSTEM Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 20 1.2 Overview of a Database Management System 1.2.1 Data-Definition Language Commands 1.2.2 Overview of Query Processing 1.2.3 Storage and Buffer Management 1.2.4 Transaction Processing 1.2.5 The Query Processor Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 21 1.2.1 Data-Definition Language Commands Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 22 1.2.2 Overview of Query Processing Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 23 1.2.3 Storage and Buffer Management Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 24 1.2.4 Transaction Processing Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 25 1.2.5 The Query Processor Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 26 Section 1.3 OUTLINE OF DATABASE-SYSTEM STUDIES Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 27 1.3 Outline of Database-System Studies Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 28 Section 1.4 REFERENCES FOR CHAPTER 1 Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 29 References for Chapter 1 Dr. T. Y. Lin | SJSU | CS 157A | Fall 2011 30