* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Database basics
Microsoft SQL Server wikipedia , lookup
Entity–attribute–value model wikipedia , lookup
Microsoft Jet Database Engine wikipedia , lookup
Extensible Storage Engine wikipedia , lookup
Relational model wikipedia , lookup
Open Database Connectivity wikipedia , lookup
Functional Database Model wikipedia , lookup
DATABASE EVOLUTION & ENVIRONMENT Chandra S. Amaravadi PREVIOUSLY, IN INTRODUCTION In our introduction we discussed the following: Database importance Basic data organization concepts Database concept Development cycle Example application IN THIS DISCUSSION Evolution The database approach Database architecture DBMS architecture DBMS environment DBMS EVOLUTION EVOLUTION OF DBMS DBMS evolved out of file processing environment: Introduction of business computers in 1950’s, ‘60’s Used in TP applications (file processing) Led to problems of file processing (see next) Academic formulations of DBMS concepts Standardization of DBMS concepts (by CODASYL 1971) Hierarchical and network databases FILE PROCESSING APPROACH File processing evolved from transaction environment of the 1950’s. It involves application managing its own files. Application1 File1 Application2 File2 EVOLUTION OF DBMS.. Following are the problems caused by file processing: Uncontrolled redundancy Program data dependence Program maintenance Poor data quality Inability to get reports Application backlog EVOLUTION OF DBMS.. PROBLEMS CAUSED BY FILE PROCESSING Uncontrolled redundancy – Same data is duplicated in multiple files Program data dependence – Application programs dependent on structure of the data Program maintenance – Too much effort spent in changing programs to accommodate changes in file structures. Poor data quality – Redundancy can often lead to inconsistent updates of the data, leading in turn to problems of data quality. Inability to get reports – Since each file was tied to a particular program, it was hard to get reports involving multiple files Application backlog – Managers’ requests for report led application programmers to write new applications to fulfill the request. They were typically behind in fulfilling the requests, hence the backlog. FROM FILE PROCESSING TO DBMS.. Application + Data management Files Applications Applications File handling routines DBMS Files Files EVOLUTION OF DBMS.. (FYI) Developments in DBMS include: Codd’s Paper In 1970 Ingres in 1971 Oracle in 1977 Britton Lee Founded in 1979 Sybase in 1986 System R Teradata corp. SAP EVOLUTION OF DBMS.. (FYI) THE DBMS/DB APPROACH THE DBMS/DB APPROACH Entity classes Emp. cust. Cust. emp. Enterprise Data model Data base THE DBMS/DB APPROACH Integrated conceptualization of the data (??) Organized design of database Centralized management of data Checks on data quality System controlled access Retrieve data easily (reports, queries) Application1 Application2 Application DBMS File1 File2 Files Application GETTING INFORMATION FROM A DATABASE GETTING INFORMATION IN FILE PROCESSING CROSS REFERENCE REPORT CUSTOMER ACCTS. LOANS ACCT. DATA LOAN DATA GETTING INFORMATION IN DATABASE APPROACH Data from the database can be obtained through: Reports Queries [SQL, QBE] Forms “Views” GETTING INFORMATION IN DATABASE APPROACH ACCT ACCT # NAME DT. OPENED BALANCE 8895 Moody 4/16/13 $35,000 8896 Leak 4/22/14 $300 8897 Boring 1/10/16 $2000 LID AMOUT INT. RATE BALANCE 9978 $6,000 6.0% $2,440 9979 $5,000 7.1% $5,000 9992 $1,000 8.5% $400 LOAN How can we get Acct & Loan info. in one report? GETTING INFORMATION FROM MULTIPLE TABLES Rules for multi-table Select When data is retrieved from multiple tables, the following are the rules for writing queries: 1. In the Select part, precede each attribute name by name of table e.g. Select Acct.name, Loan.LID 2. In the From part, list tables separated by commas. 3. In the Where part, equate values of common key from both tables e.g. ..where Acct.Acct# = Loan.Acct# Write a query to list Name, balance, loan amt and balance for all customers. DATABASE & DBMS ARCHITECTURE THREE-SCHEMA ARCHITECTURE An architecture for databases introduced by ANSI/SPARC* A prescription for how data should be stored (in a macro sense) Provides benefits of logical and physical independence Lacking in file processing approach *Standards Planning and Requirements Committee THE THREE FORMS OF DATA External (view) Conceptual/ Base table (schema) 11101001 Internal/ Hardware level (file organization) THE THREE FORMS OF DATA.. External/ --- The view of data as seen by a user/ application program (views). Conceptual/ --- The view as seen by a database designer (base table) Internal --- The view of data as it is stored internally These three levels provide logical and physical data independence, the ability to change the structure of the data and the ability to change the internal storage structure, independently of the application program. THREE-SCHEMA ARCHITECTURE.. External Conceptual/ Logical Base Table View1 View2 Base Table Base Table Physical/ Internal Data is organized at three levels to provide logical/physical ______ ________________. THREE-SCHEMA ARCHITECTURE.. Basic concept is that of a view A view is the way data is presented It is a subset of the data The data resides in base tables A base table contains information about an Eclass Applns. access data via views Views are created in SQL or by forms/reports THE THREE TYPES OF MODELS External Conceptual Internal Views Create view Drop view Schemas Create table Alter table File Organizations Models Create index drop index DBMS Facilities ___________ , ___________ and ____ __________ are examples of models corresponding to the three levels of the three schema architecture. MAJOR COMPONENTS OF DBMS D B M S Kernel Data Defn. SQL Prog. Language Interface Data Dictionary Screen/ Report Gen. Appln. Gen. D B M S Kernel Export/Import DBMS COMPONENTS.. Data definition – the facility through which schema is defined. (how new tables are created). SQL interface – the facility through which SQL commands are typed in. Programming language interface – the facility which processes SQL commands embedded in application program. Also known as the host language interface. Data dictionary – the facility that records details about the schema, reports, data entry forms etc. Screen & reports- the facility through which data entry screens and reports are created. Appln. Generation- the facility through which applications are created. Export/Import -- the facility through which files can be imported/exported in different DBMS formats. DBMS Kernel -- the actual programs which interact with the O/S and carry out data I/O. ODBC -Open Database Connectivity – middleware to take SQL commands & return data. THE DIFFERENT CLASSES OF USERS IN A DBMS Administrators Developers DBMS Users AN INTEGRATED DATABASE ENVIRONMENT Data Administration Legacy Applications Dir. Server Developers End Users DBMS Client Enterprise Applications DBMS/SQL Server Database COMPONENTS OF AN INTEGRATED DATABASE ENVIRONMENT Directory server- a component that stores user names & PW (aka active directory or LDAP). DBMS client- a front end to provide access to DBMS functionality. Enterprise applications- various information systems of the organization. Legacy applications – older, mainframe-based applications. DBMS Server - a DBMS that can run queries (does not have user interface, only program interface). Database- collection of information DIFFERENT TYPES OF ENTERPRISE DATABASE ENVIRONMENTS SINGLE Single tier, single user, direct DBMS access USER CLIENT Two tier, multi-user, client server SERVER CLIENT three tier, multi-user, client server with middleware Middleware SERVER DATA WAREHOUSES, OLAP & DATA MINING Internal Database OLAP Internal Database Data Warehouse Decisions Data Mining Business Intelligence External Database Data warehousing refers to the use of high speed/high capacity servers to store historical data and to make this available to decision makers. OLAP is the process of analyzing historical data on a PC using mult-dimensional databases (i.e. non-relational databases) using aggregate data operations. Data mining refers to identification of patterns from data. DISCUSSION What organization standardized DBMS concepts? Who was the chief architect of relational systems? What data-related functions were performed in the file processing approach? What are some of the basic features of a DBMS? What is the smallest unit of data in a database? Define the terms: schema, view, database, three-schema architecture. How is redundancy minimized in the database approach? What are advantages of the 3-schema architecture? What is the difference between 2-tier & 3-tier environments? REVIEW OF CONCEPTS Concept File processing Description Application programs managing their own files. Internal view The view of data as seen by a user/application program(views). The view as seen by a database designer (base table). The view of data as it is stored internally Select It is used to select data from a database. From It is used to list the tables. Where It is used to filter the results and apply conditions. Table A collection of records Schema The structure of a file or a database. Data mining Identification of patterns from data. Data warehousing Use of high speed servers to store historical data. OLAP A method of analyzing historical data External view Conceptual view REVIEW OF CONCEPTS.. Concept Description Single tier Database is directly accessed from a PC Two tier Client server architecture Three tier A middleware performs query translation Directory server A server that is used for authentication Middleware Software that performs middle-man functions Legacy applications Older mainframe applications DBMS server A server that can respond to queries Enterprise applications Newer applications of the organization