* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Database Environment
Survey
Document related concepts
Transcript
Basic Terminologies n Database Environment n n Data are facts concerning things such as people, objects or events. Information is data that have been processed and presented in a form suitable for human interpretation, often with the purpose of revealing trends or patterns. Process involves acquisition, storage, manipulation, retrieval and distribution. DATA DB212 © Peter Lo 2004 1 Database Concept n INFORMATION DB212 © Peter Lo 2004 2 Centralized Database A database is a shared collection of logically related data, designed to meet the information needs of multiple users in an organization. FILES n n DATABASE n n record 1 record 2 record 3 record 4 ::::::: DB212 © Peter Lo 2004 Process DATA 3 All data are located at a single site in a Centralized Database Advantage: u Provide greater control over accessing and updating data. Disadvantage: u Vulnerable to failure Examples: u Personal Computer Database, Central Computer Database, Client/Server Databases DB212 © Peter Lo 2004 4 1 Distributed Database Heterogeneous Databases n n A single logical database that is spread physically across computers in multiple locations. DB212 © Peter Lo 2004 n 5 Different computers and operating systems, different data models, different DBMS may be possible. Comprise of the following conditions: u The computer operating systems used at each location are the same or highly compatible. u The data models used at each location are the same. u The DBMS used at each location are the same or highly compatible. u The data at the various locations have common definitions and formats. DB212 © Peter Lo 2004 File Processing Approach Disadvantages n n n A traditional approach to information system design. Focuses on the data processing needs of individual departments in the organization. n n n n n DB212 © Peter Lo 2004 7 6 Uncontrolled Redundancy Inconsistent Data Inflexibility Limited Data Sharing Poor Enforcement of Standards Excessive Program Maintenance DB212 © Peter Lo 2004 8 2 Uncontrolled Redundancy Inconsistent Data n n n n n Duplication of data. Valuable storage space is wasted The same data may have to be input several times to update all occurrences of a data item Inconsistencies DB212 © Peter Lo 2004 n n 9 When the same data are stored in multiple locations, inconsistencies are inevitable. Inconsistencies in stored data are one of the most common sources of errors in computer applications. They lead to inconsistent documents and reports and undermine the confidence of users in the integrity of the information systems. DB212 © Peter Lo 2004 Inflexibility Limited Data Sharing n n n Inflexible and cannot easily respond to requests for a new or redesigned product. Leads to considerable frustration on the part of the users, who cannot understand why the computer system cannot give them information in a new format when they know it exists in the applications files. DB212 © Peter Lo 2004 11 n n 10 Each application has its own private files, and users have little opportunity to share data outside of their own applications. The same data may have to be entered several times to update files with duplicate data. In developing new applications, the designer often cannot exploit data containing in existing files; instead new files are designed that duplicate much of the existing data. DB212 © Peter Lo 2004 12 3 Poor Enforcement of Standards n n Excessive Program Maintenance The responsibility for system design and operation has been decentralized. Two types of inconsistencies may result from poor enforcement of standards: Synonyms and Homonyms. u A synonym results when two different names are used for the same data item (e.g. student number and matriculation number) u A homonym is a single name that is used for two different data items (e.g. in a bank the term balance might be used to designate a checking account balance in one department and a savings account balance in a different department) DB212 © Peter Lo 2004 13 Database Approach n Emphasizes the integration and sharing of data across the organization. DB212 © Peter Lo 2004 Requirement Analysis n n Descriptions of files, records, and data items are embedded within individual application programs. Any modification to a data (such as change of data name, data format, or method of access) requires that the program (or programs) also be modified. DB212 © Peter Lo 2004 14 Benefits of the Database Approach Requirement Analysis n n n Process Design Database Design Data Design Process Design Implementation Implementation Process-driven design Data-driven design n n 15 Minimal Data Redundancy Consistency of Data Integration of Data Sharing of Data Ease of Application Development DB212 © Peter Lo 2004 16 4 Minimal Data Redundancy Consistency of Data n n n n Data files are integrated into a single, logical structure. Improve performances Aware of redundancy DB212 © Peter Lo 2004 n 17 By controlling data redundancy, we greatly reduce the opportunities for inconsistency. u For example, if each address is stored only once, we cannot have disagreement on the stored values. When controlled redundancy is permitted in the database, the database system itself should enforce consistency by updating each occurrence of a data item when a change occurs. DB212 © Peter Lo 2004 Integration of Data Sharing of Data n n n Data are organized into a single, logical structure, with logical relationships defined between associated data entities. Easy for users to relate one item of data to another. DB212 © Peter Lo 2004 19 18 Permit multiple users to share a database concurrently DB212 © Peter Lo 2004 20 5 Ease of Application Development Cost of the Database Approach n n n n Greatly reduces the cost and time for developing new business applications as programmer is relieved from the burden of designing, building, and maintaining master files. Data are independent of the application programs that use them. Either data or the application programs that use the data can be changed without necessitating a change in the other factor. DB212 © Peter Lo 2004 21 n n n New, Specialized Personnel Need for Explicit Backup Interference with Shared Data Organizational Conflict DB212 © Peter Lo 2004 New, Specialized Personnel Need for Explicit Backup n n Need to hire train individuals u To maintain the new database software u To develop and enforce new programming standards u To achieve the highest possible performance u To manage the new staff DB212 © Peter Lo 2004 n n 23 22 Restoring damaged files and in providing validity checks on crucial data. To ensure that data are accurate and available whenever needed Either database management software or additional procedures have to provide these essential capabilities. DB212 © Peter Lo 2004 24 6 Interference with Shared Data n n Organizational Conflict The concurrent access to shared data via several application programs can lead to some problems. Example: u u n When two concurrent users both want to change the same or related data, inaccurate results can occur if access to the data is not properly synchronized. When data are used exclusively for updating, different users can obtain control of different segments of the database and lock up any use of the data (deadlock). DB212 © Peter Lo 2004 25 Components of the Database Environment Data administrators Repository DB212 © Peter Lo 2004 System developers End users User interface Application programs DBMS n A shared database requires a consensus on data definitions and ownership as well as responsibilities for accurate data maintenance. Experience has shown that conflicts on how to define data, data length and coding, rights to update shared data, and associated issues are frequent and difficult managerial issues to resolve. DB212 © Peter Lo 2004 26 Components of the Database Environment n n Database 27 Repository u Centralized knowledge base containing all data definitions, screen and report formats and definitions of other organizations and system components. Database Management System (DBMS) u Commercial software system used to create, maintain and provide controlled access to the database and repository. DB212 © Peter Lo 2004 28 7 Components of the Database Environment Components of the Database Environment n n n Database u A shared collection of logically related data, designed to meet the information needs of multiple users in an organization. Application Programs u Computer programs that are used to create and maintain the database and provide information to users. DB212 © Peter Lo 2004 29 n User Interface u Languages, menu and other facilities by which users interact with various system components. Data Administrators u Persons who are responsible for the overall information resources of an organization. DB212 © Peter Lo 2004 Components of the Database Environment Review Question n 1. n System Developers u Persons such as system analysts and programmers who design new application programs. End Users u Persons throughout the organization who add, delete and modify data in the database and who request or receive information from it. DB212 © Peter Lo 2004 31 2. 3. 4. 5. 30 Discuss the characteristics of traditional file processing system. Why is the system criticized? How did the database approach eliminate problems of file processing system? Describe the components of a database system, with the aid of a diagram. Explain why data redundancy is so common in traditional application systems. Where are data definitions maintained in each of the following environment? u Traditional file processing system u Database system DB212 © Peter Lo 2004 32 8