* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Stair Principles-Chapter 5 - University of Illinois at Chicago
Survey
Document related concepts
Transcript
CHAPTER 5 Organizing Data and Information The Hierarchy of Data Database Collection of data organized to meet users’ needs Database management system (DBMS) Software consisting of a group of programs that manipulate the database and provide an interface between the database and the application programs The Hierarchy of Data Data is generally organized in a hierarchy that begins with the smallest piece of data (a bit) and progresses through the hierarchy to a database. The Hierarchy of Data Character Basic building block of information, represented by a byte (0,1) Field A name, number, or combination of characters that describes an aspect of a business activity The Hierarchy of Data Record Collection of related fields File Collection of related records Database Collection of integrated and related files Data Entities, Attributes, and Keys Entity Generalized class of people, places, or things for which data is collected, stored, and maintained Attribute Characteristic of an entity Data item Specific value of an attribute Data Entities, Attributes, and Keys Key A field or set of fields in a record that is used to identify the record Primary key A field or set of fields that uniquely identifies the record Secondary key A field in a record that does not uniquely identify the record Keys and Attributes Key field Attributes Entities (records) The Traditional Approach to Data Management [Figure 5.3] Flaws in the Traditional Approach Data redundancy Duplication of data in separate files Data integrity The degree to which the data in any one file is accurate Program-data dependence Potential for incompatible programs and data between applications The Database Approach to Data Management Data management in which a pool of related data is shared by multiple application programs Rather than having separate data files, each application uses a collection of data that are either joined or related in the database. The Database Approach to Data Management [Figure 5.4] Advantages of the Database Approach Improved strategic use of corporate data Reduced data redundancy Improved data integrity Easier modification and updating Data and program independence Advantages of the Database Approach Better access to data and information Standardization of data access A framework for program development Better overall protection of the data Shared data and information resources Disadvantages of the Database Approach Relatively high cost of purchasing and operating a DBMS in a mainframe operating environment Specialized staff Increased vulnerability Database Considerations Content What data is to be collected at what cost? Access What data is to be provided to which users when appropriate? Database Considerations Logical structure How is the data to be arranged so that it makes sense to a given user? Physical organization Where is the data to be physically located? Types of Database Design Logical design An abstract model of how the database should be structured and arranged to meet an organization’s information needs Physical design A model of how the data will be organized and located within the database Data Modeling and Entity-Relationship Diagrams Data model A map or diagram of entities and their relationships Enterprise data modeling Data modeling done at the level of the entire organization Entity-Relationship (ER) Diagrams Diagrams that use basic graphical diagrams to show the organization of and relationships between data Relationships include: One-to-one (1:1) One-to-many (1:N) Many-to-many (N:M) An Entity-Relationship Diagram Attributes Entities [Figure 5.5] Relationship An ER diagram for a customer ordering database Database Models Hierarchical (tree) models Network models Relational models Hierarchical Database Model A model in which the data is organized in a top-down or inverted tree-like structure [Figure 5.6] Network Models An extension of the hierarchical model, in which a member may have many owners [Figure 5.7] Relational Models Data organized in tabular format (rows and columns) Relations: Two-dimensional tables into which data elements are placed Tuple: Each row of a table Attributes: Columns of the table Domain: Values for attributes or columns Relational Models [Figure 5.8] Data Manipulations Selecting Eliminating rows according to certain criteria Projecting Eliminating columns in a table Data Manipulations Joining Combining two or more tables Linking Joining tables that share at least one common data element Data Analysis and Normalization Data analysis Evaluation of data to uncover problems with the content of a database Anomalies Problems and irregularities in data Normalization Removing anomalies from a database Comparison of Database Models Hierarchical model Primary advantage: processing efficiency Network model More flexible than hierarchical models in terms of organizing data Relational database model Easier to control, more flexible, and more intuitive; by far the most widely used Database Characteristics Amount Database size depends on the number of records or files it contains Volatility A measure of the changes typically required in a given period of time Immediacy A measure of how rapidly changes must be made to data Database Management Systems Group of programs used as an interface between a database and application programs or a database and the user Classified by the type of database model they support Hierarchical Network Relational Storing and Retrieving Data Logical access path Application requests data from the DBMS Physical access path DBMS accesses a storage device to retrieve the data [Figure 5.14] Data Control Concurrency control Locks out simultaneous access to a record that is being updated or used by another program Schema The logical and physical structure of the data and relationships among the data in the database Providing a User View User view Portion of the database a user can access Subschema A file that contains a description of a subset of the database and identifies which users can perform modification on the data items in that subset Developed to create different views The Use of Schemas and Subschemas [Figure 5.15] Creating and Modifying the Database Data definition language (DDL) Collection of instructions and commands used to define and describe data and data relationships in a specific database [Figure 5.16] Creating and Modifying the Database Data dictionary A detailed description of all data used in the database [Figure 5.17] Data Dictionary Provides a standard definition of terms and data elements Assists programmers in designing and writing programs Simplifies database modifications Data Dictionary Helps achieve advantages of the database approach Reduced data redundancy Increased data reliability Faster program development Easier modification of data and information Manipulating Data and Generating Reports Data Manipulation Language (DML) Contains the commands used to manipulate the database Allows managers and other database users to access, modify, and make queries about data contained in the database to generate reports Structured Query Language (SQL) A standardized data manipulation language that has become an integral part of most relational database packages Selecting a Database Management System Begins by analyzing database needs and characteristics Performance Integration Features The vendor Cost Emerging Database Trends Distributed databases Actual data may be spread across several smaller databases connected via telecommunications devices Replicated database Holds a duplicate set of frequently used data Distributed Database HCIA p223 HCIA, Inc. uses a distributed database to provide up-to-date information to their customers. Data Warehouse A relational database management system designed specifically to support management decision making [Figure 5.21] Data Warehouse Data mart Subset of a data warehouse Brings the data warehouse concept to small and medium-size businesses On-line analytical processing (OLAP) Consists of programs used to store and deliver data warehouse information Data mining Automated discovery of patterns and relationships in a data warehouse Open Database Connectivity (ODBC) Standards that help ensure that specific software can be used with any ODBCcompliant database [Figure 5.22] Object-Oriented Databases Databases that store data as objects, which contain both the data and the processing instructions needed to complete the database transaction [Table 5.6] Image, Hypertext, and Hypermedia Databases Image databases Store data in the form of images Hypertext databases Allow users to search and manipulate alphanumeric data in an unstructured way Hypermedia databases Allow businesses to search and manipulate multimedia forms of data Spatial Data Technology Involves the use of an object-relational database Stores and accesses data according to the locations it describes Permits spatial queries and analysis Aspects of Database Administration Overall design and coordination of the database Development and maintenance of schemas and subschemas Development and maintenance of the data dictionary Implementation of the DBMS Aspects of Database Administration System and user documentation User support and training Overall operation of the DBMS Testing and maintaining the DBMS Establishing emergency or failure-recovery procedures Database Use, Policies, and Security What data should users have direct access to? Under what circumstances can data be transferred from a PC or small computer system to the large mainframe system (uploading)? Database Use, Policies, and Security Under what circumstances can data be transferred from a mainframe system to PCs or small computer system (downloading)? What procedures are needed to guarantee proper database use?