* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download ppt version - School of Computer Science Student WWW Server
Survey
Document related concepts
Transcript
Managing Information Systems Seventh Canadian Edition Laudon, Laudon and Brabston CHAPTER 6 Databases and Information Management Copyright © 2015 Pearson Canada Inc. 6-1 Organizing Data in a Traditional File Environment File organization terms and concepts • Bit: Smallest unit of data; binary digit (0,1) • Byte: Group of bits that represents a single character • Field: Group of words or a complete number • Record: Group of related fields • File: Group of records of same type Copyright © 2015 Pearson Canada Inc. 6-2 Organizing Data in a Traditional File Environment File Organization Term and Concepts • Database: Group of related files • Entity: Person, place, thing, event about which information is maintained • Attribute: Description of a particular entity • Key field: Identifier field used to retrieve, update, sort a record Copyright © 2015 Pearson Canada Inc. 6-3 [INSERT FIGURE 6.1] Copyright © 2015 Pearson Canada Inc. 6-4 Problems with the Traditional File Environment • Data redundancy and inconsistency • Program-data dependence • Lack of flexibility • Poor security • Lack of data sharing and availability Copyright © 2015 Pearson Canada Inc. 6-5 Problems with the Traditional File Environment: Data Redundancy & Inconsistency • Data redundancy: The presence of duplicate data in multiple data files so that the same data are stored in more than one place or location • Data inconsistency: The same attribute may have different values. Copyright © 2015 Pearson Canada Inc. 6-6 Problems with the Traditional File Environment: Program-Data Dependence • The coupling of data stored in files and the specific programs required to update and maintain those files such that changes in programs require changes to the data Copyright © 2015 Pearson Canada Inc. 6-7 Problems with the Traditional File Environment: Lack of Flexibility • A traditional file system can deliver routine scheduled reports after extensive programming efforts, but it cannot deliver ad-hoc reports or respond to unanticipated information requirements in a timely fashion Copyright © 2015 Pearson Canada Inc. 6-8 Problems with the Traditional File Environment: Security & Data Sharing • Poor security • Management may have no knowledge of who is accessing or making changes to the organization’s data • Lack of data sharing and availability • Information cannot flow freely across different functional areas or different parts of the organization. Copyright © 2015 Pearson Canada Inc. 6-9 The Database Approach to Data Management Database management systems • How a DBMS solves the problems of the traditional file environment • Relational DBMS • Operations of a relational DBMS • Hierarchical and network DBMS • Object-oriented DBMS Copyright © 2015 Pearson Canada Inc. 6-10 Relational DBMS • Represents data as two-dimensional tables called relations (a group of records makes a table (file) and a group of values for the set of fields makes a record (tuple, row); a field name labels each column of the table)) • Relates data across tables based on common data element (a group of tables (files) makes a database) • Examples: Access, DB2, Oracle, MS SQL Server Copyright © 2015 Pearson Canada Inc. 6-11 [INSERT FIGURE 6.4] Copyright © 2015 Pearson Canada Inc. 6-12 Operations of a Relational DBMS • Select: Creates subset of rows that meet specific criteria • Join: Combines relational tables to provide users with information (requires a field in common between the tables being joined) • Project: Create a subset consisting of certain columns of the table; enables users to have new tables containing only relevant information Copyright © 2015 Pearson Canada Inc. 6-13 [INSERT FIGURE 6.5] Copyright © 2015 Pearson Canada Inc. 6-14 Capabilities of Database Management Systems • Data Definition Language: • Specifies structure of the database • Data Dictionary • Stores definition of data elements and their characteristics • Querying and Reporting • Data manipulation language • Structured query language (SQL) Copyright © 2015 Pearson Canada Inc. 6-15 [INSERT FIGURE 6.7] Copyright © 2015 Pearson Canada Inc. 6-16 Designing Databases • Conceptual design: Abstract model of database from a business perspective • Physical design: Detailed description of business information needs • Entity-relationship diagram: Methodology for documenting databases illustrating relationships between database entities • Normalization: Process of creating small stable data structures from complex groups of data Copyright © 2015 Pearson Canada Inc. 6-17 Business Intelligence Infrastructure Data warehouse • Stores current and historical data from many core operational transaction systems • Consolidates and standardizes information for use across enterprise, but data cannot be altered • Data warehouse system will provide query, analysis, and reporting tools Copyright © 2015 Pearson Canada Inc. 6-18 Business Intelligence Infrastructure Hadoop • Designed to handle big data • enables distributed parallel processing of huge amounts of data across inexpensive computers Copyright © 2015 Pearson Canada Inc. 6-19 Tools for Business Intelligence • Tools for consolidating, analyzing, and providing access to vast amounts of data to help users make better business decisions • Tools include: • Software for database query and reporting • Online analytical processing (OLAP) • Data mining Copyright © 2015 Pearson Canada Inc. 6-20 [INSERT FIGURE 6.13] Copyright © 2015 Pearson Canada Inc. 6-21 Data Mining • Tools for analyzing large pools of data • Find hidden patterns and infer rules to predict trends – Associations – Sequences – Classifications – Clusters – Forecasts Copyright © 2015 Pearson Canada Inc. 6-22 Text Mining and Web Mining Text Mining • Extracts key elements from large unstructured data sets (e.g., stored e-mails) Sentiment analysis • Mines text comments in an e-mail message, blog, social media conversation, or survey to detect favourable and unfavourable opinions about specific subjects Copyright © 2015 Pearson Canada Inc. 6-23 Text Mining and Web Mining Web Mining • The discovery and analysis of useful patterns and information from the World Wide Web Copyright © 2015 Pearson Canada Inc. 6-24 [INSERT FIGURE 6.14] Copyright © 2015 Pearson Canada Inc. 6-25 Managing Data Resources Establishing an information policy • Specifies the organization’s rules for sharing, disseminating, acquiring, standardizing, classifying, and inventorying information • Data administration is responsible for specific policies and procedures through which data is managed • Data governance • Database administration Copyright © 2015 Pearson Canada Inc. 6-26 Managing Data Resources Ensuring Data Quality • Data Quality Audit – Structured survey of the accuracy and completeness of data in an information system • Data cleansing – consists of activities for detecting and correcting data in an information system Copyright © 2015 Pearson Canada Inc. 6-27 Managing Information Systems Seventh Canadian Edition Laudon, Laudon and Brabston CHAPTER 6 Databases and Information Management Copyright © 2015 Pearson Canada Inc. 6-28