* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Slide 1
Survey
Document related concepts
Transcript
Chapter 3 Data and Knowledge Management www.pearsoned.ca/jessup Robert Riordan, Carleton University 3-1 Learning Objectives 1. Describe why databases have become so important to organizations 2. Describe what databases and database management systems are and how they work Information Systems Today, 2/C/e 3-2 ©2008 Pearson Education Canada Learning Objectives 3. Explain how organizations are getting the most from their investment in database technologies 4. Describe what is meant by knowledge management and knowledge assets as well as benefits and challenges of deploying a knowledge management system Information Systems Today, 2/C/e 3-3 ©2008 Pearson Education Canada Database Technology • A collection of related data organized in a way that makes it valuable and useful • Allows organizations to retrieve, store, and analyze information easily • Is vital to an organization’s success in running operations and making decisions Information Systems Today, 2/C/e 3-4 ©2008 Pearson Education Canada Database Terminology Entities • Things we store information about. (i.e. persons, places, objects, events, etc.) • Have relationships to other entities (i.e. the entity Student has a relationship to the entity Grades in a University Student database Attributes • These are pieces of information about an entity (i.e. Student ID, Name, etc. for the entity Student) Information Systems Today, 2/C/e 3-5 ©2008 Pearson Education Canada Relationship of DBMS Concepts to Others? Information Systems Today, 2/C/e 3-6 ©2008 Pearson Education Canada Levels of a Database Management System (DBMS) Level Term Term Definitions Lowest Highest Field Individual characteristics about an ENTITY. Fields are also called attributes or columns depending on the type of DBMS Record A group of fields or attributes to describe a single instance of an ENTITY. These are also called rows depending on the DBMS File A collection of records or instances for a given ENTITY. These are also called tables, depending on the DBMS Database A collection of files or entities containing information to support a given system or a particular topic area Information Systems Today, 2/C/e 3-7 ©2008 Pearson Education Canada View of a Database Table or File Attribute (One Column) Attribute Type Record (One Row) Information Systems Today, 2/C/e 3-8 ©2008 Pearson Education Canada File Processing vs. Database Approach Summary File Processing Approach (Old School) • Storage Media: sequential tapes or files • Data: stored in long sequential files • Organization: redundant data in multiple files • Efficiency: data embedded to support processing • Updates: requires multiple updates in many files • Processing: slower query/faster processing Data Base Approach (New School-TODAY) • Storage Media: Direct Access Storage Device (DASD) • Data: stored in related tables • Organization: redundant data minimized/eliminated • Efficiency: data stored only in tables • Updates: requires few or one update for a data field • Processing: faster query/slower processing Information Systems Today, 2/C/e 3-9 ©2008 Pearson Education Canada Advantages of the Database Approach Information Systems Today, 2/C/e 3-10 ©2008 Pearson Education Canada Costs or Risks of the Database Approach Information Systems Today, 2/C/e 3-11 ©2008 Pearson Education Canada Database Systems Activities – Data Entry Employment Applications Enter Forms (Form Entry Screen) Information Systems Today, 2/C/e Example • Data is entered from paper employment applications into a form entry screen • The entry forms are designed to match the paper forms for ease of entry • The form data is processed by the entry program and then stored in the employment database (Form Entry Program) 3-12 (Employment DB) ©2008 Pearson Education Canada Database Systems Activities – Query Query – A database function that extracts and displays information from a database given selection parameters. SQL (Structured Query Language) • A language to select and extract data from a database • The industry standard language for relational databases QBE (Query by Example) • A technique that allows a user to design a query on a screen by dragging and placing the query field in their desired locations Example – Display applicants entered in the last 30 days • Query parameters are selected in the query request screen • The database program uses SQL to query and present the result (Query Request) Information Systems Today, 2/C/e (Query Program) 3-13 (Employment Query) ©2008 Pearson Education Canada Database Systems Activities – Report Report – A database function that extracts and formats information from a database for printing and presentation Report Generator • A specialized program that uses SQL to retrieve and manipulate data (aggregate, transform, or group) • Reports are designed using standard templates or can be custom generated to meet informational needs Example – Report on applicants entered in the last 30 days • Report parameters are selected in the report request screen • The database program uses SQL to query and present the result (Query Request) Information Systems Today, 2/C/e (Query Program) 3-14 (Employment Report) ©2008 Pearson Education Canada Designing Databases – Data Model Data Model • A map or diagram that represents entities and their relationships • Used by Database Administrators to design tables with their corresponding associations Example: ERD (Entity Relationship Diagram) Information Systems Today, 2/C/e 3-15 ©2008 Pearson Education Canada Designing Databases – Keys Database Keys Mechanisms used to identify, select, and maintain one or more records using an application program, query, or report Primary Key A unique attribute type used to identify a single instance of an entity Compound Primary Key A unique combination of attribute types used to identify a single instance of an entity Secondary Key An attribute that can be used to identify one or more records within a table with a given value Information Systems Today, 2/C/e 3-16 ©2008 Pearson Education Canada Designing Databases – Keys (Example) Primary Key ENTITIES - Student ID Entities are translated into Tables (Students and Grades) Secondary Key - Major Entities are joined by common attributes Compound Primary Key - Student ID - Course ID - Sec No. - Term Information Systems Today, 2/C/e 3-17 ©2008 Pearson Education Canada Designing Databases - Associations Associations • Define the relationships one entity has to another • Determine necessary key structures to access data • Come in three relationship types: - One-to-One - One-to-Many - Many-to-Many Foreign Key • An attribute that appears as a non-primary key in one entity (table) and as a primary key attribute in another entity (table) Information Systems Today, 2/C/e 3-18 ©2008 Pearson Education Canada Designing Databases - Associations Entity Relationship Diagram (ERD) • Diagramming tool used to express entity relationships • Very useful in developing complex databases Example • Each Home Stadium has a Team (One-to-One) • Each Team has Players (One-to-Many) • Each Team participates in Games • For each Player and Game there are Game Statistics Information Systems Today, 2/C/e 3-19 ©2008 Pearson Education Canada Designing Databases - Associations Information Systems Today, 2/C/e 3-20 ©2008 Pearson Education Canada Designing Databases – Associations (Example) Information Systems Today, 2/C/e 3-21 ©2008 Pearson Education Canada The Relational Model The Relational Model • The most common type of database model used today in organizations • Is a three-dimensional model compared to the traditional two-dimensional database models - Rows (first-dimension) - Columns (second-dimension) - Relationships (third-dimension) • The third-dimension makes this model so powerful because any row of data can be related to any other row or rows of data Information Systems Today, 2/C/e 3-22 ©2008 Pearson Education Canada The Relational Model - Example Information Systems Today, 2/C/e 3-23 ©2008 Pearson Education Canada The Relational Model - Normalization Normalization • A technique to make complex databases more efficient by eliminating as much redundant data as possible • Example: Database with redundant data (below) Information Systems Today, 2/C/e 3-24 ©2008 Pearson Education Canada The Relational Model - Normalization Normalized Database Information Systems Today, 2/C/e 3-25 ©2008 Pearson Education Canada The Relational Model – Data Dictionary Data Dictionary • Is a document that database designers prepare to help individuals enter data • Provides several pieces of information about each attribute in the database including: - Name - Key (is it a key or part of a key?) - Data Type (date, alphanumeric, numeric, etc.) - Valid Value (the format or numbers allowed) • Can be used to enforce Business Rules which are captured by the database designer to prevent illegal or illogical values from entering the database. (e.g. who has authority to enter certain kinds of data) Information Systems Today, 2/C/e 3-26 ©2008 Pearson Education Canada Online Transactional Processing (OLTP) Online Transactional Processing • The mechanism by which customers, suppliers, and employees process business transactions for an organization • These users conduct transactions online through internal systems and external websites for processing and storage Example Information Systems Today, 2/C/e 3-27 ©2008 Pearson Education Canada Operational vs. Informational Systems Information Systems Today, 2/C/e 3-28 ©2008 Pearson Education Canada Organizational Use of Databases Operational Informational Extract Data Extract Data Department Databases Data Warehouse • Day-to-day department transactions • Used primarily by departments • Extracted department transactions • Used for business analysis Information Systems Today, 2/C/e 3-29 Data Mart • Extracted subset of a data warehouse • Used for highly specific business analysis ©2008 Pearson Education Canada DATA WAREHOUSES, DATA MARTS, DATA MINING • Data _______ : collects business information from many sources in the enterprise • Data _______ : a subset of a data warehouse • Data _______ : an information-analysis tool for automated discovery of patterns and relationships in a data warehouse or a data mart • Online ______________ Processing -Graphical software tools that provide complex analysis of data stored in a database Information Systems Today, 2/C/e 3-31 31 Canada ©2008 Pearson Education DATA WAREHOUSES, DATA MARTS, DATA MINING • Data _______ are not transaction-oriented. Information Systems Today, 2/C/e 32 • Data _______ support online analytical processing (OLAP). 3-32 ©2008 Pearson Education Canada Online Analytical Processing (OLAP) Online Analytical Processing • Graphical software tools that provide complex analysis of data stored in a database • OLAP tools enable users to analyze different dimensions of data beyond summary and simple aggregations of normal database queries • The OLAP Server is the chief component of an OLAP system which understands how the data is organized and has special functions for analyzing data • OLAP can provide time series and trend analysis views of data, data drill-downs, and the ability to answer “what-if” and “why” questions as part of its function Information Systems Today, 2/C/e 3-33 ©2008 Pearson Education Canada Data Mining Data Mining • Is a method companies use to analyze information to better understand their customers, products, markets, or any other phase of the business for which they have data • With data mining tools you can graphically drill down, sort or extract data based on certain conditions; perform a variety of statistical analysis • Data mining applications are very powerful and use highly complex algorithms to analyze and to identify opportunities Information Systems Today, 2/C/e 3-34 ©2008 Pearson Education Canada Data Warehouse Example Information Systems Today, 2/C/e 3-35 ©2008 Pearson Education Canada Uses of Data Warehousing Information Systems Today, 2/C/e 3-36 ©2008 Pearson Education Canada Knowledge Management Definitions Knowledge Management The process an organization uses to gain the greatest value from its knowledge assets Knowledge Assets All underlying skills routines, practices, principles, formulae, methods, heuristics, and intuitions whether explicit or tacit Explicit Knowledge Anything that can be documented, archived, or codified often with the help of information systems Tacit Knowledge The processes and procedures on how to effectively perform a particular task stored in a person’s mind Information Systems Today, 2/C/e 3-37 ©2008 Pearson Education Canada Knowledge Management System (KMS) Best Practices Procedures and processes that are widely accepted as being among the most effective and/or efficient Primary Objective How to recognize, generate, store, share, manage this tacit knowledge (Best Practices) for deployment and use Technology Generally not a single technology but rather a collection of tools that include communication technologies (e.g. e-mail, groupware, instant messaging), and information storage and retrieval systems (e.g. database management system) to meet the Primary Objective Information Systems Today, 2/C/e 3-38 ©2008 Pearson Education Canada Benefits and Challenges of Knowledge Management Information Systems Today, 2/C/e 3-39 ©2008 Pearson Education Canada Roles in Database Development and Use Database Administrator (DBA) • Designs, develops and monitors performance of databases • Enforces policy and standards for data use and security Systems Programmer • Creates business applications that connect to databases • Tests the new systems and databases before use Information Systems Today, 2/C/e 3-41 Systems Analyst • Defines data requirements working with a DBA • Incorporates the database design into new program designs ©2008 Pearson Education Canada