Chapter 5
Data Resource Management
McGraw-Hill/Irwin
Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved.
Learning Objectives

Explain the business value of implementing
data resource management processes and
technologies in an organization

Outline the advantages of a database
management approach to managing the data
resources of a business, compared with a file
processing approach

Explain how database management software
helps business professionals and supports the
operations and management of a business
5-2
Learning Objectives

Provide examples to illustrate the following
concepts
– Major types of databases
– Data warehouses and data mining
– Logical data elements
– Fundamental database structures
– Database development
5-3
Case 1: Cogent Communications, Intel, Others

IT integration and adoption issues can make or break
merger and acquisition activities

Companies will feel the full impact of the merger or
acquisition frenzy (undesirable state of disorder) in their
data centers

Companies with employees who hold all the knowledge
suffer greatly when those people are let go

It is important to document the knowledge from those
people and to figure out how to work with small crews

Companies should know what goes on in the data
center, in terms of systems and how they interact with
each other and the business
5-4
Case Study Questions

Place yourself in the role of a manager at a
company undergoing a merger or acquisition
– What would be the most important things
customers would expect from you while
still in that process?
– What role would IT play in meeting those
expectations?

Provide at least three examples
5-5
Case Study Questions

Focus on what Andi Mann in the case calls
“tribal knowledge”
– What do you think he means by that, and
why is it so important to this process?
– What strategies would you suggest for
companies that are faced with the extensive
presence of this issue in an acquired
organization?
5-6
Case Study Questions

Most of the discussion on the case focused
on hardware and software issues. However,
these are essentially enablers for underlying
business processes developed by each of
the companies involved
– What alternatives do companies have for
merging their business processes, and what
role would IT play in supporting those
activities?
– Pay particular attention to data management
and governance issues
5-7
Logical Data Elements
5-8
Logical Data Elements
Character
– A single alphabetic, numeric, or other symbol
Field (data item)
– Represents an attribute (quality or characteristic)
of some entity (object, person, place, event)
– Examples: salary, job title
Record
– Grouping of all the fields used to describe the
attributes of an entity
– Example: payroll records with name, SSN, pay rate
5-9
Logical Data Elements
File (table, flat file)
– Group of related records
Database
– Integrated collection of logically related
data elements
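The hierarchy of logical data elements can be sketched in a few lines of Python. This is only an illustration: the `PayrollRecord` class, the sample employees, and the `database` dictionary are hypothetical names chosen for this sketch, not part of the chapter.

```python
from dataclasses import dataclass, fields


@dataclass
class PayrollRecord:
    """A record: the group of fields that describes one entity (an employee)."""
    name: str        # field (data item)
    ssn: str         # field (data item)
    pay_rate: float  # field (data item)


# A file (table): a group of related records. The values are made up.
payroll_file = [
    PayrollRecord("Ana Lopez", "000-00-0001", 31.50),
    PayrollRecord("Ben Okoro", "000-00-0002", 28.75),
]

# A database: an integrated collection of logically related files.
database = {"payroll": payroll_file}

first = database["payroll"][0]
field_names = [f.name for f in fields(PayrollRecord)]
first_character = first.name[0]  # a character: a single symbol

print(field_names, first_character)
```

Each level contains the one below it: characters make up a field, fields make up a record, records make up a file, and related files make up a database.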
5-10
Electric Utility Database
5-11
Common Database Structures: Hierarchical
– Early DBMS structure
– Records arranged in tree-like structure
– Relationships are one-to-many
5-12
Common Database Structures: Network
– Used in some mainframe DBMS packages
– Many-to-many relationships
5-13
Common Database Structures: Relational

Most widely used structure
– Data elements are stored in tables
– Row represents a record; column is a field
– Can relate data in one file with data in another,
if both files share a common data element
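As a minimal sketch of relating two tables through a common data element, the example below uses Python's built-in `sqlite3` module. The `department` and `employee` tables and their contents are hypothetical; the shared element is the `dept_id` column.

```python
import sqlite3

# In-memory database so the sketch leaves no files behind.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE department (dept_id INTEGER PRIMARY KEY, dept_name TEXT);
    CREATE TABLE employee (emp_id INTEGER PRIMARY KEY, name TEXT, dept_id INTEGER);
    INSERT INTO department VALUES (1, 'Billing'), (2, 'Service');
    INSERT INTO employee VALUES (10, 'Ana', 1), (11, 'Ben', 2), (12, 'Cy', 1);
""")

# Each row is a record and each column is a field. The two tables are
# related because both carry the common data element dept_id.
rows = conn.execute("""
    SELECT e.name, d.dept_name
    FROM employee AS e
    JOIN department AS d ON e.dept_id = d.dept_id
    ORDER BY e.emp_id
""").fetchall()

for name, dept_name in rows:
    print(name, dept_name)
```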
5-14
Common Database Structures: Multidimensional

Variation of relational model
– Uses multidimensional structures to
organize data
– Data elements are viewed as being in cubes
– Popular for analytical databases that support
Online Analytical Processing (OLAP)
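One way to picture a data cube is as a set of cells addressed by several dimensions at once. The sketch below is an assumption-laden illustration in plain Python, not how an OLAP product stores data: the dimension names, the sales figures, and the `roll_up` helper are all hypothetical.

```python
from collections import defaultdict

# Each cell of the cube is addressed by (product, region, quarter).
DIMENSIONS = ("product", "region", "quarter")
sales_cube = {
    ("widgets", "east", "Q1"): 120,
    ("widgets", "west", "Q1"): 80,
    ("gadgets", "east", "Q1"): 50,
    ("widgets", "east", "Q2"): 140,
}


def roll_up(cube, keep):
    """Sum the cube over every dimension that is not named in `keep`."""
    positions = [DIMENSIONS.index(d) for d in keep]
    totals = defaultdict(int)
    for key, value in cube.items():
        totals[tuple(key[i] for i in positions)] += value
    return dict(totals)


by_product = roll_up(sales_cube, ["product"])
by_region_quarter = roll_up(sales_cube, ["region", "quarter"])
print(by_product)
print(by_region_quarter)
```

Viewing the same figures by product, or by region and quarter, is the kind of question analytical processing is meant to answer quickly.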
5-15
Multidimensional Model
5-16
Common Database Structures: Object-Oriented
Source: Adapted from Ivar Jacobson, Maria Ericsson, and Agneta Jacobson, The Object Advantage: Business Process
Reengineering with Object Technology (New York: ACM Press, 1995), p. 65.
Copyright © 1995, Association for Computing Machinery. By permission.
5-17
Evaluation of Database Structures
Hierarchical
– Works for structured, routine transactions
– Can’t handle many-to-many relationships
Network
– More flexible than hierarchical
– Unable to handle ad hoc requests
Relational
– Easily responds to ad hoc requests
– Easier to work with & maintain
– Not as efficient or quick as hierarchical or network
5-18
Database Development
Database Administrator (DBA)
In charge of enterprise-wide database development
Improves integrity and security of organizational databases
Uses Data Definition Language (DDL) to develop and
specify data content, relationships, and structure
Stores these specifications in a data dictionary
or metadata repository
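A small sketch of DDL in practice, using Python's built-in `sqlite3` module. The `customer` and `meter_reading` tables are hypothetical. SQLite's `sqlite_master` table and `PRAGMA table_info` are used here as a stand-in for a data dictionary, since they hold the stored definitions; a full data dictionary product would record much more.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# DDL statements specify data content, relationships, and structure.
conn.executescript("""
    CREATE TABLE customer (
        customer_id INTEGER PRIMARY KEY,
        name        TEXT NOT NULL
    );
    CREATE TABLE meter_reading (
        reading_id  INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL REFERENCES customer(customer_id),
        kwh         REAL NOT NULL
    );
""")

# The DBMS keeps these specifications as metadata that can be queried.
tables = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
]
# PRAGMA table_info yields (cid, name, type, notnull, dflt_value, pk).
columns = [(row[1], row[2]) for row in conn.execute("PRAGMA table_info(meter_reading)")]

print(tables)
print(columns)
```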
5-19
Data Dictionary
Data dictionary
– Contains data about data (metadata)
– Relies on specialized software component
to manage a database of data definitions
– Can be active or passive
Contains information on…
– Names and descriptions of all types of data
records and their interrelationships
– Requirements for end users’ access and
use of applications
– Database maintenance
– Security
5-20
Database Development
5-21
Entity Relationship Diagram
5-22
Logical and Physical Database Views
5-23
Data Resource Management

Data resource management is a managerial
activity
– Uses data management, data warehousing,
and other IS technologies
– Manages data resources to meet the
information needs of business stakeholders
5-24
Case 2: Applebee’s, Travelocity, and Others

Applebee’s
– Uses data for basic business decisions, such as
replenishing food supplies based on how much
finished product was sold daily
– Developing more sophisticated analyses that look
at how well items are selling
– This will help the company make better decisions about
what to order and what products to promote

Today, organizations extensively aggregate and
mine their data to make better decisions
– Travelocity mined 600,000 comments so it could
better monitor and respond to customer issues
5-25
Case Study Questions

What are the business benefits of taking the time
and effort required to create and operate data
warehouses such as those described in the case?
– Do you see any disadvantages?
– Is there any reason why all companies shouldn’t
use data warehousing technology?

Applebee’s noted some of the unexpected insights
obtained from analyzing data about “back-of-house”
performance
– Using your knowledge of how a restaurant works,
what other interesting questions would you suggest
to the company?
5-26
Case Study Questions

Data mining and warehousing technologies use
data about past events to inform better decision
making in the future
– Do you believe this stifles innovative thinking,
causing companies to become too constrained
by the data they are already collecting to think
about unexplored opportunities?
5-27
Types of Databases
5-28
Operational Databases
Store the detailed data needed to support
business processes and operations
Also called subject area databases (SADB),
transaction databases, and
production databases
Database examples:
customer, human resource, inventory
5-29
Distributed Databases
Distributed databases are copies or parts of
databases stored on servers at multiple locations
Advantages
– Protection of valuable data
– Data can be distributed into smaller databases
– Each location has control of its local data
– All locations can access any data, anywhere
– Improved database performance at worksites
Disadvantages
– Maintaining data accuracy
5-30
Distributed Databases
Replication
Look at each distributed database and find changes
Apply changes to each distributed database
Very complex
Duplication
One database is master
Duplicate the master after hours, in all locations
Easier to accomplish
Requires extra computing power & bandwidth
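The difference between the two update methods can be sketched with plain Python dictionaries standing in for the site databases. Everything here is hypothetical: the site names, the records, and especially the conflict rule, which assumes each value is a (version, data) pair and the highest version wins. Real products handle conflicts in more elaborate ways.

```python
def replicate(sites):
    """Replication: find the changes at every site, then apply them to all sites."""
    merged = {}
    for records in sites.values():
        for key, (version, data) in records.items():
            # Assumed rule: keep whichever copy has the highest version number.
            if key not in merged or version > merged[key][0]:
                merged[key] = (version, data)
    for name in sites:
        sites[name] = dict(merged)
    return merged


def duplicate(master, sites):
    """Duplication: overwrite every site with a full copy of the master."""
    for name in sites:
        sites[name] = dict(master)


sites = {
    "boston": {"c1": (1, "old address")},
    "denver": {"c1": (2, "new address"), "c2": (1, "new customer")},
}
merged = replicate(sites)
after_replication = {name: dict(records) for name, records in sites.items()}

master = {"c1": (3, "master address")}
duplicate(master, sites)
print(after_replication)
print(sites)
```

Replication has to inspect and reconcile every site, which is where the complexity comes from. Duplication skips the comparison entirely, at the cost of copying the whole master to every location.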
5-31
External Databases
Databases available for a fee from the Web,
or from commercial online services
– Hypermedia databases
– Statistical databases
– Bibliographic and full-text databases
Search engines like Google or Yahoo
are external databases
5-32
Components of Web-Based System

A hypermedia database contains
– Hyperlinked pages of multimedia
– Interrelated hypermedia page elements,
rather than interrelated data records
5-33
Data Warehouses
Stores static data that has been extracted from
other databases in an organization
Central source of data that has been cleaned,
transformed, and cataloged
Data is used for data mining, analytical processing,
analysis, research, decision support
Data warehouses may be divided into data marts
Subsets of data that focus on specific aspects of a company
(department or process)
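A rough sketch of the extract, clean, and transform steps, followed by carving a data mart out of the warehouse. The sample rows, the cleaning rules, and the `clean_and_transform` helper are assumptions made for illustration only.

```python
# Hypothetical rows extracted from an operational sales database.
operational_sales = [
    {"store": " Boston ", "dept": "grill", "item": "burger", "amount": "12.50"},
    {"store": "boston", "dept": "bar", "item": "soda", "amount": "2.00"},
    {"store": "Denver", "dept": "grill", "item": "burger", "amount": None},
]


def clean_and_transform(rows):
    """Drop incomplete rows, normalize store names, convert amounts to numbers."""
    warehouse = []
    for row in rows:
        if row["amount"] is None:
            continue  # cleaning: skip records with a missing amount
        warehouse.append({
            "store": row["store"].strip().title(),  # transform: one spelling per store
            "dept": row["dept"],
            "item": row["item"],
            "amount": float(row["amount"]),
        })
    return warehouse


warehouse = clean_and_transform(operational_sales)

# A data mart: the subset of the warehouse that one department cares about.
grill_mart = [row for row in warehouse if row["dept"] == "grill"]
print(len(warehouse), grill_mart)
```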
5-34
Data Warehouse Components
5-35
Applications and Data Marts
5-36
Data Mining
5-37
Traditional File Processing
Data are organized, stored, and processed
in independent files
Each business application uses specialized data
files containing specific types of data records
Problems
Data redundancy
Lack of data integration
Data dependence (files, storage devices, software)
Lack of data integrity or standardization
5-38
Traditional File Processing - Banks
5-39
Database Management Approach
The foundation of modern methods
of managing organizational data
– Consolidates data records, formerly in
separate files, into databases
– Data can be accessed by many different
application programs
– A database management system (DBMS)
is the software interface between users
and databases
5-40
Database Management Approach
5-41
Database Management System
In mainframe and server computer systems,
database management software is used to…
Create new databases
and database applications
Maintain the quality of the data
in an organization’s databases
Use the databases of an organization to
provide the information needed by end users
5-42
Common DBMS Software Components
5-43
Database Management System

Database Development
– Defining and organizing the content, relationships,
and structure of the data needed to build a database

Database Application Development
– Using DBMS to create prototypes of queries, forms,
reports, Web pages

Database Maintenance
– Using transaction processing systems and other
tools to add, delete, update, and correct data
5-44
DBMS Major Functions
5-45
Database Interrogation
End user makes DBMS query
– No programming required
Query language
– Immediate response to ad hoc data requests
Report generator
– Quickly specify a format for information you
want to present as a report
Response is a video display or a printed report
5-46
Database Interrogation

SQL Queries
– Structured, international standard query
language found in many DBMS packages
– Query form is SELECT…FROM…WHERE…
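The SELECT…FROM…WHERE form can be tried out with Python's built-in `sqlite3` module. The `employee` table and its rows are hypothetical sample data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee (name TEXT, job_title TEXT, salary REAL)")
conn.executemany(
    "INSERT INTO employee VALUES (?, ?, ?)",
    [
        ("Ana", "Analyst", 62000),
        ("Ben", "Clerk", 41000),
        ("Cy", "Manager", 88000),
    ],
)

# SELECT names the fields wanted, FROM names the table,
# and WHERE states the condition a record must meet.
rows = conn.execute(
    "SELECT name, salary FROM employee WHERE salary > ? ORDER BY salary DESC",
    (50000,),
).fetchall()

for name, salary in rows:
    print(name, salary)
```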
5-47
Database Interrogation
Graphical and Natural Queries
It is difficult to correctly phrase SQL and
other database language search queries
Most DBMS packages offer easier-to-use
point-and-click methods
Translates queries into SQL commands
Natural language query statements are
similar to conversational English
5-48
Microsoft Query Wizard
5-49
Database Maintenance

Accomplished by transaction processing
systems and other applications, with the
support of the DBMS
– Done to reflect new business transactions
and other events
– Updating and correcting data, such as
customer addresses
5-50
Application Development
Use DBMS software development tools
to develop custom application programs
Not necessary to develop detailed data-handling
procedures using conventional programming languages
Can include data manipulation language
(DML) statements that call on the DBMS to
perform necessary data handling
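As a sketch of an application program that hands its data handling to the DBMS, the Python functions below issue DML statements (INSERT, UPDATE, DELETE) through the built-in `sqlite3` module. The `customer` table and the function names are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT, address TEXT)"
)


def add_customer(conn, customer_id, name, address):
    """Record a new customer."""
    with conn:  # commits on success, rolls back if an error is raised
        conn.execute(
            "INSERT INTO customer VALUES (?, ?, ?)", (customer_id, name, address)
        )


def change_address(conn, customer_id, new_address):
    """Correct a customer's address and return the number of rows updated."""
    with conn:
        cursor = conn.execute(
            "UPDATE customer SET address = ? WHERE customer_id = ?",
            (new_address, customer_id),
        )
    return cursor.rowcount


def remove_customer(conn, customer_id):
    """Delete a customer record."""
    with conn:
        conn.execute("DELETE FROM customer WHERE customer_id = ?", (customer_id,))


add_customer(conn, 1, "Ana Lopez", "12 Elm St")
add_customer(conn, 2, "Ben Okoro", "9 Oak Ave")
updated = change_address(conn, 1, "40 Pine Rd")
remove_customer(conn, 2)
remaining = conn.execute("SELECT customer_id, name, address FROM customer").fetchall()
print(updated, remaining)
```

The program never opens a file or searches for a record itself. It states what should change, and the DBMS carries out the storage and retrieval.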
5-51
Case 3: Amazon, eBay, and Google

Amazon’s data vault
– Product descriptions
– Prices
– Sales rankings
– Customer reviews
– Inventory figures
– Countless other layers of content
10 years & $1 billion to build
5-52
Case 3: Amazon, eBay, and Google

Amazon opened its data vault in 2002
– 65,000 developers, businesses, and
entrepreneurs have tapped into it
– Many have become business partners

eBay opened its $3 billion databases in 2003
– 15,000 developers and others have registered
to use it and to access software features
– 1,000 new applications have appeared
– 41 percent of eBay’s listings are uploaded to
the site using these resources
5-53
Case 3: Amazon, eBay, and Google

Google recently unlocked access to its
desktop and paid-search products
– Dozens of Google-driven services cropped up
– Developers can grab 1,000 search results a
day for free; anything more requires
permission
– In 2005, the AdWords paid-search service
was opened to outside applications
5-54
Case Study Questions

What are the business benefits to Amazon
and eBay of opening up some of their
databases to developers and entrepreneurs?
– Do you agree with this strategy?

What business factors are causing Google
to move slowly in opening up its databases?
– Do you agree with its go-slow strategy?
5-55
Case Study Questions

Should other companies follow Amazon’s
and eBay’s lead and open up some of their
databases to developers and others?
– Defend your position with an example of
the risks and benefits to an actual company
5-56