Download Databases and Managing Information

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Entity–attribute–value model wikipedia , lookup

Expense and cost recovery system (ECRS) wikipedia , lookup

Big data wikipedia , lookup

Data Protection Act, 2012 wikipedia , lookup

Data center wikipedia , lookup

Database wikipedia , lookup

Data model wikipedia , lookup

Data analysis wikipedia , lookup

Clusterpoint wikipedia , lookup

3D optical data storage wikipedia , lookup

Data vault modeling wikipedia , lookup

Information privacy law wikipedia , lookup

Database model wikipedia , lookup

Business intelligence wikipedia , lookup

Transcript
Chapter 6



Problems of managing Data Resources in a
Traditional File Environment
Effective IS provides user with Accurate,
timely and relevant information
Many businesses do not have this kind of
information
A computer system organizes data in a
hierarchy that starts with bits and bytes and
progresses to fields, records, files, and
databases.
Traditional File Environment- some terms:
 Bit- smallest
 Byte- 8 bits multiple
 Field- column
 Record- row
 Entity- objective, organization
 Attribute- characteristics, age
 File- table
 Database- multiple files related- RDBMS
Management Information Systems
Chapter 6 Foundations of Business Intelligence: Databases
and Information Management
The Data Hierarchy
A computer system
organizes data in a
hierarchy that starts
with the bit, which
represents either a 0
or a 1. Bits can be
grouped to form a
byte to represent one
character, number, or
symbol. Bytes can be
grouped to form a
field, and related
fields can be
grouped to form a
record. Related
records can be
collected to form a
file, and related files
can be organized
into a database.
Figure 6-1
Organizing Data in a Traditional File Environment
The use of a traditional approach to file processing encourages each functional area in a corporation to
develop specialized applications and files. Each application requires a unique data file that is likely to be a
subset of the master file. These subsets of the master file lead to data redundancy and inconsistency,
processing inflexibility, and wasted storage resources.
Figure 6-2
Organization has different departments. Each
developed their own systems and files
 Data redundancy and inconsistency:
It is the presence of duplicate data in multiple data files
so that the same data are stopped in more than place
or location. Data redundancy occurs when different
groups in an organization independently collect the
same piece of data and store it independently of each
other.
 Program data dependence:
It is the coupling of data stored in files and the specific
programs required to update and maintain those files
such that changes in programs required to update
and maintain those files such that changes in
programs require changes to the data
Lack of flexibility:
A traditional file system can deliver routine
scheduled reports after existence
programming efforts


Poor security:
• Lack of control, widespread
• Who is accessing and making changes
Lack of Data sharing and
Availability
• Information cannot be related
• Impossible to share or access in a
timely manner.


Collection of data organized to serve many applications
efficiently by centralizing the data and controlling the
redundant data.
Database management system:
Is software that permits an organization to centralize data, manage them efficiently
and provides access to the stored data by application programs.
•
•
•
•
Interfaces between application programs and physical data files
Separates logical and physical views of data
Solves problems of traditional file environment
Controls redundancy by minimizing isolated files in which same data
is repeated.
• Eliminated inconsistency
• Enables central management and security
DATA DEFINITION CAPABILITY, DATA
DICTIONARY CAPABILITY AND DATA
MANPULATION LANGUAGE (SQL)
 Software that permits an organization to
centralize data, manages them efficiently,
and provides access to the stored data by
application programs.
 Separates the Logical and physical view of the
data
The Database Approach to Data Management
Human Resources Database with Multiple Views
A single human resources database provides many different views of data, depending on the information
requirements of the user. Illustrated here are two possible views, one of interest to a benefits specialist and
one of interest to a member of the company’s payroll department.
Figure 6-3
Relational DBMS
 Most popular type of DBMS
 Represents data into two dimensional tables
called relations
 Concept of primary and foreign key
 RDBMS have three basic operations: SELECT,
JOIN, PROJECT
 QUERYING AND REPORTING

The Database Approach to Data Management
Relational Database Tables
A relational
database organizes data in the form of two-dimensional tables.
Illustrated here are tables for the entities SUPPLIER and PART showing how
they represent each entity and its attributes. Supplier_Number is a primary key
for the SUPPLIER table and a foreign key for the PART table.
Figure 6-4A
The Database Approach to Data Management
Relational Database Tables (cont.)
Figure 6-4B



Databases that can store drawings, images,
videos, photographs and voices along with
the structured numbers and characters.
Relatively slow than RDBMS
HYBRID OODBMS- it has the capabilities of
RDBMS and OODBMS
PRINCIPAL TOOLS AND TECHNOLOGIES FOR
ACCESSING INFORMATION FROM DATABASE TO
IMPROVE BUSINESS PERFORMANCE AND DECISION
MAKING
 Businesses use their databases to keep track of
basic transactions such as paying employees and
suppliers, processing orders, keeping track of
customers etc;
 In large databases from separate functions large
amount of data is stored and there are special
capabilities and tools for analyzing large amount
of data and for accessing this data
These capabilities include:
 Data warehousing
 Data mining
 Tools for accessing internal databases
through the web
DBMS includes tools for accessing and
manipulating information in databases. Most
DBMS have a specialized language called a
data manipulation language, that is used to
add, change, delete, and retrieve the data in
the database.
Database design also considers how the data
are to be distributed. Information systems
can be designed with a centralized database
that is used by a single central processor or
by multiple processors in a client/ server
network.
The Database Approach to Data Management
Distributed Databases
There are alternative ways of distributing a database. The central database can be partitioned (a) so that each remote
processor has the necessary data to serve its own local needs. The central database also can be replicated (b) at all
remote locations.
Figure 6-12

Businesses use their databases to keep track
of basic transactions, such as paying
suppliers, processing orders, keeping track of
customers and paying employees.
If data is required from:
 Different departments
 Should be complete
 Updated
 Past Data
Then data warehousing solves your problems
DATA WAREHOUSE CONSOLIDATES AND
STANDARDIZES INFORMATION FROM
DIFFERENT OPERATIONAL DATABASES SO
THAT THE INFORMATION CAN BE USED
ACROSS THE ENTERPRISE FOR MANAGEMENT
ANALYSIS AND DECISION MAKING
Data can be accessed but cannot be altered
Using Databases to Improve Business Performance and
Decision Making
Components of a Data Warehouse
The data warehouse extracts current and historical data from multiple
operational systems inside the organization. These data are combined
with data from external sources and reorganized into a central database
designed for management reporting and analysis. The information
directory provides users with information about the data available in
the warehouse.
Figure 6-13


Data marts focuses on single subject area
online of business
Decentralized Subsets of data warehouses
which is highly focused or summarized. It is
placed in separate database for a specific
population of the users



Once data has been captured and organized
it is available for further analysis
Several tools are available for analyzing data
to see new patterns, relationships and
insights.
The tools for consolidating analyzing and
providing access to vast amount of data to
help users make better decisions are often
referred as “Business Intelligence (BI)”
Using Databases to Improve Business Performance and
Decision Making
Business Intelligence
A series of analytical tools
works with data stored in
databases to find patterns
and insights for helping
managers and employees
make better decisions to
improve organizational
performance.
Figure 6-14



Software for querying and reporting
Tools for multidimensional data analysis
(Online analytical processing OLAP)
Data mining


Lets assume you sell four products in three
different regions of the country?
If you want an answer for a simple question
like “how many of the Product A was sold in
the last quarter?”. Simple query would answer
the question but lets say the question is “how
many of product A was sold in each of your
sales region and compare the actual and
projected sales?”

For that you will need OLAP which will enable
user to se same data in different ways
Using Databases to Improve Business Performance and
Decision Making
Multidimensional Data Model
The view that is showing is
product versus region. If
you rotate the cube 90
degrees, the face that will
show is product versus
actual and projected sales.
If you rotate the cube 90
degrees again, you will see
region versus actual and
projected sales. Other views
are possible.
Figure 6-15




Data mining is the process of extracting patterns
from data. Data mining is becoming an
increasingly important tool to transform this data
into information.
It is commonly used in a wide range of profiling
practices, such as marketing, surveillance, fraud
detection and scientific discovery
Data mining can be used to uncover patterns in
data but is often carried out only on samples of
data
The mining process will be ineffective if the
samples are not a good representation of the
larger body of data

Data mining cannot discover patterns that
may be present in the larger body of data if
those patterns are not present in the sample
being “mined”. Inability to find patterns may
become a cause for some disputes between
customers and service providers. Therefore,
data mining is not foolproof, but may be
useful if sufficiently representative data
samples are collected
Database and the Web
 If you have used web to order something
online then you are using a website which is
linked to a database
 There are number of advantages in using the
web to access an organization internal
databases
WEB BROWSER SOFTWARE IS MUCH EASIER TO
HANDLE


Establishing an Information Policy:
An information policy specifies the
organization’s rules for sharing,
dissemination, acquiring, standardizing,
classifying and inventorying information.
Information policy lays out specific
procedures and accountabilities, identifying
which users and organizational units can
share information, where information can be
distributed and who is responsible or
updating and maintaining the information.

A well-designed database and information
policy will go a long way towards ensuring
that the business has the information it
needs. However, additional steps must be
taken to ensure that the data in
organizational databases are accurate and
remain reliable.