Download Database Environment

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

IMDb wikipedia , lookup

Entity–attribute–value model wikipedia , lookup

Concurrency control wikipedia , lookup

Microsoft Jet Database Engine wikipedia , lookup

Extensible Storage Engine wikipedia , lookup

Relational model wikipedia , lookup

Database wikipedia , lookup

Clusterpoint wikipedia , lookup

Functional Database Model wikipedia , lookup

Database model wikipedia , lookup

Transcript
Basic Terminologies
n
Database Environment
n
n
Data are facts concerning things such as people, objects or
events.
Information is data that have been processed and presented
in a form suitable for human interpretation, often with the
purpose of revealing trends or patterns.
Process involves acquisition, storage, manipulation,
retrieval and distribution.
DATA
DB212 © Peter Lo 2004
1
Database Concept
n
INFORMATION
DB212 © Peter Lo 2004
2
Centralized Database
A database is a shared collection of logically related data,
designed to meet the information needs of multiple users in
an organization.
FILES
n
n
DATABASE
n
n
record 1
record 2
record 3
record 4
:::::::
DB212 © Peter Lo 2004
Process
DATA
3
All data are located at a single site in a Centralized
Database
Advantage:
u Provide greater control over accessing and updating
data.
Disadvantage:
u Vulnerable to failure
Examples:
u Personal Computer Database, Central Computer
Database, Client/Server Databases
DB212 © Peter Lo 2004
4
1
Distributed Database
Heterogeneous Databases
n
n
A single logical database that is spread physically
across computers in multiple locations.
DB212 © Peter Lo 2004
n
5
Different computers and operating systems, different data
models, different DBMS may be possible.
Comprise of the following conditions:
u The computer operating systems used at each location
are the same or highly compatible.
u The data models used at each location are the same.
u The DBMS used at each location are the same or highly
compatible.
u The data at the various locations have common
definitions and formats.
DB212 © Peter Lo 2004
File Processing Approach
Disadvantages
n
n
n
A traditional approach to information system
design.
Focuses on the data processing needs of individual
departments in the organization.
n
n
n
n
n
DB212 © Peter Lo 2004
7
6
Uncontrolled Redundancy
Inconsistent Data
Inflexibility
Limited Data Sharing
Poor Enforcement of Standards
Excessive Program Maintenance
DB212 © Peter Lo 2004
8
2
Uncontrolled Redundancy
Inconsistent Data
n
n
n
n
n
Duplication of data.
Valuable storage space is wasted
The same data may have to be input several times
to update all occurrences of a data item
Inconsistencies
DB212 © Peter Lo 2004
n
n
9
When the same data are stored in multiple
locations, inconsistencies are inevitable.
Inconsistencies in stored data are one of the most
common sources of errors in computer
applications.
They lead to inconsistent documents and reports
and undermine the confidence of users in the
integrity of the information systems.
DB212 © Peter Lo 2004
Inflexibility
Limited Data Sharing
n
n
n
Inflexible and cannot easily respond to requests
for a new or redesigned product.
Leads to considerable frustration on the part of the
users, who cannot understand why the computer
system cannot give them information in a new
format when they know it exists in the
applications files.
DB212 © Peter Lo 2004
11
n
n
10
Each application has its own private files, and
users have little opportunity to share data outside
of their own applications.
The same data may have to be entered several
times to update files with duplicate data.
In developing new applications, the designer often
cannot exploit data containing in existing files;
instead new files are designed that duplicate much
of the existing data.
DB212 © Peter Lo 2004
12
3
Poor Enforcement of Standards
n
n
Excessive Program Maintenance
The responsibility for system design and operation has
been decentralized.
Two types of inconsistencies may result from poor
enforcement of standards: Synonyms and Homonyms.
u A synonym results when two different names are used
for the same data item (e.g. student number and
matriculation number)
u A homonym is a single name that is used for two
different data items (e.g. in a bank the term balance
might be used to designate a checking account balance
in one department and a savings account balance in a
different department)
DB212 © Peter Lo 2004
13
Database Approach
n
Emphasizes the
integration and
sharing of data
across the
organization.
DB212 © Peter Lo 2004
Requirement Analysis
n
n
Descriptions of files, records, and data items are
embedded within individual application programs.
Any modification to a data (such as change of data
name, data format, or method of access) requires
that the program (or programs) also be modified.
DB212 © Peter Lo 2004
14
Benefits of the Database Approach
Requirement Analysis
n
n
n
Process Design
Database Design
Data Design
Process Design
Implementation
Implementation
Process-driven design
Data-driven design
n
n
15
Minimal Data Redundancy
Consistency of Data
Integration of Data
Sharing of Data
Ease of Application Development
DB212 © Peter Lo 2004
16
4
Minimal Data Redundancy
Consistency of Data
n
n
n
n
Data files are integrated into a single, logical
structure.
Improve performances
Aware of redundancy
DB212 © Peter Lo 2004
n
17
By controlling data redundancy, we greatly reduce the
opportunities for inconsistency.
u For example, if each address is stored only once, we
cannot have disagreement on the stored values.
When controlled redundancy is permitted in the database,
the database system itself should enforce consistency by
updating each occurrence of a data item when a change
occurs.
DB212 © Peter Lo 2004
Integration of Data
Sharing of Data
n
n
n
Data are organized into a single, logical structure,
with logical relationships defined between
associated data entities.
Easy for users to relate one item of data to another.
DB212 © Peter Lo 2004
19
18
Permit multiple users to share a database
concurrently
DB212 © Peter Lo 2004
20
5
Ease of Application Development
Cost of the Database Approach
n
n
n
n
Greatly reduces the cost and time for developing
new business applications as programmer is
relieved from the burden of designing, building,
and maintaining master files.
Data are independent of the application programs
that use them.
Either data or the application programs that use
the data can be changed without necessitating a
change in the other factor.
DB212 © Peter Lo 2004
21
n
n
n
New, Specialized Personnel
Need for Explicit Backup
Interference with Shared Data
Organizational Conflict
DB212 © Peter Lo 2004
New, Specialized Personnel
Need for Explicit Backup
n
n
Need to hire train individuals
u To maintain the new database software
u To develop and enforce new programming
standards
u To achieve the highest possible performance
u To manage the new staff
DB212 © Peter Lo 2004
n
n
23
22
Restoring damaged files and in providing validity
checks on crucial data.
To ensure that data are accurate and available
whenever needed
Either database management software or
additional procedures have to provide these
essential capabilities.
DB212 © Peter Lo 2004
24
6
Interference with Shared Data
n
n
Organizational Conflict
The concurrent access to shared data via several
application programs can lead to some problems.
Example:
u
u
n
When two concurrent users both want to change the
same or related data, inaccurate results can occur if
access to the data is not properly synchronized.
When data are used exclusively for updating, different
users can obtain control of different segments of the
database and lock up any use of the data (deadlock).
DB212 © Peter Lo 2004
25
Components of the Database
Environment
Data
administrators
Repository
DB212 © Peter Lo 2004
System
developers
End
users
User
interface
Application
programs
DBMS
n
A shared database requires a consensus on data
definitions and ownership as well as
responsibilities for accurate data maintenance.
Experience has shown that conflicts on how to
define data, data length and coding, rights to
update shared data, and associated issues are
frequent and difficult managerial issues to resolve.
DB212 © Peter Lo 2004
26
Components of the Database
Environment
n
n
Database
27
Repository
u Centralized knowledge base containing all data
definitions, screen and report formats and
definitions of other organizations and system
components.
Database Management System (DBMS)
u Commercial software system used to create,
maintain and provide controlled access to the
database and repository.
DB212 © Peter Lo 2004
28
7
Components of the Database
Environment
Components of the Database
Environment
n
n
n
Database
u A shared collection of logically related data,
designed to meet the information needs of
multiple users in an organization.
Application Programs
u Computer programs that are used to create and
maintain the database and provide information
to users.
DB212 © Peter Lo 2004
29
n
User Interface
u Languages, menu and other facilities by which
users interact with various system components.
Data Administrators
u Persons who are responsible for the overall
information resources of an organization.
DB212 © Peter Lo 2004
Components of the Database
Environment
Review Question
n
1.
n
System Developers
u Persons such as system analysts and
programmers who design new application
programs.
End Users
u Persons throughout the organization who add,
delete and modify data in the database and who
request or receive information from it.
DB212 © Peter Lo 2004
31
2.
3.
4.
5.
30
Discuss the characteristics of traditional file processing
system. Why is the system criticized?
How did the database approach eliminate problems of file
processing system?
Describe the components of a database system, with the
aid of a diagram.
Explain why data redundancy is so common in traditional
application systems.
Where are data definitions maintained in each of the
following environment?
u Traditional file processing system
u Database system
DB212 © Peter Lo 2004
32
8