Download LM9 PowerPoints Slides

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Microsoft Access wikipedia , lookup

SQL wikipedia , lookup

Global serializability wikipedia , lookup

Microsoft SQL Server wikipedia , lookup

Entity–attribute–value model wikipedia , lookup

IMDb wikipedia , lookup

Oracle Database wikipedia , lookup

Commitment ordering wikipedia , lookup

Extensible Storage Engine wikipedia , lookup

Open Database Connectivity wikipedia , lookup

Serializability wikipedia , lookup

Ingres (database) wikipedia , lookup

Microsoft Jet Database Engine wikipedia , lookup

Functional Database Model wikipedia , lookup

Database wikipedia , lookup

Relational model wikipedia , lookup

ContactPoint wikipedia , lookup

Database model wikipedia , lookup

Clusterpoint wikipedia , lookup

Concurrency control wikipedia , lookup

Transcript
LM 9. Distributed Database
Dr. Lei Li
1
Note:
• The content of the slides including figures are mainly based on a
publicly available textbook chapter:
• http://wps.prenhall.com/wps/media/objects/3310/3390076/hoffer_c
h13.pdf
2
Learning Objectives
• Explain distributed database models
• List reasons why an enterprise would choose a
distributed database model over a centralized model
• Explain data replication and partitioning
3
Distributed Database
• A single logical database spread physically across
computers in multiple locations that are
connected by a data communications link
• Decentralized Database - A collection of
independent databases on non-networked
computers
4
Why Distributed Database?
• Business unit autonomy and distribution
• Data sharing
• Data communication reliability and costs
• Multiple application vendors
• Database recovery
• Transaction and analytic processing
5
Distributed Database Environment?
6
Distributed Database Options
• Homogeneous
•
•
•
•
Same DBMS at each node
Autonomous–independent DBMSs
Non-autonomous–central, coordinating DBMS
Easy to manage, difficult to enforce
• Heterogeneous
• Different DBMSs at different nodes
• Systems–with full or partial DBMS functionality
• Gateways–simple paths are created to other databases
without the benefits of one logical database
• Difficult to manage, preferred by independent
organizations
7
Homogeneous Distributed Database Environment
Identical DBMSs
Source: Based on Bell and Grimson, 1992.
8
Heterogeneous Distributed Database Environment
Non-identical DBMSs
Source: Based on Bell and Grimson, 1992.
9
Distributed DB Vs. Centralized DB
• Pros
• Increased reliability/availability
• Local control over data
• Modular growth
• Lower communication costs
• Faster response for certain queries
• Cons
• Software cost and complexity
• Processing overhead
• Data integrity exposure
• Slower response for certain queries
10
Options for Distributing a Database
• Data replication
• Copies of data distributed to different sites
• Horizontal partitioning
• Different rows of a table distributed to different sites
• Vertical partitioning
• Different columns of a table distributed to different sites
• Combinations of the above
11
Distributed Database Strategies
• Centralized database, distributed access
• Replication with periodic snapshot update
• Replication with near real-time synchronization of
updates
• Partitioned, one logical database
• Partitioned, independent, nonintegrated segments
12
Distributed DBMS architecture
13
Evolution of Distributed DBMS
• Distributed Unit of Work
• Different statements in a unit of work may refer to different
remote sites
• All databases in a single SQL statement must be at a single site
• Distributed Request
• A single SQL statement may refer to tables in more than one
remote site
• May not support replication transparency or failure transparency
14
15