* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download LM9 PowerPoints Slides
Microsoft Access wikipedia , lookup
Global serializability wikipedia , lookup
Microsoft SQL Server wikipedia , lookup
Entity–attribute–value model wikipedia , lookup
Oracle Database wikipedia , lookup
Commitment ordering wikipedia , lookup
Extensible Storage Engine wikipedia , lookup
Open Database Connectivity wikipedia , lookup
Serializability wikipedia , lookup
Ingres (database) wikipedia , lookup
Microsoft Jet Database Engine wikipedia , lookup
Functional Database Model wikipedia , lookup
Relational model wikipedia , lookup
ContactPoint wikipedia , lookup
Database model wikipedia , lookup
LM 9. Distributed Database Dr. Lei Li 1 Note: • The content of the slides including figures are mainly based on a publicly available textbook chapter: • http://wps.prenhall.com/wps/media/objects/3310/3390076/hoffer_c h13.pdf 2 Learning Objectives • Explain distributed database models • List reasons why an enterprise would choose a distributed database model over a centralized model • Explain data replication and partitioning 3 Distributed Database • A single logical database spread physically across computers in multiple locations that are connected by a data communications link • Decentralized Database - A collection of independent databases on non-networked computers 4 Why Distributed Database? • Business unit autonomy and distribution • Data sharing • Data communication reliability and costs • Multiple application vendors • Database recovery • Transaction and analytic processing 5 Distributed Database Environment? 6 Distributed Database Options • Homogeneous • • • • Same DBMS at each node Autonomous–independent DBMSs Non-autonomous–central, coordinating DBMS Easy to manage, difficult to enforce • Heterogeneous • Different DBMSs at different nodes • Systems–with full or partial DBMS functionality • Gateways–simple paths are created to other databases without the benefits of one logical database • Difficult to manage, preferred by independent organizations 7 Homogeneous Distributed Database Environment Identical DBMSs Source: Based on Bell and Grimson, 1992. 8 Heterogeneous Distributed Database Environment Non-identical DBMSs Source: Based on Bell and Grimson, 1992. 9 Distributed DB Vs. Centralized DB • Pros • Increased reliability/availability • Local control over data • Modular growth • Lower communication costs • Faster response for certain queries • Cons • Software cost and complexity • Processing overhead • Data integrity exposure • Slower response for certain queries 10 Options for Distributing a Database • Data replication • Copies of data distributed to different sites • Horizontal partitioning • Different rows of a table distributed to different sites • Vertical partitioning • Different columns of a table distributed to different sites • Combinations of the above 11 Distributed Database Strategies • Centralized database, distributed access • Replication with periodic snapshot update • Replication with near real-time synchronization of updates • Partitioned, one logical database • Partitioned, independent, nonintegrated segments 12 Distributed DBMS architecture 13 Evolution of Distributed DBMS • Distributed Unit of Work • Different statements in a unit of work may refer to different remote sites • All databases in a single SQL statement must be at a single site • Distributed Request • A single SQL statement may refer to tables in more than one remote site • May not support replication transparency or failure transparency 14 15