Topic 7 - Distributed Database Systems

The document discusses distributed database systems, including their architecture, design issues, advantages, and types. A distributed database system exists where logically related data is physically distributed between processors linked by a network, with a distributed database management system making this transparent to users. Key issues in distributed database design include data and control distribution and ensuring performance, transparency, and standards.


DISTRIBUTED DATABASE SYSTEMS

Distributed Databases 1
Outline
Introduction
Distributed DBMS Architecture
Distributed Database Design Issues
Date’s Twelve Rules for DDBMS
Parallel Database Systems

Distributed Databases 2
Motivation

Database technology (integration) + computer networks (distribution)
=> distributed database systems (integration)
Integration ≠ centralization

Distributed Databases 3
Distributed Computing
Synonymous terms
distributed function
distributed data processing
multiprocessors/multicomputers
satellite processing
backend processing
dedicated/special purpose computers
timeshared systems
functionally modular systems
Distributed Databases 4
What is Distributed?
Processing logic
Functions
Data
Control

Distributed Databases 5
What is a Distributed Database System?
A distributed database (DDB) exists where logically related data is physically distributed between a number of separate processors linked by a communications network.
A distributed database management system (DDBMS) is the software that manages the DDB and provides an access mechanism that makes this distribution transparent to the users.
Distributed database system (DDBS) = DDB + DDBMS

Distributed Databases 6
What is not a DDBS?
A timesharing computer system
A loosely or tightly coupled multiprocessor
system
A database system which resides at one of the
nodes of a network of computers - this is a
centralized database on a network node

Distributed Databases 7
Centralized DBMS on a Network
[Figure: Sites 1 to 4 connected by a communications network, with a single database held at one site.]

Distributed Databases 8
Distributed DBMS Environment
[Figure: Sites 1 to 4 connected by a communications network, with separate databases (labelled Database 1, Database 2, and Database 4) held at individual sites.]

Distributed Databases 9
Implicit Assumptions
Data is stored at a number of sites; each site logically consists of a single processor.
Processors at different sites are interconnected by a computer network. Multiprocessors are excluded; they belong to parallel database systems.
A distributed database is a database, not a collection of files: the data is logically related, as exhibited in the users’ access patterns (relational data model).
The DDBMS is a full-fledged DBMS, not a remote file system and not a TP system.

Distributed Databases 10
Parallel Database Architectures

Distributed Databases 11
Shared Memory Architecture
Processors and disks have access to a
common memory, typically via a bus or
through an interconnection network.

Distributed Databases 12
Shared Disk Architecture
All processors can directly access all disks
via an interconnection network, but the
processors have private memories.
This architecture provides a degree of fault tolerance: if a processor fails, the other
processors can take over its tasks, since
the database is resident on disks that are
accessible from all processors.

Distributed Databases 13
Shared Nothing Architecture
Each node consists of a processor, memory,
and one or more disks. A processor at one
node communicates with a processor
at another node using an interconnection
network. A node functions as the server for
the data on the disk or disks the node
owns.
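A minimal sketch of the ownership idea above, assuming a toy in-memory cluster. The `Node` and `Cluster` classes, the node names, and the sample keys are hypothetical and only illustrate that a request is served by the node that owns the data.

```python
class Node:
    """One shared-nothing node: its own processor, memory, and disk."""

    def __init__(self, name, local_data):
        self.name = name
        self._disk = dict(local_data)  # private to this node, never shared

    def owns(self, key):
        return key in self._disk

    def read(self, key):
        return self._disk[key]


class Cluster:
    """Stands in for the interconnection network between nodes."""

    def __init__(self, nodes):
        self.nodes = list(nodes)

    def read(self, key):
        # Forward the request to whichever node owns the key.
        for node in self.nodes:
            if node.owns(key):
                return node.name, node.read(key)
        raise KeyError(key)


cluster = Cluster([
    Node("node_a", {"H90": "CAD/CAM"}),
    Node("node_b", {"S67": "Database Dev"}),
])
print(cluster.read("S67"))
```

A real system would consult a partition map rather than ask each node in turn; the loop is kept only to keep the sketch short.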

Distributed Databases 14
Applications of DDB
Manufacturing - especially multi-plant
manufacturing
Military command and control
Banking
Corporate MIS
Airlines
Hotel chains
Any organization which has a decentralized
organization structure

Distributed Databases 15
Advantages of DDBS
Organizational Structure - fits into
organizations distributed over several locations
Shareability and Local Autonomy - local users
can control their own data while it remains
accessible ‘globally’
Improved availability – if there is failure at one
site, others are accessible

Distributed Databases 16
Advantages of DDBS
Improved reliability - replication of data
Improved performance - local data is located
where demand for it is likely to be greatest
Transparent management of distributed,
fragmented, and replicated data
Economical - centralized processing power in a
single piece of hardware is not necessarily
cheaper than separate smaller units
Modular growth – simpler to expand

Distributed Databases 17
Disadvantages
Complexity – by hiding their distributed nature
and trying to ensure optimum performance,
reliability and availability, DDBS are more
complex
Cost – procurement and maintenance cost
Security – more difficult
Integrity Control more difficult
Lack of Standards
Lack of Experience
Database Design more Complex
Distributed Databases 18
Types of DDBMS

Homogeneous DDBMS
Heterogeneous DDBMS

Distributed Databases 19
Homogeneous DDBMS

All sites use same DBMS product.

Much easier to design and manage.
Approach provides incremental growth
and allows increased performance.

Distributed Databases 20
Heterogeneous DDBMS

Sites may run different DBMS products, with possibly different underlying data models.
Occurs when sites have implemented their own databases and integration is considered later.
Translations are required to allow for:
Different hardware
Different DBMS products
Different hardware and different DBMS products
Typical solution is to use gateways.
Distributed Databases 21
Reference Architecture for
DDBMS
Due to diversity, no accepted architecture
equivalent to ANSI/SPARC 3-level
architecture.
A reference architecture consists of:
Set of global external schemas.
Global conceptual schema (GCS).
Fragmentation schema and allocation schema.
Set of schemas for each local DBMS conforming
to 3-level ANSI/SPARC.
Some levels may be missing, depending on
levels of transparency supported.
Distributed Databases 22
Reference Architecture for
DDBMS

Distributed Databases 23
Key Design Issues
Division and Location of Data
Why fragment at all?
How to fragment?
How much to fragment?
Division and Location of Control
Performance
Transparency to the User
Degree of homogeneity

Distributed Databases 24
Fragmentation
Why fragment?
Usage:
- Applications work with views rather than entire relations.
Efficiency:
- Data is stored close to where it is most frequently used.
- Data not needed by local applications is not stored.
Security:
- Data not needed by local applications is not stored, and so is not available to unauthorized users.
Parallelism:
- With fragments as the unit of distribution, a transaction can be divided
into several subqueries that operate on
fragments.
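The parallelism point can be sketched as follows: one query (total budget) is split into one subquery per fragment, and the partial results are combined. The fragment names and the use of a thread pool are assumptions for illustration; the budget values are taken from the project table used in the fragmentation slides.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical placement: each entry is the budget column of one fragment.
FRAGMENTS = {
    "fragment_low": [200000, 300000, 100000],
    "fragment_high": [600000, 450000],
}


def subquery_total(budgets):
    """Subquery that runs against a single fragment."""
    return sum(budgets)


def total_budget(fragments):
    """Run one subquery per fragment in parallel, then combine the results."""
    with ThreadPoolExecutor() as pool:
        partials = list(pool.map(subquery_total, fragments.values()))
    return sum(partials)


print(total_budget(FRAGMENTS))
```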
Distributed Databases 25
Horizontal Fragmentation
Projects with budget less than 400,000 / projects with budget greater than or equal to 400,000

Pno   pname           budget   loc
H90   CAD/CAM         200000   Nairobi
S67   Database Dev    600000   H/Q
T67   Maintenance     450000   Kisumu
T90   Networks        300000   Mombasa
S45   School System   100000   Nairobi
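As a rough sketch, the split described in the slide title can be written as a selection on the budget column. The function names are hypothetical; the rows and the 400,000 threshold come from the slide.

```python
# Project relation from the slide: (pno, pname, budget, loc).
PROJECT = [
    ("H90", "CAD/CAM", 200000, "Nairobi"),
    ("S67", "Database Dev", 600000, "H/Q"),
    ("T67", "Maintenance", 450000, "Kisumu"),
    ("T90", "Networks", 300000, "Mombasa"),
    ("S45", "School System", 100000, "Nairobi"),
]

BUDGET_LIMIT = 400_000  # threshold named in the slide title


def horizontal_fragments(rows, limit=BUDGET_LIMIT):
    """Split rows into (budget < limit, budget >= limit) fragments."""
    low = [r for r in rows if r[2] < limit]
    high = [r for r in rows if r[2] >= limit]
    return low, high


def reconstruct(low, high):
    """A horizontal fragmentation is undone by taking the union of the fragments."""
    return low + high


low, high = horizontal_fragments(PROJECT)
print([r[0] for r in low])   # projects with budget < 400,000
print([r[0] for r in high])  # projects with budget >= 400,000
```

Every row lands in exactly one fragment, so the union of the two fragments gives back the original relation.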

Distributed Databases 26
Vertical Fragmentation
Info about project budgets / info about project names and locations

Pno   pname           budget   loc
H90   CAD/CAM         200000   Nairobi
S67   Database Dev    600000   H/Q
T67   Maintenance     450000   Kisumu
T90   Networks        300000   Mombasa
S45   School System   100000   Nairobi
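A vertical split can be sketched as two projections of the same table. Repeating the key column `pno` in both fragments is an assumption (the slide does not say so), made here so that the fragments can be joined back together; the helper names are hypothetical.

```python
COLUMNS = ("pno", "pname", "budget", "loc")
PROJECT = [
    ("H90", "CAD/CAM", 200000, "Nairobi"),
    ("S67", "Database Dev", 600000, "H/Q"),
    ("T67", "Maintenance", 450000, "Kisumu"),
    ("T90", "Networks", 300000, "Mombasa"),
    ("S45", "School System", 100000, "Nairobi"),
]


def project_columns(rows, columns, keep):
    """Keep only the named columns of every row (a relational projection)."""
    idx = [columns.index(c) for c in keep]
    return [tuple(r[i] for i in idx) for r in rows]


# Fragment 1: project budgets. Fragment 2: project names and locations.
budgets = project_columns(PROJECT, COLUMNS, ("pno", "budget"))
details = project_columns(PROJECT, COLUMNS, ("pno", "pname", "loc"))


def rejoin(budgets, details):
    """Rebuild the original rows by joining the fragments on pno."""
    budget_by_pno = dict(budgets)
    return [(pno, pname, budget_by_pno[pno], loc) for pno, pname, loc in details]


print(budgets[0], details[0])
```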

Distributed Databases 27
Allocation Alternatives
Non-replicated
partitioned : each fragment resides at only one site
Replicated
fully replicated : each fragment at each site
partially replicated : each fragment at some of the
sites
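The three alternatives can be told apart by counting copies per fragment. The sketch below assumes an allocation is a mapping from fragment name to the set of sites holding a copy, and that every fragment is stored at least once; the function name, fragment names, and site names are hypothetical.

```python
def classify_allocation(allocation, sites):
    """Classify an allocation: fragment name -> set of sites holding a copy."""
    if all(len(held) == 1 for held in allocation.values()):
        return "partitioned"
    if all(set(held) == set(sites) for held in allocation.values()):
        return "fully replicated"
    return "partially replicated"


SITES = {"Nairobi", "Kisumu", "Mombasa"}

print(classify_allocation({"f1": {"Nairobi"}, "f2": {"Kisumu"}}, SITES))
print(classify_allocation({"f1": SITES, "f2": SITES}, SITES))
print(classify_allocation({"f1": {"Nairobi", "Kisumu"}, "f2": {"Mombasa"}}, SITES))
```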

Distributed Databases 28
Replication Alternatives

                       Full replication       Partial replication   Partitioned
Query processing       Easy                   Same difficulty       Same difficulty
Directory management   Easy or non-existent   Same difficulty       Same difficulty
Concurrency control    Moderate               Difficult             Easy
Reliability            Very high              High                  Low
Reality                Possible application   Realistic             Possible application

Distributed Databases 29
Parallelism Requirements
Have as much of the data required by each
application at the site where the application
executes
Full replication
How about updates?
Updates to replicated data require implementation
of distributed concurrency control and commit
protocols

Distributed Databases 30
System Expansion
Issue is database scaling
Emergence of microprocessor and workstation
technologies
Client-server model of computing
Data communication cost vs telecommunication
cost

Distributed Databases 31
Distributed DBMS Issues
Distributed Database Design
how to distribute the database
replicated & non-replicated database
distribution
related problem in directory management
Query Processing
convert user transactions to data
manipulation instructions
optimization problem

Distributed Databases 32
Distributed DBMS Issues
Concurrency Control
synchronization of concurrent accesses
consistency and isolation of transactions' effects
deadlock management
Reliability
how to make the system resilient to failures
atomicity and durability

Distributed Databases 33
Transparency in a DDBMS
Transparency hides implementation details from users.

Overall objective: to the user, a DDBMS should be equivalent to a
centralised DBMS. Full transparency is not a universally accepted
objective.

Four main types:

1. Distribution transparency
2. Transaction transparency
3. Performance transparency
4. DBMS transparency (only applicable to heterogeneous)
Distributed Databases 34
1. Distribution Transparency
Distribution transparency: allows user to perceive database as
single, logical entity.

If a DDBMS exhibits distribution transparency, the user does not need to know:

• that the data is fragmented (fragmentation transparency)
• where data items are located (location transparency)
• that fragments are replicated (replication transparency)
If the user must specify both the fragments and their locations, this is called local mapping transparency.
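One way to picture these levels is as three access functions that ask the caller for progressively less. This is only a sketch: the catalog, the storage layout, the fragment and site names, and the function names are all hypothetical.

```python
# Hypothetical catalog: fragment name -> sites that hold a copy of it.
CATALOG = {
    "project_low": ["Nairobi", "Mombasa"],  # replicated at two sites
    "project_high": ["Kisumu"],
}

# Hypothetical storage: (site, fragment name) -> rows of (pno, budget).
STORAGE = {
    ("Nairobi", "project_low"): [("H90", 200000), ("T90", 300000)],
    ("Mombasa", "project_low"): [("H90", 200000), ("T90", 300000)],
    ("Kisumu", "project_high"): [("S67", 600000)],
}


def read_copy(site, fragment):
    """Local mapping level: the caller names both the fragment and the site."""
    return STORAGE[(site, fragment)]


def read_fragment(fragment):
    """Location and replication transparency: the caller names only the fragment."""
    site = CATALOG[fragment][0]  # any listed copy would do in this sketch
    return read_copy(site, fragment)


def read_relation():
    """Fragmentation transparency: the caller names neither fragments nor sites."""
    rows = []
    for fragment in CATALOG:
        rows.extend(read_fragment(fragment))
    return rows


print(read_relation())
```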

Distributed Databases 35
2. Transaction Transparency
Transaction transparency: ensures that all distributed transactions (Txs)
maintain the distributed database’s integrity and
consistency.

• A distributed Tx accesses data stored at more than one location.
• Each Tx is divided into a number of sub-transactions (subTxs), one for each site
that has to be accessed.
• The DDBMS must ensure the indivisibility of both the global
Tx and each of the subTxs.

Distributed Databases 36
2. Transaction Transparency
Concurrency transparency: all Txs must execute independently and
be logically consistent with the results that would be obtained if the Txs were executed one at a time, in some
arbitrary serial order.
• Replication makes concurrency more complex.
Failure transparency: the DDBMS must ensure atomicity and durability of the global
Tx.
• This means ensuring that the subTxs of a global Tx either all commit or all
abort.
Classification of transactions: in IBM’s Distributed Relational
Database Architecture (DRDA), there are four types of Txs:
– Remote request
– Remote unit of work
– Distributed unit of work
– Distributed request
Distributed Databases 37
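The all-commit-or-all-abort requirement can be sketched with a simple voting scheme in the style of two-phase commit. The slides do not name a specific protocol, so treat this as one common approach rather than the method they intend; the class and function names are hypothetical, and coordinator failure, timeouts, and logging are ignored.

```python
class SubTransaction:
    """The part of a global transaction that runs at one site."""

    def __init__(self, site, can_commit=True):
        self.site = site
        self.can_commit = can_commit
        self.state = "active"

    def prepare(self):
        """Vote on whether this site is able to commit."""
        return self.can_commit

    def commit(self):
        self.state = "committed"

    def abort(self):
        self.state = "aborted"


def run_global_transaction(subtransactions):
    """Commit every sub-transaction only if all of them vote yes."""
    votes = [sub.prepare() for sub in subtransactions]  # phase 1: collect votes
    decision = all(votes)
    for sub in subtransactions:                         # phase 2: apply decision
        if decision:
            sub.commit()
        else:
            sub.abort()
    return "committed" if decision else "aborted"


subs = [SubTransaction("Nairobi"), SubTransaction("Kisumu", can_commit=False)]
print(run_global_transaction(subs), [s.state for s in subs])
```

A single "no" vote aborts the sub-transactions at every site, which is what keeps the global transaction indivisible.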
3. Performance Transparency
The DDBMS must:
- show no performance degradation due to the distributed architecture;
- determine the most cost-effective strategy to execute a request.

The Distributed Query Processor (DQP) maps a data request into an ordered
sequence of operations on local databases.
- It must consider the fragmentation, replication, and allocation schemas.
The DQP has to decide:
1. which fragment to access
2. which copy of a fragment to use
3. which location to use.
- It produces an execution strategy optimized with respect to some cost
function.
Typically, the costs associated with a distributed request include I/O cost,
CPU cost, and communication cost.
Distributed Databases 38
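The choice of copy can be sketched as minimising a cost function over the available copies of each fragment. The unweighted sum of I/O, CPU, and communication cost is an assumption, as are the `Copy` type, the function names, and the cost figures; a real optimizer would weight and estimate these terms.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Copy:
    """One stored copy of a fragment, with hypothetical cost estimates."""
    fragment: str
    site: str
    io_cost: float
    cpu_cost: float
    comm_cost: float


def total_cost(copy):
    """Assumed cost function: plain sum of the three cost components."""
    return copy.io_cost + copy.cpu_cost + copy.comm_cost


def choose_copies(copies, needed_fragments):
    """For each needed fragment, pick the copy with the lowest total cost."""
    plan = {}
    for fragment in needed_fragments:
        candidates = [c for c in copies if c.fragment == fragment]
        plan[fragment] = min(candidates, key=total_cost)  # ValueError if no copy exists
    return plan


COPIES = [
    Copy("project_low", "Nairobi", io_cost=4.0, cpu_cost=1.0, comm_cost=0.0),
    Copy("project_low", "Mombasa", io_cost=2.0, cpu_cost=1.0, comm_cost=9.0),
    Copy("project_high", "Kisumu", io_cost=3.0, cpu_cost=1.0, comm_cost=5.0),
]

plan = choose_copies(COPIES, ["project_low", "project_high"])
print({fragment: copy.site for fragment, copy in plan.items()})
```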
Date’s Twelve Rules for DDBMS 1

0 Fundamental Principle
To the user, a distributed system should look exactly like a
non-distributed system
1 Local Autonomy
The sites in a distributed system should be autonomous.
In this context, autonomy means that:
• Local data is locally owned and managed;
• Local operations remain purely local;
• All operations at a given site are controlled by that
site

Distributed Databases 39
Date’s Twelve Rules for DDBMS 2

2 No reliance on a Central Site

There should be no one site without which the system cannot
operate.
This implies that there should be no central servers for services
such as transaction management, deadlock detection, query
optimization, and management of the Global System Catalog
3 Continuous operation
Ideally, there should never be a need for a planned system
shutdown for operations such as:
adding or removing a site from the system;
the dynamic creation and deletion of fragments at one or more
sites

Distributed Databases 40
Date’s Twelve Rules for DDBMS 3

4 Location Independence (Transparency)

The user should be able to access the database from
any site. Furthermore, the user should be able to
access all data as if it were stored at the user’s site,
no matter where it is physically stored
5 Fragmentation Independence
The user should be able to access the data, no
matter how it is fragmented.

Distributed Databases 41
Date’s Twelve Rules for DDBMS 4

6 Replication independence
The user should be unaware that data has been
replicated.
Thus, the user should not be able to access a
particular copy of a data item directly, nor should
the user have to specifically update all copies of a
data item
7 Distributed query processing
The system should be capable of processing
queries that reference data at more than one site

Distributed Databases 42
Date’s Twelve Rules for DDBMS 5

8 Distributed Transaction Processing

The system should support the transaction as the unit of
recovery.
The system should ensure that both the global and local
transactions conform to the ACID rules for transactions,
namely: atomicity, consistency, isolation, and durability.
9 Hardware independence
It should be possible to run the DDBMS on a variety of
hardware platforms.
10 Operating system independence
As a corollary to the previous rule, it should be possible to
run the DDBMS on a variety of operating systems

Distributed Databases 43
Date’s Twelve Rules for DDBMS 6

11 Network Independence
Again, it should be possible to run the
DDBMS on a variety of disparate
communication networks
12 Database Independence
It should be possible to run different local
DBMSs, perhaps supporting different
underlying data models.
In other words, the system should support
heterogeneity
Distributed Databases 44
