0% found this document useful (0 votes)

50 views46 pages

Distributed DBMS for IT Professionals

This document provides an overview of distributed database systems, including their history, promises, and key design issues. It discusses topics like distributed database design, query processing, concurrency control, reliability, and replication.

Uploaded by

Mostafa Elrashidy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views46 pages

Distributed DBMS for IT Professionals

Uploaded by

Mostafa Elrashidy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 46

Distributed Database Systems

1
Outline
◼ Introduction
◼ Distributed Database Design
◼ Distributed Data Control
◼ Distributed Query Processing
◼ Distributed Transaction Processing
◼ Data Replication
◼ Database Integration – Multi-database Systems
◼ Web Data Management

2
Outline
◼ Introduction
❑ What is a distributed DBMS
❑ History
❑ Distributed DBMS promises
❑ Design issues
❑ Distributed DBMS architecture

3
Distributed Computing

◼ A number of autonomous processing elements (not

necessarily homogeneous) that are interconnected by a
computer network and that cooperate in performing their
assigned tasks.
◼ What is being distributed?
❑ Processing logic
❑ Function
❑ Data
❑ Control

4
Current Distribution – Geographically
Distributed Data Centers

5
What is a Distributed Database System?

A distributed database is a collection of multiple, logically

interrelated databases distributed over a computer network

A distributed database management system (Distributed

DBMS) is the software that manages the DDB and provides
an access mechanism that makes this distribution
transparent to the users

6
What is not a DDBS?

◼ A timesharing computer system

◼ A loosely or tightly coupled multiprocessor system
◼ A database system which resides at one of the nodes of
a network of computers - this is a centralized database
on a network node

7
Distributed DBMS Environment

8
Implicit Assumptions

◼ Data stored at a number of sites → each site logically

consists of a single processor
◼ Processors at different sites are interconnected by a
computer network → not a multiprocessor system
❑ Parallel database systems
◼ Distributed database is a database, not a collection of
files → data logically related as exhibited in the users’
access patterns
❑ Relational data model
◼ Distributed DBMS is a full-fledged DBMS
❑ Not remote file system, not a TP system

9
Important Point

Logically integrated
but
Physically distributed

10
Outline
◼ Introduction
❑

❑ History
❑

11
History – File Systems

12
History – Database Management

13
History – Early Distribution
Peer-to-Peer (P2P)

14
History – Client/Server

15
History – Data Integration

16
History – Cloud Computing

On-demand, reliable services provided over the Internet in

a cost-efficient manner
◼ Cost savings: no need to maintain dedicated compute
power
◼ Elasticity: better adaptivity to changing workload

17
Data Delivery Alternatives

◼ Delivery modes
❑ Pull-only
❑ Push-only
❑ Hybrid
◼ Frequency
❑ Periodic
❑ Conditional
❑ Ad-hoc or irregular
◼ Communication Methods
❑ Unicast
❑ One-to-many
◼ Note: not all combinations make sense
18
Outline
◼ Introduction
❑

❑ Distributed DBMS promises

❑

19
Distributed DBMS Promises

 Transparent management of distributed, fragmented,

and replicated data

 Improved reliability/availability through distributed

transactions

 Improved performance

 Easier and more economical system expansion

Transparency

◼ Transparency is the separation of the higher-level

semantics of a system from the lower level
implementation issues.
◼ Fundamental issue is to provide
data independence
in the distributed environment
❑ Network (distribution) transparency
❑ Replication transparency
❑ Fragmentation transparency
◼ horizontal fragmentation: selection
◼ vertical fragmentation: projection
◼ hybrid
Example

22
Transparent Access

Tokyo

SELECT ENAME,SAL
FROM EMP,ASG,PAY Boston Paris
WHERE DUR > 12 Paris projects
Paris employees
AND EMP.ENO = ASG.ENO Communication Paris assignments
Network Boston employees
AND PAY.TITLE = EMP.TITLE
Boston projects
Boston employees
Boston assignments
Montreal
New
Montreal projects
York Paris projects
Boston projects New York projects
New York employees with budget > 200000
New York projects Montreal employees
New York assignments Montreal assignments

23
Distributed Database - User View

Distributed Database

24
Distributed DBMS - Reality
User
Query

User
DBMS
Application
Software
DBMS
Software

DBMS Communication
Software Subsystem

User
DBMS User Application
Software Query
DBMS
Software

User
Query

25
Types of Transparency

◼ Data independence
◼ Network transparency (or distribution transparency)
❑ Location transparency
❑ Fragmentation transparency
◼ Fragmentation transparency
◼ Replication transparency

26
Reliability Through Transactions

◼ Replicated components and data should make distributed

DBMS more reliable.
◼ Distributed transactions provide
❑Concurrency transparency
❑ Failure atomicity

• Distributed transaction support requires implementation of

❑ Distributed concurrency control protocols

❑ Commit protocols

◼ Data replication
❑ Great for read-intensive workloads, problematic for updates
❑ Replication protocols

27
Potentially Improved Performance

◼ Proximity of data to its points of use

❑ Requires some support for fragmentation and replication

◼ Parallelism in execution

❑ Inter-query parallelism

❑ Intra-query parallelism

28
Scalability

◼ Issue is database scaling and workload scaling

◼ Adding processing and storage power

◼ Scale-out: add more servers

❑ Scale-up: increase the capacity of one server → has limits

29
Outline
◼ Introduction
❑

❑ Design issues
❑

30
Distributed DBMS Issues

◼ Distributed database design

❑ How to distribute the database
❑ Replicated & non-replicated database distribution
❑ A related problem in directory management

◼ Distributed query processing

❑ Convert user transactions to data manipulation instructions
❑ Optimization problem
◼ min{cost = data transmission + local processing}
❑ General formulation is NP-hard

31
Distributed DBMS Issues

◼ Distributed concurrency control

❑ Synchronization of concurrent accesses
❑ Consistency and isolation of transactions' effects
❑ Deadlock management

◼ Reliability
❑ How to make the system resilient to failures
❑ Atomicity and durability

32
Distributed DBMS Issues

◼ Replication
❑ Mutual consistency
❑ Freshness of copies
❑ Eager vs lazy
❑ Centralized vs distributed
◼ Parallel DBMS
❑ Objectives: high scalability and performance
❑ Not geo-distributed
❑ Cluster computing

33
Related Issues

◼ Alternative distribution approaches

❑ Modern P2P
❑ World Wide Web (WWW or Web)
◼ Big data processing
❑ 4V: volume, variety, velocity, veracity
❑ MapReduce & Spark
❑ Stream data
❑ Graph analytics
❑ NoSQL
❑ NewSQL
❑ Polystores

34
Outline
◼ Introduction
❑

❑ Distributed DBMS architecture

35
DBMS Implementation Alternatives

36
Dimensions of the Problem

◼ Distribution
❑ Whether the components of the system are located on the same machine or
not
◼ Heterogeneity
❑ Various levels (hardware, communications, operating system)
❑ DBMS important one
◼ data model, query language,transaction management algorithms
◼ Autonomy
❑ Not well understood and most troublesome
❑ Various versions
◼ Design autonomy: Ability of a component DBMS to decide on issues related to its
own design.
◼ Communication autonomy: Ability of a component DBMS to decide whether and
how to communicate with other DBMSs.
◼ Execution autonomy: Ability of a component DBMS to execute local operations in
any manner it wants to.

37
Client/Server Architecture

38
Advantages of Client-Server
Architectures
◼ More efficient division of labor
◼ Horizontal and vertical scaling of resources
◼ Better price/performance on client machines
◼ Ability to use familiar tools on client machines
◼ Client access to remote data (via standards)
◼ Full DBMS functionality provided to client workstations
◼ Overall better system price/performance

39
Database Server

40
Distributed Database Servers

41
Peer-to-Peer Component Architecture

42
MDBS Components & Execution

43
Mediator/Wrapper Architecture

44
Cloud Computing

On-demand, reliable services provided over the Internet in

a cost-efficient manner
◼ IaaS – Infrastructure-as-a-Service

◼ PaaS – Platform-as-a-Service

◼ SaaS – Software-as-a-Service

◼ DaaS – Database-as-a-Service

45
Simplified Cloud Architecture

CSE 453 Slide 1
No ratings yet
CSE 453 Slide 1
46 pages
Distributed DBMS Architecture Guide
No ratings yet
Distributed DBMS Architecture Guide
40 pages
Distributed Database Systems Guide
0% (1)
Distributed Database Systems Guide
54 pages
Distributed Database Management Systems
No ratings yet
Distributed Database Management Systems
23 pages
Distributed DBMS for CS Students
No ratings yet
Distributed DBMS for CS Students
67 pages
DDS Lecture 2
0% (1)
DDS Lecture 2
38 pages
Lec1 30 9 16
No ratings yet
Lec1 30 9 16
32 pages
Unit 4 - Concept of Distributed DBMS
No ratings yet
Unit 4 - Concept of Distributed DBMS
29 pages
Chapter - 6 Distributed Database System
No ratings yet
Chapter - 6 Distributed Database System
50 pages
Distributed Database Systems Guide
No ratings yet
Distributed Database Systems Guide
24 pages
Basis For Distributed Database Technology
No ratings yet
Basis For Distributed Database Technology
35 pages
RBD Lectures Merged
No ratings yet
RBD Lectures Merged
367 pages
Distributed Database Management Systems
No ratings yet
Distributed Database Management Systems
73 pages
Distributed Database Systems Overview
No ratings yet
Distributed Database Systems Overview
22 pages
1 Introduction
No ratings yet
1 Introduction
46 pages
Distributed Database Systems Guide
No ratings yet
Distributed Database Systems Guide
52 pages
Distributed Database Systems (DDBS)
No ratings yet
Distributed Database Systems (DDBS)
30 pages
Distributed Databases
No ratings yet
Distributed Databases
55 pages
Unit - 1 DDB
No ratings yet
Unit - 1 DDB
34 pages
Basis For Distributed Database Technology
No ratings yet
Basis For Distributed Database Technology
35 pages
Lecture 1 Ho PDF
No ratings yet
Lecture 1 Ho PDF
62 pages
Lecture 1 Ho
No ratings yet
Lecture 1 Ho
62 pages
Advanced Data Base Management Systems
No ratings yet
Advanced Data Base Management Systems
35 pages
Distributed Database: Source
No ratings yet
Distributed Database: Source
19 pages
Topic 7 - Distributed Database Systems
No ratings yet
Topic 7 - Distributed Database Systems
44 pages
Distributed Database System
No ratings yet
Distributed Database System
15 pages
DDBS BCS 2 Distributed Database Notes
No ratings yet
DDBS BCS 2 Distributed Database Notes
16 pages
Parallel & Distributed DBMS Guide
No ratings yet
Parallel & Distributed DBMS Guide
58 pages
Distributed DBMS Explained
No ratings yet
Distributed DBMS Explained
12 pages
Chapter-7 Distributed Database Systems
No ratings yet
Chapter-7 Distributed Database Systems
40 pages
1 Distributed DB
No ratings yet
1 Distributed DB
67 pages
Distributed Database MID Notes
No ratings yet
Distributed Database MID Notes
19 pages
ADBMS
No ratings yet
ADBMS
84 pages
10-Distributed Databases Lecturer 3 Best
No ratings yet
10-Distributed Databases Lecturer 3 Best
55 pages
Distributeddbms Er. Inderjeet Bal
No ratings yet
Distributeddbms Er. Inderjeet Bal
60 pages
Chapter - 7 Distributed Database System
No ratings yet
Chapter - 7 Distributed Database System
58 pages
Distributed Database Management Systems
No ratings yet
Distributed Database Management Systems
63 pages
1-Introduction TO Principles of Distributed Database Systems
No ratings yet
1-Introduction TO Principles of Distributed Database Systems
46 pages
Topic 7 DDBMS
No ratings yet
Topic 7 DDBMS
28 pages
1 Introduction
No ratings yet
1 Introduction
46 pages
13-Distributed Databases
No ratings yet
13-Distributed Databases
12 pages
Distributed Database Management Systems
No ratings yet
Distributed Database Management Systems
123 pages
Chapter 4 Distributed Database Systems
No ratings yet
Chapter 4 Distributed Database Systems
69 pages
Distributed DBMS Architecture
No ratings yet
Distributed DBMS Architecture
49 pages
DDB Unit 1-5
No ratings yet
DDB Unit 1-5
190 pages
Internal 050203
No ratings yet
Internal 050203
74 pages
Chapter 6 Distributed System Management
No ratings yet
Chapter 6 Distributed System Management
12 pages
Lecture 1 DDS Continue. (Downloaded With 1stbrowser)
No ratings yet
Lecture 1 DDS Continue. (Downloaded With 1stbrowser)
32 pages
Ddbms-Unit 1 Part2
No ratings yet
Ddbms-Unit 1 Part2
16 pages
Distributed Database Systems Guide
100% (1)
Distributed Database Systems Guide
54 pages
Lecture3-Distributed Introduction
No ratings yet
Lecture3-Distributed Introduction
38 pages
Adbms Chapter 7 Ddbms
No ratings yet
Adbms Chapter 7 Ddbms
73 pages
Outline: What Is A Distributed DBMS Distributed DBMS Architecture
No ratings yet
Outline: What Is A Distributed DBMS Distributed DBMS Architecture
40 pages
Distributed Database Chapter 1 Modified
No ratings yet
Distributed Database Chapter 1 Modified
47 pages
Types of Distributed Data Base System - 49724
No ratings yet
Types of Distributed Data Base System - 49724
37 pages
All Merged
No ratings yet
All Merged
513 pages
Tutorial 3 - Phase Controlled AC-DC Converters
No ratings yet
Tutorial 3 - Phase Controlled AC-DC Converters
3 pages
Access Control (5G RAN6.1 - Draft A)
No ratings yet
Access Control (5G RAN6.1 - Draft A)
28 pages
ICT Technician Job Description
No ratings yet
ICT Technician Job Description
3 pages
LCD Troubleshooting Guide PDF
No ratings yet
LCD Troubleshooting Guide PDF
35 pages
Chapter 1 Intro To CP
No ratings yet
Chapter 1 Intro To CP
37 pages
Practical No 05
No ratings yet
Practical No 05
4 pages
U-V200-18-E4xb Spec
No ratings yet
U-V200-18-E4xb Spec
16 pages
Power Optimization in Design Compiler
No ratings yet
Power Optimization in Design Compiler
3 pages
Intro to Little Man Computer
No ratings yet
Intro to Little Man Computer
40 pages
XR 1458
No ratings yet
XR 1458
2 pages
Et200sp Im 155 6 PN ST Manual en-US en-US
No ratings yet
Et200sp Im 155 6 PN ST Manual en-US en-US
47 pages
BSDA Amera PDX Peplink Presentation
No ratings yet
BSDA Amera PDX Peplink Presentation
24 pages
Intoduction: Machine Input Output Human History Astrolabe Abacus
No ratings yet
Intoduction: Machine Input Output Human History Astrolabe Abacus
4 pages
SCADA Solutions Brochure 2010
No ratings yet
SCADA Solutions Brochure 2010
12 pages
Dr. Rajesh Raut Nagpur 17.02 Yrs
No ratings yet
Dr. Rajesh Raut Nagpur 17.02 Yrs
6 pages
Compact Performance CP Fieldbus Node 13: Programming and Diagnosis
No ratings yet
Compact Performance CP Fieldbus Node 13: Programming and Diagnosis
103 pages
VX Works and Interrupt Service Routines
100% (1)
VX Works and Interrupt Service Routines
22 pages
P2041/P2040 Qoriq Integrated Processor Design Checklist: About This Document
No ratings yet
P2041/P2040 Qoriq Integrated Processor Design Checklist: About This Document
47 pages
Peta Okupasi Bidang TIK
No ratings yet
Peta Okupasi Bidang TIK
1 page
Ubuntu Lab
No ratings yet
Ubuntu Lab
2 pages
Radiant Manual
No ratings yet
Radiant Manual
185 pages
Healy World Manual HealAdvisor Analyse App en EU US
100% (1)
Healy World Manual HealAdvisor Analyse App en EU US
31 pages
Installing Cloudera VM
No ratings yet
Installing Cloudera VM
7 pages
Fiio FH3 EQ Settings and Analysis
No ratings yet
Fiio FH3 EQ Settings and Analysis
1 page
Startlaz 7
No ratings yet
Startlaz 7
20 pages
AGC-2, DRH Vers. 1.52.4 and Earlier, 4189340-258 UK PDF
No ratings yet
AGC-2, DRH Vers. 1.52.4 and Earlier, 4189340-258 UK PDF
121 pages
Drs Sigint Application Note
No ratings yet
Drs Sigint Application Note
7 pages
SRWE Module 8
100% (1)
SRWE Module 8
37 pages
Workday HCM Techno-Functional Course Content
No ratings yet
Workday HCM Techno-Functional Course Content
3 pages
Ethical Hacking and Prevention: Course Contents
No ratings yet
Ethical Hacking and Prevention: Course Contents
4 pages

Distributed DBMS for IT Professionals

Uploaded by

Distributed DBMS for IT Professionals

Uploaded by

Distributed Database Systems

◼ A number of autonomous processing elements (not

A distributed database is a collection of multiple, logically

A distributed database management system (Distributed

◼ A timesharing computer system

◼ Data stored at a number of sites → each site logically

On-demand, reliable services provided over the Internet in

❑ Distributed DBMS promises

 Transparent management of distributed, fragmented,

 Improved reliability/availability through distributed

 Easier and more economical system expansion

◼ Transparency is the separation of the higher-level

◼ Replicated components and data should make distributed

• Distributed transaction support requires implementation of

◼ Proximity of data to its points of use

❑ Requires some support for fragmentation and replication

◼ Issue is database scaling and workload scaling

◼ Adding processing and storage power

◼ Scale-out: add more servers

❑ Scale-up: increase the capacity of one server → has limits

◼ Distributed database design

◼ Distributed query processing

◼ Distributed concurrency control

◼ Alternative distribution approaches

❑ Distributed DBMS architecture

On-demand, reliable services provided over the Internet in

You might also like