0% found this document useful (0 votes)

7 views16 pages

ADBMS Assignment 2

The document is an assignment on Advanced Database Management Systems, focusing on a case study of RetailMart's Sales Data Warehouse. It covers OLAP operations, star and snowflake schemas, and includes definitions, diagrams, and application scenarios for data analysis. The assignment also compares the two schemas in terms of performance, redundancy, and complexity.

Uploaded by

eshayyy321

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views16 pages

ADBMS Assignment 2

Uploaded by

eshayyy321

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

Advanced Database Management Systems

Name Zainab Ehsan

Roll Number 347

Semester 4

Section I

Date 15-June-2025

Assignment Number 2

Submitted To Ma’am Farwa Javed

Lahore Garrison University

Department of Computer Science

1
Question 1

Case Study: RetailMart Sales Data Warehouse

RetailMart, a national retail chain, maintains a Sales Data Warehouse to analyze product
performance across its various stores. The data is organized in an OLAP cube with the
following dimensions:
• Time: Year → Quarter → Month → Day
• Region: Country → State → City
• Product: Category → Subcategory → Product Name
• Measure: Total Sales, Quantity Sold
RetailMart’s management uses this data to make strategic decisions such as promotional
planning, stock replenishment, and store performance evaluation.

Making an OLAP Cube

Time

Region
Product

2
Scenario 1:

A manager wants to compare Total Sales for each Product Category in Q1 2025.

Q1. What OLAP operation is this?

A. Identifying OLAP Operations

This is a Slice & Dice (though only we’re slicing data) and Roll-Up operation.

• Slice & Dice: As we’re Filtering/Selecting data for a specific time (Q1 2025).
• Roll-Up: Aggregating data up to the Product Category level from the Product
dimension.

Q2. What dimensions are involved?

A. Dimensions Involved

• Time Dimension – Q1 2025.

• Product – Category Comparison
• Measure – Total Sales.

Q3. What part of the cube is being selected?

A. Cube Selection

The following are the parts of the cube being selected:

• Time (Q1 2025)

• Product (Category)
• Region (All)
• Measure (Total Sales)

3
Scenario 2:

A regional analyst wants to break down the sales in California into more detail, viewing by
city level instead of state.

Q1. What OLAP operation is being applied?

A. Drill-down OLAP operation

As we’re selecting “City” instead of “State”.

Q2. From which level to which level is the transition happening?

A. Transition

From State → City in the Region dimension.

4
Scenario 3:

The head office wants to summarize total sales across all U.S. states into a single value by
year.

Q1. What OLAP operation does this represent?

A. Roll-Up OLAP Operation

As we’re selecting “Total Sales” into a single value by “Year” and from “State” level to
“Country” level.

Q2. Which dimension is being rolled up?

A. Region Dimension

The Region dimension from State level → to Country level (United States) is being rolled up.
Additionally, the Time dimension is being viewed at the Year level.

5
Scenario 4:

A category manager wants to view product performance by rotating the OLAP cube to see
regions as columns and product categories as rows.

Q1. Which OLAP operation is applied here?

A. Pivot OLAP Operation

As we’re rotating the cube making regions as columns and product categories as rows.
Product Categories

Regions

Region Product Categories

Before Rotation After Rotation

Q2. What is the benefit of this operation?

A. Benefit of Pivot OLAP Operation

• It reorganizes the view to make analysis easier to interpret.

• Makes comparison between dimensions more visual and accessible.
• Help reveal patterns by changing perspective.

6
Scenario 5:

A store supervisor filters the data cube to focus only on "Electronics" sold in "New York"
during December 2024.

Q1. What OLAP operation(s) are involved?

A. Slice and Dice OLAP Operations

As we’re using more than one filter such as filtering the data by "Electronics" to a specific
country i.e. "New York" to being specific about time i.e. December 2024.

Q2. Is this a slice, dice, or both? Justify

A. Both OLAP Operation

• Slice: Fixing one value per dimension — e.g., Time (December 2024).
• Dice: Applying multiple filters across dimensions — Product (Electronics), Region
(New York).

Justification: Because more than one dimension is being filtered at specific values/ranges.

7
Question 2

Star Schema & Snowflake Schema

Part A: Conceptual Understanding

Q1. Define the following terms:

a) Fact Table

A. Basically, a Fact Table is the central table in a data warehouse schema that contains
quantitative data (measures) such as sales, revenue, stock or quantity. It includes foreign keys
about dimension tables.

b) Dimension Table

A. A Dimension Table contains descriptive, textual, or categorical information related to

dimensions of a fact contained by fact table (like product, time, region, colors). It helps slice
and dice the data in different ways.

c) Star Schema

A. A Star Schema is shaped like a star. It organizes data into a central fact table surrounded by
dimension tables, hence resembling a star shape. It helps simplify queries, improve
performance, enhance readability and handle large volumes of data.

Figure 1: Star Schema Example

8
d) Snowflake Schema

A. A Snowflake Schema is a normalized version of the star schema, where dimension tables
are normalized into multiple related tables, resembling a snowflake's branching structure. It
has a Fact Table, Dimension Table, Normalized Dimensions, Hierarchical Structure. It helps
reduce data redundancy, improve data integrity, allows for more details analysis by drilling
down into different levels of dimensions. Though it does require more complex joins to retrieve
data from multiple dimension tables and while storage is efficient, the joins needed for querying
can sometimes slow down performance.

Figure 2: Snowflake Schema Example

e) Normalization in the context of data warehousing

A. Normalization in the context of data warehousing means organizing data to reduce

redundancy and improve consistency by dividing tables into related sub-tables and linking
them via keys.

9
Part B: Diagram-Based Analysis

Refer to the diagrams given above. Answer the following questions:

Q2. Based on Diagram 1 (Snowflake Schema - Vehicle Sales):

a) Identify the fact table and write its name and attributes.

A. Sales Fact Table & Its Attributes

Fact Table: Sales Fact Table

Attributes:

• Time_key (FK)
• Item_key (FK)
• Branch_key (FK)
• Location_key (FK)
• Dollars_sold (Measure)
• Units_sold (Measure)

b) Name any 3 dimension tables and explain the type of data they store.

A. 3 dimension tables

1. Time Dimension Table

10
Stores: Temporal data to analyze sales over time.

Attributes: Time_key, Day, Month, Quarter, Year

2. Item Dimension Table

Stores: Product-level descriptive data.

Attributes: Item_key, Item_name, Brand, Type, Supplier_key

3. Branch Dimension Table

Stores: Information about store branches.

Attributes: Branch_key, Branch_name, Branch_type

c) Explain how normalization is applied between the Dealer, Location, and Country tables.

A. Normalization between Tables

• Instead of storing the full city, state, and country data repeatedly in the Location table,
it's stored in a separate City table.
• The Location table just keeps a reference via City_key (foreign key).
• This is a clear application of 3rd Normal Form (3NF), reducing redundancy and
ensuring data consistency.

11
Q3. Based on Diagram 2 (Sales Star/Snowflake Schema - Retail Sales):

a) Is this schema a star or snowflake schema? Justify your answer.

A. Snowflake Schema

This is a Snowflake Schema as it shows normalized dimension tables.

• Dealer table references Location_ID and Country_ID instead of storing full location
and country names.
• Product table references Variant_ID from a separate Variant table.

There are multiple levels of hierarchy in dimension tables, hence, making it a snowflake
schema.

b) Describe the relationship between the Location Dimension Table and Country
Dimension Table.

A. Relationship between the Location Table and Country Table

Dealer Table links to both Location_ID and Country_ID.

But there is no direct relationship between them.

It reflects a hierarchical normalization, Country holds broader geographical info, Location

specifies the region, Dealer acts as the bridge linking both.

12
c) How does the schema help in analyzing supplier-wise and city-wise sales?

A. Supplier-Wise Sales and City-Wise Sales

Supplier-Wise Sales

Using the Revenue fact table, Dealer dimension via Dealer_ID can be reached.

Dealer connects to Country via Country_ID, which leads to Country_Name.

This enables analysis like:

• Total sales per country

• Year-over-year revenue by country
• Comparing regional performance across countries

City-Wise Sales

Supplier table doesn’t exist

13
Part C: Application and Comparison

Q4. Application Scenario:

You are a data analyst for a chain of retail stores. Your manager wants to understand how
sales vary by time, product, supplier, and branch.

a) Which schema (star or snowflake) would you recommend for faster reporting and why?

A. Recommendations for a Schema

I would recommend Star Schema for faster reporting as faster query performance meaning
less joins required. Moreover, all data is denormalized.

b) Which schema is more suitable for minimizing redundancy and ensuring consistency?

A. More Suitable Schema

I would recommend Snowflake Schema for minimizing redundancy and ensuring

consistency as repetitive data is stored only once and reduces costs. Moreover, dimensions
are normalized.

14
Q5. Comparison Table:

Fill in the blanks in the following comparison table:

Feature Star Schema Snowflake Schema

Table Normalization Denormalized Normalized

Query Performance Faster Slower
Storage Requirement More Less
Maintenance Complexity Less More

15
Q6. Star Schema

Draw a simple star schema for a “Library Management System” including:

Fact table: Book Issues

Dimension tables: Book, Member, Time, Librarian

A. Star Schema for Library Management System

Book
Member
Fact Table Dimension Table
Dimension Table

Book Issues

Librarian Time

Dimension Table Dimension Table

Unit 2 DWM
No ratings yet
Unit 2 DWM
16 pages
Data Warehouse Schemas & OLAP
No ratings yet
Data Warehouse Schemas & OLAP
12 pages
DWM CHP2 QB Solution
No ratings yet
DWM CHP2 QB Solution
9 pages
Data Warehouse Schema Explained
No ratings yet
Data Warehouse Schema Explained
6 pages
BA
No ratings yet
BA
6 pages
DWDM Notes
No ratings yet
DWDM Notes
19 pages
Datawarehouse Operations
No ratings yet
Datawarehouse Operations
18 pages
Unit I DMT
No ratings yet
Unit I DMT
74 pages
Unit 3 OLAP and OLTP
No ratings yet
Unit 3 OLAP and OLTP
64 pages
De Lab Programs
No ratings yet
De Lab Programs
32 pages
2.data Warehouse and OLAP
No ratings yet
2.data Warehouse and OLAP
14 pages
Star Schema and OLAP Techniques
100% (3)
Star Schema and OLAP Techniques
45 pages
DMDW Mid 1 Solution
No ratings yet
DMDW Mid 1 Solution
29 pages
Unit 2 Notes DWM
No ratings yet
Unit 2 Notes DWM
14 pages
Star Schema and Data Warehousing Guide
No ratings yet
Star Schema and Data Warehousing Guide
15 pages
Data Warehousing and Data Mining Dec 2023
No ratings yet
Data Warehousing and Data Mining Dec 2023
28 pages
Session-9 Final Notes PRM 45
No ratings yet
Session-9 Final Notes PRM 45
4 pages
Introduction To Datawarehousing: Duration: 45 Minutes (Approx.) Abhishek Ranjan
No ratings yet
Introduction To Datawarehousing: Duration: 45 Minutes (Approx.) Abhishek Ranjan
32 pages
What Is A Data Warehouse?
No ratings yet
What Is A Data Warehouse?
47 pages
Multidimensional Data Modeling Guide
No ratings yet
Multidimensional Data Modeling Guide
29 pages
Data Warehousing Lecture Notes
No ratings yet
Data Warehousing Lecture Notes
30 pages
Data Warehousing: Data Models and OLAP Operations: by Kishore Jaladi
No ratings yet
Data Warehousing: Data Models and OLAP Operations: by Kishore Jaladi
41 pages
OLAP Operations With Queries
No ratings yet
OLAP Operations With Queries
8 pages
R20-DMT Unit-I
No ratings yet
R20-DMT Unit-I
24 pages
Unit-1 Lecture Notes
100% (1)
Unit-1 Lecture Notes
43 pages
Unit-2 1
No ratings yet
Unit-2 1
60 pages
Data Warehouse
No ratings yet
Data Warehouse
71 pages
1
No ratings yet
1
35 pages
Operational Data Stores Data Warehouse: 8) What Is Ods Vs Datawarehouse?
No ratings yet
Operational Data Stores Data Warehouse: 8) What Is Ods Vs Datawarehouse?
15 pages
Batch B DWM Experiments
No ratings yet
Batch B DWM Experiments
90 pages
ACM - IntrotoDW-data Warehousing
No ratings yet
ACM - IntrotoDW-data Warehousing
58 pages
Unit 2 DATA WAREHOUSE AND DATA MART
No ratings yet
Unit 2 DATA WAREHOUSE AND DATA MART
17 pages
DWDM 2
No ratings yet
DWDM 2
16 pages
Assignment 4-1
100% (2)
Assignment 4-1
27 pages
DWM Chp2 Notes
No ratings yet
DWM Chp2 Notes
21 pages
Unit 2
No ratings yet
Unit 2
32 pages
Understanding Multi Dimensional Database: Prepared By: Amit Sharma Hyperion/OBIEE Trainer
No ratings yet
Understanding Multi Dimensional Database: Prepared By: Amit Sharma Hyperion/OBIEE Trainer
34 pages
OLAP Operations for Data Analysts
No ratings yet
OLAP Operations for Data Analysts
3 pages
4 - Dimensional Modeling
No ratings yet
4 - Dimensional Modeling
71 pages
Assignment - 2 DWH
No ratings yet
Assignment - 2 DWH
13 pages
What Is Data Warehouse?: Data Mining by IK Unit 2
No ratings yet
What Is Data Warehouse?: Data Mining by IK Unit 2
21 pages
Introduction To DataWarehouse and DataMining
No ratings yet
Introduction To DataWarehouse and DataMining
35 pages
DWDM Set-2
No ratings yet
DWDM Set-2
55 pages
Name: Reena Kale Te Comps Roll No: 23 DWM Experiment No: 1 Title: Designing A Data Warehouse Schema For A Case Study and Performing
No ratings yet
Name: Reena Kale Te Comps Roll No: 23 DWM Experiment No: 1 Title: Designing A Data Warehouse Schema For A Case Study and Performing
7 pages
DWDM Mid 1
No ratings yet
DWDM Mid 1
10 pages
Experiment2 E059 DWM PDF
No ratings yet
Experiment2 E059 DWM PDF
10 pages
Experiment No.02: LAB Manual Part A
No ratings yet
Experiment No.02: LAB Manual Part A
10 pages
DW Concepts
No ratings yet
DW Concepts
7 pages
Data Warehouse Lec-3
No ratings yet
Data Warehouse Lec-3
38 pages
Correct DW
No ratings yet
Correct DW
9 pages
Data Warehouse Basics for Analysts
0% (1)
Data Warehouse Basics for Analysts
14 pages
Data Warehouse Models and OLAP Operations: Enrico Franconi
No ratings yet
Data Warehouse Models and OLAP Operations: Enrico Franconi
45 pages
dw4 - Dimension1
No ratings yet
dw4 - Dimension1
75 pages
Guide To Writing An Essay in
No ratings yet
Guide To Writing An Essay in
7 pages
Week 14 Lecture 25,26
No ratings yet
Week 14 Lecture 25,26
19 pages
Lec07-B-tree in DBMS
No ratings yet
Lec07-B-tree in DBMS
25 pages
Chapter 10 Controlling
No ratings yet
Chapter 10 Controlling
13 pages
Le0-Star - Snowflake - NoSQL Database
No ratings yet
Le0-Star - Snowflake - NoSQL Database
19 pages
Organizational Design and Structure
No ratings yet
Organizational Design and Structure
18 pages
Quiz 02-B - Solution
No ratings yet
Quiz 02-B - Solution
2 pages
Answer
No ratings yet
Answer
10 pages
Filled NC Audit Report Template 45001 Ohsms Team B
No ratings yet
Filled NC Audit Report Template 45001 Ohsms Team B
5 pages
Recent Aspects of Oil and Gas Internal Pipeline Corrosion Control
No ratings yet
Recent Aspects of Oil and Gas Internal Pipeline Corrosion Control
25 pages
PDF
No ratings yet
PDF
1 page
Hook Farah Neuroscience For Educators
No ratings yet
Hook Farah Neuroscience For Educators
11 pages
Bistos BT-300 Fetal Monitor - Service Manual PDF
100% (1)
Bistos BT-300 Fetal Monitor - Service Manual PDF
37 pages
CeaseFire - Clean Agent brochure-STANDALONE
No ratings yet
CeaseFire - Clean Agent brochure-STANDALONE
1 page
Accounting Concepts & Standards Guide
No ratings yet
Accounting Concepts & Standards Guide
3 pages
Forains (1945) About A: George Balanchine
No ratings yet
Forains (1945) About A: George Balanchine
1 page
Bomag BW213DH-3 Operators and Maintenance Manual
100% (2)
Bomag BW213DH-3 Operators and Maintenance Manual
148 pages
Volvo KAD Etc Operators
100% (1)
Volvo KAD Etc Operators
104 pages
Chapter 5 - MCQ
No ratings yet
Chapter 5 - MCQ
5 pages
France Waste Management Targets 2025
No ratings yet
France Waste Management Targets 2025
57 pages
Carbohydrates
No ratings yet
Carbohydrates
23 pages
Prod Ec 1896508402
0% (1)
Prod Ec 1896508402
98 pages
Massachusetts Department of Public Health (MDPH) Postpartum (PPD) Depression Screening Tool Grid - 2015
No ratings yet
Massachusetts Department of Public Health (MDPH) Postpartum (PPD) Depression Screening Tool Grid - 2015
3 pages
Understanding Liquidity in Trading
100% (1)
Understanding Liquidity in Trading
8 pages
Dhaval Shah: 4.1 Years of IT Experience in Software Development With Knowledge in Different Phases of
No ratings yet
Dhaval Shah: 4.1 Years of IT Experience in Software Development With Knowledge in Different Phases of
4 pages
EHR Incentive Programs Overview
No ratings yet
EHR Incentive Programs Overview
37 pages
Vision CP12170 Spec
No ratings yet
Vision CP12170 Spec
2 pages
2022 Manual
No ratings yet
2022 Manual
16 pages
Appeal Letter
No ratings yet
Appeal Letter
1 page
NeurIPS ML4PS 2019 39
No ratings yet
NeurIPS ML4PS 2019 39
6 pages
SEEL-Teachers Training
No ratings yet
SEEL-Teachers Training
11 pages
Cambridge IGCSE™: German 0525/21
No ratings yet
Cambridge IGCSE™: German 0525/21
12 pages
Orthopedic Manual Therapy (2nd Edition) 2nd Edition Instant Download 2025
No ratings yet
Orthopedic Manual Therapy (2nd Edition) 2nd Edition Instant Download 2025
84 pages
River Management Strategies
No ratings yet
River Management Strategies
2 pages
Boiler Manual
No ratings yet
Boiler Manual
203 pages
Etabs Project Report
No ratings yet
Etabs Project Report
40 pages
XI-IIT - State Wide - Weekend Results - 09.03.2025
No ratings yet
XI-IIT - State Wide - Weekend Results - 09.03.2025
13 pages