Task Explanation202

Task Explanation

Uploaded by

Mark Logmao

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

77 views3 pages

Task Explanation202

Task Explanation

Uploaded by

Mark Logmao

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

PROBLEMS:

I. Shipments of Household Appliances: Line Graphs.

The file ApplianceShipments.xls contains the series of quarterly shipments (in million $) of U.S. household
appliances between 1985 and 1989 (data courtesy of Ken Black).
a. Create a well-formatted time plot of the data using Excel.

Script:
“This is a well-formatted plot of the data using Excel. Sa loob ng Excel file, may dataset
between 1985 to 1989 na nagcocontain ng series of quarterly shipments in million dollars of US
household appliance. From the first quarter of 1985, it has a value of 4,009 which increased to
4,595 for the year 1988 but then decreased to 4,245 by the year 1989. The second quarter started
off from 4,321 which increased up to 4,806 after 2 years and decreased by 7 data points then
increased by another 11 for the next 2 years. The third quarter started from 4,224 by the year 1985
then increased 4,657 for the year 1986 but decreased for the following next 2 years and increased
for the last year. For the fourth quarter, it started from 3,944 then increased to 4,485, decreased to
4,258 then increased again to 4,533.”

b. Does there appear to be a quarterly pattern? For a closer view of the patterns, zoom in to the
range of 3500–5000 on the y axis.

Script:
“If we look at the plot closer from the range of 3,500 – 5,000, we can see the number of
shipments for each quarter over time and kada quarter, nag increase yung number. Tignan nalang
from Quarter 1, then Quarter 2, pero pagdating ng Quarter 3, pag increase after a year,
nagdecrease siya for the next 2 years sabay taas ulit for the last year. Sa Quarter 4 naman, nag
increade from 3,944 to 4,485 then bumaba ng 4,258 saka tumaas ulit ng 4,533.”

c. Create four separate lines for Q1, Q2, Q3, and Q4, using Excel. In each, plot a line graph. In
Excel, order the data by Q1, Q2, Q3, Q4 (alphabetical sorting will work), and plot them as separate
series on the line graph. Zoom in to the range of 3500–5000 on the y axis. Does there appear to be
a difference between quarters?

Script:
“Kada quarter, iba’t iba yung fluctuations niya kada year depending sa number of
shipments. Dahil nakaseparate yung lines, makikita yung differences ng pag-iiba iba ng number of
shipments every quarter kaya maeeasily determine kung ano rin yung form of pattern nila
depending sa year.”

d. Using Excel, create a line graph of the series at a yearly aggregated level (i.e., the total
shipments in each year).

Script:
“From 1985, we have a total shipments of 16,498 which is nagcontinue to increase for the
next two years up to 1987, having 18,335. Pagdating ng 1988, bumaba yung value to 18,069 then
nag increase ulit for the following last year na 18,263.”
e. Re-create the above plots using a different forms of visualization. Make sure to enter the quarter
information in a format that is recognized by the software as a date.

Script:
“Makikita natin yung difference of use ng Time Plot in terms of Scattered Plot and
Clustered Column. Ang difference nila in terms of purpose ay yung left side mas shinoshow niya
yung relationship between two variables over time while yung right naman ay nagcocompare ng
multiple categories of data over time. Then yung representation ng data sa left ay represented by a
dot while sa right ay nakaseparate column. Ginagamit yung scatter plot para ma-identify yung
trends and patterns in the relationship between two variables while yung clustered column naman
is para macompare yung performance ng different categories of data over time.”
f. Compare the two processes of generating the line graphs in terms of the effort as well as the
quality of the resulting plots. What are the advantages of each?

A well-formatted time plot is a line graph that shows data over time. The x-axis of the
graph represents time, and the y-axis represents the data values. A separate series on the line
graph is a line that represents a different set of data values. There are a number of advantages
to using a separate series on a line graph. First, it allows you to compare two or more different
data sets on the same graph. This can be helpful for identifying trends and relationships
between the data sets. Second, using a separate series can make it easier to see the details of
each data set. If you were to plot all of the data sets on the same line, it would be difficult to
see the individual trends and relationships. Finally, using a separate series can make your
graph more visually appealing. By using different colors and line styles for each series, you can
make it easier for viewers to distinguish between the different data sets.

Script:
*ayan na mismo yung explanation hehe ^^*

II. Sales of Riding Mowers: Scatterplots. A company that manufactures riding mowers wants to identify the best
sales prospects for an intensive sales campaign. In particular, the manufacturer is interested in classifying
households as prospective owners or non-owners on the basis of Income (in $1000s) and Lot Size (in 1000 ft2).
The marketing expert looked at a random sample of 24 households, included in the file RidingMowers.xls.

a. Using Excel, create a scatterplot of Lot Size vs. Income, color coded by the outcome variable
owner/non-owner. Make sure to obtain a well-formatted plot (remove excessive background and
gridlines; create legible labels and a legend, etc.). The result should be similar to Figure 1.

Script:
“Ang kagandahan sa paggamit ng scatterplot is mapapakita yung relationship between
two variables. Maganda rin siya pangvisualize how two variables are related to each other. Another
purpose is that they can show both linear and non-linear relationships between variables, which is
important because maraming relationships in the real world are non-linear. Then they can show
outliers. Magagamit yung scatterplot sap ag-identify ng outliers in your data. Outliers are data
points that are significantly different from the rest of the data; it can be caused by errors in data
collection or entry, or they may represent genuine phenomena. Then lastly, madali siyang iinterpret,
even for people who are not familiar with statistics which makes them a good choice for
communicating data to a general audience.”

b. Create the same plot, this time using any form of visualization.

Script:
“May two graphs above, which is nagpapakita ng sum of income and sum of lot size, both
by ownership. Sa naunang graph, makikita na hindi masyado nagkakalayo yung total number of
non-owner na may 688.8 compared to owner na may 953.7. Meanwhile sa baba, halos ganon din
na hindi masyado nagkakalayo mukha lang malapit since mas nagfocus sa smaller range.”

c. Compare the two processes of generating the plot in terms of the effort as well as the quality

In terms of effort, creating a clustered column plot is generally easier than creating a
scatter plot. This is because a clustered column plot does not require to specify the color and
size of the data points. Creating a scatter plot requires more effort. This is because it needs to
specify the color and size of the data points. It may also need to add a trendline to the plot.
In terms of quality, both clustered column plots and scatter plots can be used to create
informative and visually appealing plots. However, scatter plots are generally considered to be
more informative, especially when trying to identify trends in the data. This is because scatter
plots allow us to see the relationship between two variables without having to group the data
into categories. A clustered column plot would not allow to see this trend as clearly, because
the data would be grouped into categories.
Therefore, if it is trying to identify trends in data, it is generally better to use a scatter plot.
However, if it is simply trying to compare two different groups of data, a clustered column plot
may be sufficient.

Script:
*ayan na mismo yung explanation hehe (2) ^^*

III. Use the data for the breakfast cereals example to explore and summarize the data as follows:

a) Calculate the following summary statistics: mean, median, min, and max for each of the continuous
variables, and the count for each categorical variable. Which, if any of the variables is missing values?

Script:
“Yung statistical table na ito ay nagsusummarize ng mean, median, minimum, maximum,
and count ng kada continuous variable na mayroon sa dataset. Makikita na yung Sodium has the
highest number ng mean, having 160.68 while yung Cups per Serving naman yung may lowest
value, having a total mean of 0.63. In terms of median, Sodium pa rin ang una having 180 while
yung Cups per Serving ay 0.75. Carbs, Sugars, Potassium, and Cups of Serving naman ay pare-
pareho ng number of minimum which is -1. Then Potassium has the highest number for maximum,
having a value of 330. Makikita rin ay no variables are missing values according sa table na na-
generate.”

b) Use any charts or graphs to compare the calories in hot vs. cold cereals. What does the charts/graphs
show?

Script:
“Ineexplain ng clustered column yung sum of calories by type (hot or cold) of food. Cold
food has the most calories, having a value of 7,510, while hot food naman yung least, having a
value of 300. The graph has two bars: one for cold food and one for hot food. Visually speaking,
makikita na yung cold food bar is taller than the hot food bar, which indicates that cold food has
more calories than hot food.”

c) Use any charts or graphs to plot a consumer rating as a function of the shelf height (the variable shelf).
If we were to predict consumer rating from shelf height, does it appear that we need to keep all three
categories of shelf height?

Script
“If we were to predict yung consumer rating from shelf height, it does appear that we would
need to keep all three categories of shelf height. Bakit? Kasi yung rating distribution ay
magkakaiba kada shelf height. Yung may pinakamataas na shelf height ay yung may
pinakamataas na ratings, while yung pinakamababa na shelf height ay nag iindicate ng
pinakamababa na ratings. Kapag isa or dalawang shelf heights lang ang icoconsider, hindi
accurately maprepredict yung consumer rating kasi may iba’t ibang preferences ang consumers.”

Technical Assessment 1
No ratings yet
Technical Assessment 1
3 pages
Home Assignment CO3
No ratings yet
Home Assignment CO3
2 pages
Lab Guide 2
No ratings yet
Lab Guide 2
3 pages
Stata
No ratings yet
Stata
33 pages
Classs12 Ai Practical Graph
No ratings yet
Classs12 Ai Practical Graph
4 pages
Assignment 3
No ratings yet
Assignment 3
10 pages
UNIT - 1 EDA Continuation
No ratings yet
UNIT - 1 EDA Continuation
113 pages
Time Series Forecasting Project (Shoe Sales)
No ratings yet
Time Series Forecasting Project (Shoe Sales)
26 pages
Data Visualization Lab Manual - Final
No ratings yet
Data Visualization Lab Manual - Final
14 pages
Graph Creation Topic6 AG 410
No ratings yet
Graph Creation Topic6 AG 410
4 pages
Ass 2
No ratings yet
Ass 2
13 pages
Create A Scatterplot
No ratings yet
Create A Scatterplot
15 pages
Statistics Homework Guide
No ratings yet
Statistics Homework Guide
3 pages
Case Study
50% (2)
Case Study
8 pages
1714514135
No ratings yet
1714514135
12 pages
Assignment Business Analytics B Biswas
No ratings yet
Assignment Business Analytics B Biswas
7 pages
Chapt-3 Data Visualization
No ratings yet
Chapt-3 Data Visualization
73 pages
Python Pyplot Assignment No 4
No ratings yet
Python Pyplot Assignment No 4
5 pages
Matplotlib
No ratings yet
Matplotlib
8 pages
Tutorial 7
No ratings yet
Tutorial 7
1 page
Subhadeep Seal TSF-Coded Project Rose Wine Business Report
No ratings yet
Subhadeep Seal TSF-Coded Project Rose Wine Business Report
38 pages
DAUP Exam Notes - 2in1
No ratings yet
DAUP Exam Notes - 2in1
35 pages
121a1086 - Bda - Assignment - No.2
No ratings yet
121a1086 - Bda - Assignment - No.2
31 pages
Data Science Unit 2-11-08 2023
No ratings yet
Data Science Unit 2-11-08 2023
78 pages
Data Visualisation Lab Digital Assignment 2: Name: Samar Abbas Naqvi Registration Number: 19BCE0456
No ratings yet
Data Visualisation Lab Digital Assignment 2: Name: Samar Abbas Naqvi Registration Number: 19BCE0456
7 pages
Ch-4 Plotting Data Using Matplotlib
No ratings yet
Ch-4 Plotting Data Using Matplotlib
32 pages
BDA Important Questions
No ratings yet
BDA Important Questions
3 pages
Unit V
No ratings yet
Unit V
24 pages
UNIT4
No ratings yet
UNIT4
8 pages
TD5Numpy Pandas and Matplotlib
No ratings yet
TD5Numpy Pandas and Matplotlib
5 pages
Example of Empirical Exercise in Another Context v7-2
No ratings yet
Example of Empirical Exercise in Another Context v7-2
11 pages
Chapter 3 - Visualizing Data
No ratings yet
Chapter 3 - Visualizing Data
70 pages
Prac - 6
No ratings yet
Prac - 6
7 pages
Be A 65 Ads Exp 2
No ratings yet
Be A 65 Ads Exp 2
10 pages
Data Visualization
No ratings yet
Data Visualization
14 pages
Unit 4 Actual Notes BA
No ratings yet
Unit 4 Actual Notes BA
24 pages
Eda Lab Manual
No ratings yet
Eda Lab Manual
34 pages
Statistics Test - Docxfinal
No ratings yet
Statistics Test - Docxfinal
3 pages
TSF - Rose Data
No ratings yet
TSF - Rose Data
31 pages
Effective Data Visualization Techniques
No ratings yet
Effective Data Visualization Techniques
12 pages
Lecture09 - Data Visualization 2
No ratings yet
Lecture09 - Data Visualization 2
73 pages
9.statistic Lab Ex 9
No ratings yet
9.statistic Lab Ex 9
6 pages
Business Data Visualization Guide
No ratings yet
Business Data Visualization Guide
81 pages
DLP Grade 12
No ratings yet
DLP Grade 12
9 pages
Homework 1 OM690
No ratings yet
Homework 1 OM690
5 pages
Exercise - 6: DS203-2024-S1 Problem1:: Statistics
No ratings yet
Exercise - 6: DS203-2024-S1 Problem1:: Statistics
10 pages
IP Practical File
No ratings yet
IP Practical File
23 pages
Assignment 1
No ratings yet
Assignment 1
7 pages
Data+Visualization+in+Python
No ratings yet
Data+Visualization+in+Python
17 pages
Data Analysis Week 8 Lecture Note
No ratings yet
Data Analysis Week 8 Lecture Note
11 pages
Crash Course Data Science
No ratings yet
Crash Course Data Science
7 pages
Ia2 Dev
No ratings yet
Ia2 Dev
33 pages
Excel Project
No ratings yet
Excel Project
8 pages
Lecture2 Visualizing Data
No ratings yet
Lecture2 Visualizing Data
38 pages
Lec 19
No ratings yet
Lec 19
14 pages
Data Visualization Essentials
No ratings yet
Data Visualization Essentials
32 pages
Data Visualization for Analysts
No ratings yet
Data Visualization for Analysts
26 pages
Bradken Fixed Plant Brochure
No ratings yet
Bradken Fixed Plant Brochure
29 pages
Grievances
No ratings yet
Grievances
3 pages
Unit - 4 RES Notes
No ratings yet
Unit - 4 RES Notes
85 pages
Supplier Category Guidance (SP-FRM-005A)
No ratings yet
Supplier Category Guidance (SP-FRM-005A)
19 pages
Auditing Theory Answer Key 6
No ratings yet
Auditing Theory Answer Key 6
3 pages
Describing Company Structure
No ratings yet
Describing Company Structure
4 pages
You Can Copy and Paste Entires From B9 To B25 For Other Goals
No ratings yet
You Can Copy and Paste Entires From B9 To B25 For Other Goals
7 pages
Fiat WCM - Machine Ledger Example
No ratings yet
Fiat WCM - Machine Ledger Example
1 page
FrodoKEM: Post-Quantum Cryptography
No ratings yet
FrodoKEM: Post-Quantum Cryptography
46 pages
Activated Carbon for Industry Use
No ratings yet
Activated Carbon for Industry Use
1 page
LTE1434
No ratings yet
LTE1434
14 pages
Static Electricity Risks in Aviation
No ratings yet
Static Electricity Risks in Aviation
6 pages
Multinational Management 6th Edition Cullen Parboteeah Test Bank
100% (64)
Multinational Management 6th Edition Cullen Parboteeah Test Bank
11 pages
Indian Technology Startup Funding Report 2017
100% (1)
Indian Technology Startup Funding Report 2017
141 pages
NATO Legal Deskbook (2010)
100% (1)
NATO Legal Deskbook (2010)
348 pages
Truck Crane: Zoomlion Heavy Industry Science & Technology Co.,Ltd
100% (1)
Truck Crane: Zoomlion Heavy Industry Science & Technology Co.,Ltd
4 pages
ZS Business Operations Consultant Role
No ratings yet
ZS Business Operations Consultant Role
3 pages
Parts Catalog For ZW225 6
No ratings yet
Parts Catalog For ZW225 6
551 pages
The Performance Criteria Matrix For Investment Banking
No ratings yet
The Performance Criteria Matrix For Investment Banking
5 pages
Valvular Disorders Questions
No ratings yet
Valvular Disorders Questions
7 pages
Year/Sem: III/VI Date: Time:3 Hrs Max. Marks:100
No ratings yet
Year/Sem: III/VI Date: Time:3 Hrs Max. Marks:100
2 pages
Housekeeping NC II (COC4)
No ratings yet
Housekeeping NC II (COC4)
2 pages
Dxdiag
No ratings yet
Dxdiag
34 pages
Lecture 24
No ratings yet
Lecture 24
25 pages
Medicine Donation Web Portal Guide
No ratings yet
Medicine Donation Web Portal Guide
52 pages
Setalux 1182 SS-55
No ratings yet
Setalux 1182 SS-55
2 pages
Microsoft Power BI For Dummies
No ratings yet
Microsoft Power BI For Dummies
1 page
AD9914
No ratings yet
AD9914
48 pages
Linux Commands
No ratings yet
Linux Commands
6 pages
Indian Railways Ticket Details
No ratings yet
Indian Railways Ticket Details
2 pages

Task Explanation202

Uploaded by

Task Explanation202

Uploaded by

PROBLEMS:

I. Shipments of Household Appliances: Line Graphs.

You might also like