Powerpoint Workshop Introduction To Deep Learning - Statistics and Data Analysis

The document provides an outline for a livestream workshop on statistics and data analysis. It covers topics like central tendency, measures of variability, visualizing data, and the normal distribution. The outline also distinguishes between descriptive and inferential statistics, and parametric versus nonparametric methods. Hands-on sessions are planned to complement the theoretical content.

Uploaded by

habifian sultan

STATISTICS AND DATA ANALYSIS
Livestream Workshop
@ Purwadhika

Sunday, 10 May 2020

Andre Nurrohman
andrenurrohman@gmail.com

Outline

• Statistics and Data Analysis

• Central Tendency

• Measures of Variability

• Visualizing Data

• Distribution (Normal Distribution)

• Hands On Session

Statistics and Data Analysis

Statistics is the science concerned with developing and studying methods for collecting, analyzing, interpreting, and presenting empirical data.

Statistics and Data Analysis
Types of Data

Statistics and Data Analysis

Statistics

Descriptive: presenting, organizing, and summarizing data.
Inferential: drawing conclusions about a population based on data observed in a sample.

Statistics and Data Analysis

Descriptive Statistics

Central Tendency: mean, median, and mode.
Measures of Variability: range, variance, standard deviation, and quartiles.
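
Statistics and Data Analysis

Population (described by a PARAMETER) -> sampling -> Sample (described by a STATISTIC)
Statistical inference: using the sample statistic to draw conclusions about the population parameter.

Parametric Methods

Nonparametric Methods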

Statistics and Data Analysis

                                  Parametric Method             Nonparametric Method
Data type                         Interval or ratio             Nominal or ordinal
Assumed distribution              Normal                        Any
Assumed variance                  Homogeneous                   Homogeneous or heterogeneous
Number of data points             >= 30                         Flexible
Data set relationship             Independent                   Any
Usual central measure             Mean                          Median
Statistical power                 Strong                        Weak

Analysis type
Independent measures, 2 groups    Independent-measures t-test   Mann-Whitney test
Independent measures, >2 groups   ANOVA                         Kruskal-Wallis test
Repeated measures, 2 conditions   Matched-pair t-test           Wilcoxon test

Statistics and Data Analysis

Data analysis is the process of systematically applying statistical and/or logical techniques to describe and illustrate, condense and recap, and evaluate data.

Figure: Epicycle of Data Analysis
Source: The Art of Data Analysis

Central Tendency

Central Tendency: Mean, Median, Mode

Central Tendency: Mean

The mean is the sum of all values divided by the number of values.

Example:
Number of children in each house on my street:
0, 2, 3, 2, 1, 0, 0, 2, 0
Hence, the mean is:
(0+2+3+2+1+0+0+2+0) / 9 = 10 / 9 ≈ 1.11
A new household with 11 children moves into my street, so the new mean is:
(0+2+3+2+1+0+0+2+0+11) / 10 = 21 / 10 = 2.1
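
The mean example can be reproduced in a few lines of Python. This is a minimal sketch using the standard library's `statistics.mean`; the variable names `children` and `children_with_new_house` are assumptions made for illustration and are not part of the workshop material.

```python
from statistics import mean

# Number of children in each house on the street (values from the slide).
children = [0, 2, 3, 2, 1, 0, 0, 2, 0]

# Mean = sum of the values divided by the number of values.
original_mean = mean(children)

# A household with 11 children moves in: one extreme value is appended.
children_with_new_house = children + [11]
new_mean = mean(children_with_new_house)

print(round(original_mean, 2))  # 1.11
print(new_mean)                 # 2.1
```

A single large value nearly doubles the mean here, which is why the mean is described as sensitive to extreme values.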

Central Tendency: Median
The median is the middle data point of the sorted data.

1. If the number of data points is odd, the median is the value exactly in the middle.

2. If the number of data points is even, the median is the average of the two middle data points.

Example:
0, 2, 3, 2, 1, 0, 0, 2, 0
Sorted:
0, 0, 0, 0, 1, 2, 2, 2, 3
Median: 1

0, 2, 3, 2, 1, 0, 0, 2, 0, 11
Sorted:
0, 0, 0, 0, 1, 2, 2, 2, 3, 11
Median: (1 + 2) / 2 = 1.5
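
The two median cases (odd and even number of data points) can be checked with a short Python sketch. It relies on the standard library's `statistics.median`; the variable names are illustrative assumptions.

```python
from statistics import median

# Number of children in each house on the street (values from the slide).
children = [0, 2, 3, 2, 1, 0, 0, 2, 0]

# Odd number of values (9): the median is the middle value of the sorted data.
sorted_children = sorted(children)
median_odd = median(children)

# Even number of values (10): the median is the average of the two middle values.
children_with_new_house = children + [11]
median_even = median(children_with_new_house)

print(sorted_children)  # [0, 0, 0, 0, 1, 2, 2, 2, 3]
print(median_odd)       # 1
print(median_even)      # 1.5
```

Adding the value 11 moves the median only from 1 to 1.5, a much smaller shift than the same value causes in the mean.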

Central Tendency: Mode
The mode is the value that occurs most frequently.

Example:
0, 2, 3, 2, 1, 0, 0, 2, 0

Count each value:

0 => 4
1 => 1
2 => 3
3 => 1

The mode is 0, because it occurs most often (4 times).

What happens to the mode if we add 11?
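
Counting each value, as on the slide, can be sketched with `collections.Counter` from the Python standard library. The variable names are assumptions for illustration; the last lines also try the slide's question about adding 11.

```python
from collections import Counter

# Number of children in each house on the street (values from the slide).
children = [0, 2, 3, 2, 1, 0, 0, 2, 0]

# Count how often each value occurs.
counts = Counter(children)
print(sorted(counts.items()))  # [(0, 4), (1, 1), (2, 3), (3, 1)]

# The mode is the value with the highest count.
mode_value, mode_count = counts.most_common(1)[0]
print(mode_value, mode_count)  # 0 4

# Adding 11 introduces a value that occurs only once.
new_mode = Counter(children + [11]).most_common(1)[0][0]
print(new_mode)  # 0
```

Because 11 occurs only once, it does not change which value is most frequent.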

Measures of Variability

Range
Variance and Standard Deviation
Quartile
Outliers
Boxplot

Measures of Variability: Range

The range is the distance between the largest value and the smallest value of the data.

The range of the data (1, 4, 5, 2, 8) is ???
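
One way to check an answer to the question above is a short Python sketch that uses only the built-in `max` and `min` functions; the variable names are illustrative assumptions.

```python
# Data set from the slide.
data = [1, 4, 5, 2, 8]

# Range = largest value minus smallest value.
data_range = max(data) - min(data)

print(data_range)  # 7
```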

Measures of Variability:
Variance and Standard Deviation
The variance is the average of the squared differences from the mean.

The standard deviation is the square root of the variance. It measures how far, on average, the data points lie from the mean.
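
The definitions of variance and standard deviation can be sketched in Python as follows. The data set (1, 4, 5, 2, 8) is borrowed from the other variability slides as an assumption, and the comparison with the sample variance (dividing by n - 1) is an addition that the slide itself does not mention.

```python
from statistics import mean, pstdev, pvariance, variance

# Data set borrowed from the range and quartile examples (an assumption).
data = [1, 4, 5, 2, 8]

# Step by step: average of the squared differences from the mean.
center = mean(data)                                    # 4
squared_differences = [(x - center) ** 2 for x in data]
manual_variance = sum(squared_differences) / len(data)

# The same quantity from the standard library (population variance).
population_variance = pvariance(data)
population_sd = pstdev(data)

# Sample variance divides by n - 1 instead of n.
sample_variance = variance(data)

print(manual_variance)          # 6.0
print(population_variance)      # 6
print(round(population_sd, 3))  # 2.449
print(sample_variance)          # 7.5
```

The slide's wording ("the average of the squared differences") corresponds to the population form, `pvariance`; which form applies in practice depends on whether the data is a whole population or a sample.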

Measures of Variability:
Quartile and Outlier
Quartiles: Q1, Q2, Q3

IQR = Q3 – Q1

Upper outliers > Q3 + 1.5 * IQR

Lower outliers < Q1 – 1.5 * IQR
Example:
(1, 4, 5, 2, 8)
Sorted:
(1, 2, 4, 5, 8)

So, Q1 = 1.5 ; Q2 = 4 ; Q3 = 6.5

IQR = 6.5 – 1.5 = 5
Upper outliers > 6.5 + 1.5 * 5 = 14
Lower outliers < 1.5 – 1.5 * 5 = -6
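
The quartile and outlier rules can be sketched in Python with `statistics.quantiles`. The `method="exclusive"` setting is chosen here because it reproduces the slide's quartiles for this data set; other quartile conventions exist and can give different values, so treat this as one possible convention rather than the workshop's stated method. The names `lower_fence` and `upper_fence` are assumptions for illustration.

```python
from statistics import quantiles

# Data set from the slide (quantiles() sorts it internally).
data = [1, 4, 5, 2, 8]

# Cut the data into 4 equal-probability groups: returns Q1, Q2, Q3.
q1, q2, q3 = quantiles(data, n=4, method="exclusive")

# Interquartile range and the 1.5 * IQR outlier fences.
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr  # lower fence is measured from Q1
upper_fence = q3 + 1.5 * iqr  # upper fence is measured from Q3

# Values outside the fences are flagged as outliers.
outliers = [x for x in data if x < lower_fence or x > upper_fence]

print(q1, q2, q3)                # 1.5 4.0 6.5
print(iqr)                       # 5.0
print(lower_fence, upper_fence)  # -6.0 14.0
print(outliers)                  # []
```

None of the five values falls outside the fences, so this data set has no outliers under the 1.5 * IQR rule.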

Measures of Variability: Boxplot

Visualizing Data

Frequency Table
Proportion Table
Chart
Histograms

Visualizing Data: Tables

Frequency Table and Proportion Table

Visualizing Data: Charts

Visualizing Data: Histogram

Skewness and Kurtosis

Normal Distribution
• A probability distribution is a list of all the possible outcomes of a random variable along with their corresponding probability values.

Example: the probability distribution of a fair 6-sided die.

• A probability distribution function is a statistical function that describes all the possible values a random variable can take within a given range, together with their likelihoods.

• The most common example of a probability distribution is the normal distribution.
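
The fair-die example can be written out as a small Python sketch: a mapping from each outcome to its probability. Exact fractions from the standard library's `fractions` module are used so the probabilities sum to exactly 1; the name `die_distribution` is an assumption for illustration.

```python
from fractions import Fraction

# Each face of a fair 6-sided die is equally likely: probability 1/6.
die_distribution = {face: Fraction(1, 6) for face in range(1, 7)}

# List every possible outcome with its probability.
for face, probability in die_distribution.items():
    print(face, probability)  # e.g. "1 1/6"

# The probabilities of all possible outcomes add up to 1.
total_probability = sum(die_distribution.values())
print(total_probability)  # 1
```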

Normal Distribution
Properties of the normal distribution:

1. The mean, mode, and median are all equal.

2. The curve is symmetric about the center (i.e. around the mean).

3. Exactly half of the values are to the left of the center and exactly half of the values are to the right.

4. The total area under the curve is 1.
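
The four properties can be checked numerically with `statistics.NormalDist` from the Python standard library. This is a sketch for a standard normal distribution (mean 0, standard deviation 1), which is an assumed choice of parameters; the area is approximated with a simple midpoint sum over [-8, 8], so it is close to 1 rather than exactly 1.

```python
from math import isclose
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1 (assumed parameters).
standard_normal = NormalDist(mu=0.0, sigma=1.0)

# 1. Mean, median, and mode are all equal.
center_values = (standard_normal.mean, standard_normal.median, standard_normal.mode)

# 2. The curve is symmetric around the mean: equal heights at -d and +d.
symmetric = all(
    isclose(standard_normal.pdf(-d), standard_normal.pdf(d)) for d in (0.5, 1.0, 2.0)
)

# 3. Half of the probability lies to the left of the mean.
left_half = standard_normal.cdf(standard_normal.mean)

# 4. Total area under the curve, approximated by a midpoint sum over [-8, 8].
step = 0.001
area = sum(standard_normal.pdf(-8 + (i + 0.5) * step) for i in range(16000)) * step

print(center_values)   # (0.0, 0.0, 0.0)
print(symmetric)       # True
print(left_half)       # 0.5
print(round(area, 6))  # 1.0
```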

THANK YOU
