0% found this document useful (0 votes)

29 views21 pages

Ch4 Machine Learning

Machine Learning is the process of programming computers to learn from data, improving performance on tasks through experience. It is categorized into supervised, unsupervised, semi-supervised, and reinforcement learning, with applications including classification, regression, clustering, and anomaly detection. Different algorithms are used for each type, and systems can learn in batch or online modes depending on the data flow and resource constraints.

Uploaded by

pereirajoshnatba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views21 pages

Ch4 Machine Learning

Uploaded by

pereirajoshnatba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 21

MACHINE

LEARNING
WITH
PYTHON
WHAT IS MACHINE LEARNING?

• Machine Learning is the science (and art) of programming

computers so they can learn from data.
• And a more engineering-oriented one:
• A computer program is said to learn from experience E with respect
to some task T and some performance measure P, if its performance
on T, as measured by P, improves with experience E.
• —Tom Mitchell, 1997
EXAMPLE

• For example, your spam filter is a Machine Learning program that can learn
to flag
spam given examples of spam emails (e.g., flagged by users) and examples of
regular (nonspam, also called “ham”) emails.
• The examples that the system uses to learn are called the training set. Each
training example is called a training instance (or sample).
• In this case, the task T is to flag spam for new emails, the experience E is
the training data, and the performance measure P needs to be defined;
• for example, you can use the ratio of correctly classified emails. This particular
performance measure is called accuracy and it is often used in classification
tasks.
TYPES OF MACHINE LEARNING SYSTEMS

• There are so many different types of Machine Learning systems that it is useful
toclassify them in broad categories based on:
1. Whether or not they are trained with human supervision (supervised,
unsupervised,semisupervised, and Reinforcement Learning)

2. • Whether or not they can learn incrementally on the fly (online versus batch
learning)

3. Whether they work by simply comparing new data points to known data points, or
instead detect patterns in the training data and build a predictive model, much like
scientists do (instance-based versus model-based learning)
SUPERVISED/UNSUPERVISED LEARNING

• Machine Learning systems can be classified according to the amount and

type of supervision they get during training.
• There are four major categories:
1. Supervised learning,
2. Unsupervised learning,
3. Semi-supervised learning, and
4. Reinforcement Learning.
SUPERVISED LEARNING

• In supervised learning, the training data you feed to the algorithm

includes the desired solutions, called labels.
CLASSIFICATION

• A typical supervised learning task is

classification.
• The spam filter is a good example of
this: it is trained with many example
emails along with their class (spam
or ham),and it must learn how to
classify new emails.

A labeled training set for supervised

learning (e.g., spam classification)
REGRESSION
• Another supervised learning task is
regression
• A typical task is to predict a target
numeric value, such as the price of
a car, given a set of features
(mileage, age, brand, etc.) called
predictors. This sort of task is
called regression To train the
system, you need to give it many
examples of cars, including both
their predictors and their labels (i.e.,
their prices).
IMPORTANT SUPERVISED LEARNING
ALGORITHMS
1. k-Nearest Neighbors
2. Linear Regression
3. Logistic Regression
4. Support Vector Machines (SVMs)
5. Decision Trees and Random Forests
6. Neural networks2
UNSUPERVISED LEARNING

• In unsupervised learning, as you An unlabeled training set for

might guess, the training data is unsupervised learning
unlabeled.
• The system tries to learn without a
teacher.
MOST IMPORTANT UNSUPERVISED
LEARNING ALGORITHMS
• Clustering
1. k-Means
2. Hierarchical Cluster Analysis (HCA)
3. Expectation Maximization

• Visualization and dimensionality reduction

1. Principal Component Analysis (PCA)
2. Kernel PCA
3. Locally-Linear Embedding (LLE)
4. t-distributed Stochastic Neighbor Embedding (t-SNE)

• Association rule learning

5. Apriori
6. Eclat
CLUSTERING
• For example, say you have a lot of data about your
blog’s visitors.
• You may want to run a clustering algorithm to try
to detect groups of similar visitors
• At no point do you tell the algorithm which group a
visitor belongs to: it finds those connections
without your help.
• For example, it might notice that 40% of your
visitors are males who love comic books and
generally read your blog in the evening, while 20%
are young sci-fi lovers who visit during the
weekends, and so on.
• If you use a hierarchical clustering algorithm, it
may also subdivide each group into smaller
groups. This may help you target your posts for
each group.
VISUALIZATION
• Visualization algorithms are also good
examples of unsupervised learning
algorithms
• you feed them a lot of complex and
unlabeled data, and they output a 2D or
3D representation of your data that can
easily be plotted.
• These algorithms try to preserve as
much structure as they can (e.g., trying
to keep separate clusters in the input
space from overlapping in the
visualization), so you can understand
how the data is organized and perhaps
identify unsuspected patterns. Example of a t-SNE visualization highlighting
semantic clusters3
DIMENSIONALITY REDUCTION

• A related task is dimensionality reduction, in which the goal is to simplify the

data
without losing too much information.
One way to do this is to merge several correlated features into one.
For example, a car’s mileage may be very correlated with its age, so the
dimensionality reduction algorithm will merge them into one feature that
represents the car’s wear and tear. This is called feature extraction.
ANOMALY DETECTION

• Another important unsupervised task is

anomaly detection
• for example: detecting unusual credit card
transactions to prevent fraud, catching
manufacturing defects, or automatically
removing outliers from a dataset before
feeding it to another learning algorithm.
• The system is trained with normal
instances, and when it sees a new instance
it can tell whether it looks like a normal
one or whether it is likely an anomaly.
ASSOCIATION RULE LEARNING

• Another common unsupervised task is association rule learning, in which the

goal is to dig into large amounts of data and discover interesting relations
between attributes.
• For example, suppose you own a supermarket.
• Running an association rule on your sales logs may reveal that people who
purchase barbecue sauce and potato chips also tend to buy steak.
• Thus, you may want to place these items close to each other.
SEMISUPERVISED LEARNING

• Some algorithms can deal with partially labeled

training data, usually a lot of unlabeled data and a
little bit of labeled data. This is called
semisupervised learning
• Some photo-hosting services, such as Google
Photos, are good examples of this.
• Once you upload all your family photos to the
service, it automatically recognizes that the same
person A shows up in photos 1, 5, and 11, while
another person B shows up in photos 2, 5, and 7.
This is the unsupervised part of the algorithm
(clustering).
• Now all the system needs is for you to tell it who
these people are. Just one label per person,4 and it
is able to name everyone in every photo, which is
useful for searching photos.
REINFORCEMENT LEARNING
• The learning system, called an
agent in this context, can observe
the environment, select and
perform actions, and get rewards
in return (or penalties in the form
of negative rewards).
• It must then learn by itself what is
the best strategy, called a policy, to
get the most reward over time. A
policy defines what action the agent
should choose when it is in a given
situation.
BATCH AND ONLINE LEARNING
• The system is incapable of learning incrementally: it must be trained using all
the available data. This will generally take a lot of time and computing
resources, so it is typically done offline.
• First the system is trained, and then it is launched into production and runs
without learning anymore; it just applies what it has learned. This is called
offline learning.
• If you want a batch learning system to know about new data (such as a new
type of spam), you need to train a new version of the system from scratch on
the full dataset (not just the new data, but also the old data), then stop the
old system and replace it with the new one.
ONLINE LEARNING

• In online learning, you train the system incrementally by feeding it data instances
sequentially, either individually or by small groups called mini-batches.
• Each learning step is fast and cheap, so the system can learn about new data on
the fly.
• Online learning is great for systems that receive data as a continuous flow (e.g.,
stock prices) and need to adapt to change rapidly or autonomously.
• It is also a good option if you have limited computing resources: once an online
learning system has learned about new data instances, it does not need them
anymore, so you can discard them (unless you want to be able to roll back to a
previous state and “replay” the data). This can save a huge amount of space.
INSTANCE-BASED VERSUS MODEL-BASED
LEARNING

• One more way to categorize Machine Learning systems is by how they

generalize.
• Most Machine Learning tasks are about making predictions.
• This means that given a number of training examples, the system needs to
be able to generalize to examples it has never seen before.

ML L1 PDF
No ratings yet
ML L1 PDF
43 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
6 pages
Machine Learning A Basic Approach
No ratings yet
Machine Learning A Basic Approach
9 pages
Session 3 Types of Machine Learning
No ratings yet
Session 3 Types of Machine Learning
22 pages
Machine Learning Basics & Types
No ratings yet
Machine Learning Basics & Types
56 pages
Unit I
No ratings yet
Unit I
69 pages
Machine Learning - Data
No ratings yet
Machine Learning - Data
11 pages
The Machine Learning Landscape
No ratings yet
The Machine Learning Landscape
25 pages
My Hands-On ML Notebook
No ratings yet
My Hands-On ML Notebook
5 pages
Machine Learning Essentials
No ratings yet
Machine Learning Essentials
58 pages
Unit 1
No ratings yet
Unit 1
66 pages
Module 1 Notes
No ratings yet
Module 1 Notes
38 pages
AI Chapter 5
No ratings yet
AI Chapter 5
31 pages
2-Capacity, Underfitting, overfitting-15-Jul-2020Material - I - 15-Jul-2020 - ML - Fundamentals
No ratings yet
2-Capacity, Underfitting, overfitting-15-Jul-2020Material - I - 15-Jul-2020 - ML - Fundamentals
35 pages
AIML
No ratings yet
AIML
26 pages
1 ML Landscape, ML Categories
No ratings yet
1 ML Landscape, ML Categories
3 pages
Python UNIT-5
100% (1)
Python UNIT-5
67 pages
Intro to Machine Learning Basics
100% (3)
Intro to Machine Learning Basics
24 pages
Unit 1 Intro
No ratings yet
Unit 1 Intro
41 pages
Lecture 1 Machine Learning
No ratings yet
Lecture 1 Machine Learning
23 pages
Module1 ML
No ratings yet
Module1 ML
114 pages
@vtucode - in 21AI63 Module 1 AI&ML 2021 Scheme
No ratings yet
@vtucode - in 21AI63 Module 1 AI&ML 2021 Scheme
38 pages
ML Study
No ratings yet
ML Study
9 pages
Module 1
No ratings yet
Module 1
47 pages
Machine Learning Basics for Beginners
No ratings yet
Machine Learning Basics for Beginners
122 pages
Business Analytics For Decision Making Machine Learning 4-6
No ratings yet
Business Analytics For Decision Making Machine Learning 4-6
16 pages
Unit5 ML Introduction
No ratings yet
Unit5 ML Introduction
32 pages
Machine Learning Concepts Guide
No ratings yet
Machine Learning Concepts Guide
122 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
21 pages
Introduction To ML
No ratings yet
Introduction To ML
46 pages
FAM Unit5
No ratings yet
FAM Unit5
47 pages
Intro To Machine Learning
No ratings yet
Intro To Machine Learning
25 pages
LKSK ML typesToStudents
No ratings yet
LKSK ML typesToStudents
18 pages
AI Chapter 6
No ratings yet
AI Chapter 6
103 pages
5 Le
No ratings yet
5 Le
36 pages
UNIT 1ML Removed Removed
No ratings yet
UNIT 1ML Removed Removed
123 pages
ML Assignment 1
No ratings yet
ML Assignment 1
12 pages
Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
No ratings yet
Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
114 pages
01 - ML - Introduction
No ratings yet
01 - ML - Introduction
65 pages
Unit 1
No ratings yet
Unit 1
24 pages
Chapter 1
No ratings yet
Chapter 1
27 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
16 pages
Ai Unit 4
No ratings yet
Ai Unit 4
34 pages
FDS Assignment
No ratings yet
FDS Assignment
76 pages
Unit-5 Machine Learning
No ratings yet
Unit-5 Machine Learning
25 pages
Introduction 1175
No ratings yet
Introduction 1175
58 pages
Module 1
No ratings yet
Module 1
34 pages
Introduction To Machine Learning Lecture1 14july25
No ratings yet
Introduction To Machine Learning Lecture1 14july25
44 pages
Module IV - Machine Learning
No ratings yet
Module IV - Machine Learning
53 pages
1.machine Learning Basics
No ratings yet
1.machine Learning Basics
74 pages
Introduction To Machine Learing
No ratings yet
Introduction To Machine Learing
4 pages
Machine Learning-Lecture 01
No ratings yet
Machine Learning-Lecture 01
28 pages
Ml-Unit 1
No ratings yet
Ml-Unit 1
53 pages
Unit 1
No ratings yet
Unit 1
19 pages
AI Unit4 Learning Dd83e0ee 7d19 48c7 Bc5d B39decf3b0fc
No ratings yet
AI Unit4 Learning Dd83e0ee 7d19 48c7 Bc5d B39decf3b0fc
19 pages
Machine Learning for Beginners
No ratings yet
Machine Learning for Beginners
27 pages
Unit3-Important Topics Related To Neural Network
No ratings yet
Unit3-Important Topics Related To Neural Network
10 pages
ML Chapter 1
No ratings yet
ML Chapter 1
37 pages
Semi-: Supervised Learning
No ratings yet
Semi-: Supervised Learning
40 pages
Digital Twin Based Intrusion Detection For ICS
No ratings yet
Digital Twin Based Intrusion Detection For ICS
7 pages
Wikicontradiction: Detecting Self-Contradiction Articles On Wikipedia
No ratings yet
Wikicontradiction: Detecting Self-Contradiction Articles On Wikipedia
10 pages
Geospatial-Temporal Analysis Andclassification of Criminal Data in Manila
No ratings yet
Geospatial-Temporal Analysis Andclassification of Criminal Data in Manila
6 pages
Chapter 1
No ratings yet
Chapter 1
24 pages
Diabetes Dataset Analysis & Prep
No ratings yet
Diabetes Dataset Analysis & Prep
11 pages
IIT Kharagpur AI4ICPS Program Schedule
No ratings yet
IIT Kharagpur AI4ICPS Program Schedule
2 pages
An EEG-based Machine Learning Framework For Depression Detection Using Effective Connectivity Analysis
No ratings yet
An EEG-based Machine Learning Framework For Depression Detection Using Effective Connectivity Analysis
20 pages
Bioinformatics 28 7 991
No ratings yet
Bioinformatics 28 7 991
10 pages
Data Mining for Business Decisions
100% (1)
Data Mining for Business Decisions
85 pages
Machine Learning Interview Q&A
100% (1)
Machine Learning Interview Q&A
83 pages
Data Mining Lecture Notes-1: Bsc. (H) Computer Science: Vi Semester Teacher: Ms. Sonal Linda
No ratings yet
Data Mining Lecture Notes-1: Bsc. (H) Computer Science: Vi Semester Teacher: Ms. Sonal Linda
40 pages
Moisen and Frescino. 2002 Comparing Modeling Techniques To Predict Forest Characteristics
No ratings yet
Moisen and Frescino. 2002 Comparing Modeling Techniques To Predict Forest Characteristics
17 pages
Bengal College of Engineering and Technology
No ratings yet
Bengal College of Engineering and Technology
12 pages
Machine Learning 2M&10M Qpaper
No ratings yet
Machine Learning 2M&10M Qpaper
3 pages
On Posture As A Modality For Expressing and Recognizing Emotions
No ratings yet
On Posture As A Modality For Expressing and Recognizing Emotions
7 pages
(PDF Download) Remote Sensing Digital Image Analysis Sixth Edition John Alan Richards Fulll Chapter
100% (2)
(PDF Download) Remote Sensing Digital Image Analysis Sixth Edition John Alan Richards Fulll Chapter
64 pages
Computational Intelligence To Aid Text F
No ratings yet
Computational Intelligence To Aid Text F
14 pages
Notes On Introduction To Deep Learning
No ratings yet
Notes On Introduction To Deep Learning
19 pages
AU680 AU480 Instrument Online Specification Jan1-2011v9
100% (3)
AU680 AU480 Instrument Online Specification Jan1-2011v9
55 pages
(Ebook PDF) Spreadsheet Modeling and Decision Analysis: A Practical Introduction To Business Analytics 7th Edition Instant Download
100% (1)
(Ebook PDF) Spreadsheet Modeling and Decision Analysis: A Practical Introduction To Business Analytics 7th Edition Instant Download
58 pages
Perceptron Neural Network Program
No ratings yet
Perceptron Neural Network Program
3 pages
Clustering Analysis (Unsupervised)
No ratings yet
Clustering Analysis (Unsupervised)
6 pages
Ijcset V1
No ratings yet
Ijcset V1
412 pages
Sentiment Analysis On Amazon Fine Food Reviews by Using Linear Machine Learning Models
No ratings yet
Sentiment Analysis On Amazon Fine Food Reviews by Using Linear Machine Learning Models
6 pages
Analysis of German Credit Data
100% (1)
Analysis of German Credit Data
24 pages
Machine Learning
No ratings yet
Machine Learning
22 pages
R20 ML Notes
No ratings yet
R20 ML Notes
118 pages
Image Classification in IDRISI and ILWIS
100% (1)
Image Classification in IDRISI and ILWIS
4 pages
Patient Classification System
90% (10)
Patient Classification System
10 pages
The Illustrated BERT, ELMo, and Co. (How NLP Cracked Transfer Learning) - Jay Alammar - Visualizing Machine Learning One Concept at A Time.
No ratings yet
The Illustrated BERT, ELMo, and Co. (How NLP Cracked Transfer Learning) - Jay Alammar - Visualizing Machine Learning One Concept at A Time.
4 pages

Ch4 Machine Learning

Uploaded by

Ch4 Machine Learning

Uploaded by

MACHINE

• Machine Learning is the science (and art) of programming

• Machine Learning systems can be classified according to the amount and

• In supervised learning, the training data you feed to the algorithm

• A typical supervised learning task is

A labeled training set for supervised

• In unsupervised learning, as you An unlabeled training set for

• Visualization and dimensionality reduction

• Association rule learning

• A related task is dimensionality reduction, in which the goal is to simplify the

• Another important unsupervised task is

• Another common unsupervised task is association rule learning, in which the

• Some algorithms can deal with partially labeled

• One more way to categorize Machine Learning systems is by how they

You might also like