Practical Guide to Support Vector Machines
Tingfan Wu
MPLAB, UCSD
Outline
Data Classification
High-level Concepts of SVM
Interpretation of SVM Model/Result
Use Case Study
What does it mean to learn?
Acquire new skills?
Make predictions about the world?
Making predictions is fundamental to survival
Will that bear eat me?
Is there water in that canyon?
Is that person a good mate?
These are all examples of classification problems
Boot Camp Related
Motion classification
Face recognition / speaker identification
Brain-computer interface / spike classification
Driver fatigue detection from facial expressions
Data Classification
Pipeline: Sensor Data → Preprocessing → Features → Classifier (SVM / AdaBoost / Neural Network) → Prediction
Given training data (class labels known)
Predict test data (class labels unknown)
Not just fitting: generalization
Generalization
Many possible classification models
Which one generalizes better?
Why SVM? (my opinion)
With careful data preprocessing and proper use, SVM and neural networks give similar performance.
SVM is easier to use properly.
SVM provides a reasonably good baseline performance.
Outline
Data Classification
High-level Concepts of SVM
Interpretation of SVM Model/Result
Use case study
A Simple Dilemma
Who do I invite to my
birthday party?
Problem Formulation
Training data as vectors: xi
Binary labels: yi ∈ {+1, −1}

Name   Gift?   Income   Fondness
John   Yes     3k       3/5
Mary   No      5k       1/5

class      feature vector
y1 = +1    x1 = [3000, 0.6]
y2 = −1    x2 = [5000, 0.2]
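The same formulation as a minimal Python sketch (NumPy arrays mirroring the table above; purely illustrative):

import numpy as np

# One row per person; columns = [income, fondness]
X = np.array([[3000, 0.6],   # John
              [5000, 0.2]])  # Mary

# Binary class labels: +1 = gave a gift, -1 = no gift
y = np.array([+1, -1])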
Vector Space
[Figure: training data plotted in feature space; x1 = Fondness, x2 = Disposable Income; '+' = gave a gift, the rest gave no gift]
A Line
The line: wᵀx + b = 0, with normal vector w
[Figure: a line separating the two classes; x1 = first feature, x2 = second feature]
In higher-dimensional spaces this becomes a hyperplane.
The Inequalities and Regions
The line wᵀx + b = 0 splits the space into two regions:
wᵀxi + b > 0 on the '+' side
wᵀxi + b < 0 on the '−' side
Model: decision function f(x) = sign(wᵀx_new + b)
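A minimal sketch of this decision function in Python (the weights and bias below are made-up values for the party example, not learned ones):

import numpy as np

def decide(w, b, x_new):
    # Which side of the hyperplane w^T x + b = 0 does x_new fall on?
    return np.sign(w @ x_new + b)

# Hypothetical model for features [income, fondness]
w = np.array([-0.001, 5.0])
b = 0.5
print(decide(w, b, np.array([3000, 0.6])))  # +1: invite John
print(decide(w, b, np.array([5000, 0.2])))  # -1: skip Mary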
Large Margin
Maximal Margin
Data Not Linearly Separable
Case 1: a few noisy points or outliers spoil separability
Case 2: the true boundary is inherently nonlinear
The two tricks that follow address these cases in turn.
Trick 1: Soft Margin
Such points are usually outliers; the hyperplane should not be biased too much by them.
Add a penalty for data points that violate the margin; see the formulation below.
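In LaTeX, the standard soft-margin primal problem (the usual textbook form, not spelled out on the slide; C is the cost parameter that appears later as libsvm's -c option, and the slack ξi is the penalty for violating the margin):

\begin{aligned}
\min_{w,\,b,\,\xi}\quad & \tfrac{1}{2}\, w^T w + C \sum_{i=1}^{n} \xi_i \\
\text{s.t.}\quad & y_i\,(w^T x_i + b) \ge 1 - \xi_i, \qquad \xi_i \ge 0, \quad i = 1,\dots,n
\end{aligned}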
Soft Margin
[Figure: soft-margin examples, from Ben-Hur & Weston 2005]
Support Vectors
The more important data points, the ones that support (define) the hyperplane.
Trick 2: Map to Higher Dimension
Example: map each 1-D point x1 to the 2-D point (x1, x1²).
Data that cannot be separated on the original line become linearly separable after the mapping; the mapped points lie on the parabola x2 = x1².
Mapping to Infinite Dimension
Is it possible to create a universal mapping?
What if we could map to infinite dimension? Every problem would be separable!
Consider the Radial Basis Function (RBF): ⟨φ(x), φ(y)⟩ = Kernel(x, y) = exp(−γ‖x − y‖²), where φ maps into an infinite-dimensional space.
But then w has an infinite number of variables!
Dual Problem
Primal: optimize over w, which has infinitely many variables under such a mapping.
Dual: optimize over one variable αi per training point; the mapping enters only through the kernel K(xi, xj), so the computation stays finite.
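For reference, the standard dual of the soft-margin problem (not written out on the slide; α has one entry per training point, and the mapping appears only inside K):

\begin{aligned}
\max_{\alpha}\quad & \sum_{i=1}^{n} \alpha_i - \tfrac{1}{2} \sum_{i=1}^{n}\sum_{j=1}^{n} \alpha_i \alpha_j\, y_i y_j\, K(x_i, x_j) \\
\text{s.t.}\quad & 0 \le \alpha_i \le C, \qquad \sum_{i=1}^{n} \alpha_i y_i = 0
\end{aligned}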
Gaussian/RBF Kernel
Small γ: nearly flat decision boundary, similar to a linear kernel.
Large γ: highly curved boundary and overfitting, approaching nearest-neighbor behavior.
[Figure: decision boundaries for varying kernel parameters, from Ben-Hur & Weston 2005]
Recap
Soft-ness: controlled by the cost parameter C
Nonlinearity: controlled by the kernel parameter γ
Check Out svm-toy
http://www.csie.ntu.edu.tw/~cjlin/libsvm/
-c (cost: controls the softness of the margin / number of SVs)
-g (gamma: controls the curvature of the hyperplane)
Cross Validation
What is the best (C, γ)? It is data dependent.
It has to be determined by testing performance: split the training data into pseudo-training and pseudo-testing sets.
[Diagram: the training set is split into a 'training' part and a 'test' part; the held-out part determines the best (C, γ)]
Exhaustive grid search for the best (C, γ); see the sketch below.
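A minimal sketch of this search with scikit-learn (an alternative to libsvm's grid.py; the exponential grid follows the usual libsvm recommendation, and the data is a stand-in):

import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

# Stand-in data; replace with your own features and labels
rng = np.random.RandomState(0)
X = rng.randn(100, 4)
y = (X[:, 0] + X[:, 1] > 0).astype(int)

# Exhaustive grid over (C, gamma), scored by 5-fold cross validation
param_grid = {"C": 2.0 ** np.arange(-5, 16, 2),
              "gamma": 2.0 ** np.arange(-15, 4, 2)}
search = GridSearchCV(SVC(kernel="rbf"), param_grid, cv=5)
search.fit(X, y)
print(search.best_params_, search.best_score_)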
Outline
Data Classification
High-level Concepts of SVM
Interpretation of SVM Model/Result
Use Case Study
(1) Decision Value as Strength
Decision function: f(x) = sign(wᵀx_new + b)
The magnitude of the decision value wᵀx_new + b, before taking the sign, measures how strongly an example belongs to its predicted class.
Facial Movement Classification
Classes: brow up (+) or brow down (−)
Features: pixels of a Gabor-filtered image
Decision Value as Strength
Probability estimates derived from the decision values are also available (libsvm's -b option).
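A minimal scikit-learn sketch of both outputs (synthetic data; libsvm itself gives the equivalent via its -b option):

import numpy as np
from sklearn.svm import SVC

rng = np.random.RandomState(0)
X = rng.randn(200, 2)
y = (X[:, 0] > 0).astype(int)

# probability=True fits a probability model on top of the decision values
clf = SVC(kernel="rbf", probability=True).fit(X, y)

x_new = np.array([[0.5, -0.2]])
print(clf.decision_function(x_new))  # signed value: prediction strength
print(clf.predict_proba(x_new))      # probability estimates derived from it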
(2) Weight as Feature Importance
The magnitude of each weight reflects the importance of the corresponding feature,
similar to regression coefficients.
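A minimal sketch of reading off feature importance from a linear SVM (synthetic data; for a linear kernel the learned w is directly available, here via scikit-learn's coef_ attribute):

import numpy as np
from sklearn.svm import LinearSVC

rng = np.random.RandomState(0)
X = rng.randn(200, 5)
y = (2.0 * X[:, 0] - 0.5 * X[:, 3] > 0).astype(int)

clf = LinearSVC().fit(X, y)
w = clf.coef_.ravel()

# Rank features by |w_j|: larger magnitude = more important feature
for j in np.argsort(-np.abs(w)):
    print(f"feature {j}: weight {w[j]:+.3f}")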
(3) Weights as Profiles
Fluorescent images of cells at various dosages of a certain drug.
Various image-based features.
Clustering the weights reveals the primary and secondary effects of the drug.
Outline
Machine Learning Classification
High-level Concepts of SVM
Interpretation of SVM Model/Result
Use Case Study
The Software
SVM requires a constrained quadratic optimization solver, which is not easy to implement.
Off-the-shelf software:
libsvm by Chih-Jen Lin et al.
SVMlight by Thorsten Joachims
Incorporated into many ML packages: MATLAB / PyML / R
Beginners may:
1. convert their data into the format of an SVM package,
2. skip scaling,
3. randomly try a few parameters without cross validation, and
4. get good results on training data but poor results on test data.
Data Scaling
Without scaling, a feature with a large dynamic range may dominate the separating hyperplane.

label    X     Height   Gender
y1 = 0   x1    150      2
y2 = 1   x2    180      1
y3 = 1   x3    185      1

Height spans a far larger numeric range than the gender code, so it dominates unless the features are rescaled; a scaling sketch follows.
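A minimal min-max scaling sketch (one of the two options from the suggested procedure later; libsvm's svm-scale tool does the same job on its own data format):

import numpy as np

# Columns: height (cm), gender code; rows: x1, x2, x3 from the table
X = np.array([[150.0, 2.0],
              [180.0, 1.0],
              [185.0, 1.0]])

# Scale each feature (column) to [0, 1]
X_min, X_max = X.min(axis=0), X.max(axis=0)
X_scaled = (X - X_min) / (X_max - X_min)
print(X_scaled)

# Important: apply the SAME X_min / X_max to the test data,
# not statistics computed from the test set itself.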
Parameter Selection
[Figure: contour of cross-validation accuracy over (C, γ), showing a 'good area' of parameter values]
Use Case: An Astroparticle Scientist
User:
"I am using libsvm in an astroparticle physics application... First, let me congratulate you on a really easy to use and nice package. Unfortunately, it gives me astonishingly bad test results..."
"OK. Please send us your data."
"We are able to get 97% test accuracy. Is that good enough for you?"
User:
"You earned a copy of my PhD thesis."
Dynamic Range Mismatch
A problem from astroparticle physics
<label> <index>:<value> <index>:<value>
1 1:2.6173e+01 2:5.88670e+01 3:-1.89469e-01 4:1.25122e+02
1 1:5.7073e+01 2:2.21404e+02 3:8.60795e-02 4:1.22911e+02
1 1:1.7259e+01 2:1.73436e+02 3:-1.29805e-01 4:1.25031e+02
1 1:2.1779e+01 2:1.24953e+02 3:1.53885e-01 4:1.52715e+02
1 1:9.1339e+01 2:2.93569e+02 3:1.42391e-01 4:1.60540e+02
1 1:5.5375e+01 2:1.79222e+02 3:1.65495e-01 4:1.11227e+02
1 1:2.9562e+01 2:1.91357e+02 3:9.90143e-02 4:1.03407e+02
Training set: 3,089 examples; test set: 4,000 examples.
Some features have a large dynamic range (compare feature 2 against feature 3 above).
Overfitting
Training:
$ ./svm-train train.1    (default parameters used)
optimization finished, #iter = 6131
nSV = 3053, nBSV = 724
Total nSV = 3053
Training accuracy:
$ ./svm-predict train.1 train.1.model o
Accuracy = 99.7734% (3082/3089)
Testing accuracy:
$ ./svm-predict test.1 train.1.model test.1.out
Accuracy = 66.925% (2677/4000)
nSV and nBSV: number of support vectors and of bounded support vectors (αi = C).
Without scaling, one feature may dominate the decision value: overfitting.
3053 of 3089 training points become support vectors: overfitting.
Training accuracy is high but testing accuracy is low: overfitting.
Suggested Procedure
Pre-scale the data: to the range [0, 1] or to unit variance.
Use the (default) Gaussian (RBF) kernel.
Use cross validation to find the best parameters (C, γ).
Train your model with the best parameters.
Test!
All of the above is done automatically by the easy.py script shipped with libsvm; a scikit-learn equivalent is sketched below.
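The whole suggested procedure as a minimal scikit-learn sketch (an alternative to easy.py; the data is a stand-in and the grid follows the usual libsvm recommendation):

import numpy as np
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MinMaxScaler
from sklearn.svm import SVC

# Stand-in data with mismatched feature ranges
rng = np.random.RandomState(0)
X = rng.randn(300, 4) * np.array([1.0, 100.0, 0.1, 50.0])
y = (X[:, 1] / 100.0 + 10.0 * X[:, 2] > 0).astype(int)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# 1) scale to [0, 1]  2) RBF kernel  3) cross-validated grid search
model = GridSearchCV(
    make_pipeline(MinMaxScaler(), SVC(kernel="rbf")),
    {"svc__C": 2.0 ** np.arange(-5, 16, 2),
     "svc__gamma": 2.0 ** np.arange(-15, 4, 2)},
    cv=5,
)
model.fit(X_tr, y_tr)            # 4) train with the best (C, gamma)
print(model.score(X_te, y_te))   # 5) test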
Large-Scale SVM
When (#training data >> #features) and the kernel is linear:
Use primal solvers (e.g., liblinear).
To get an approximate result in a short time, allow an inaccurate stopping condition:
$ ./svm-train -e 0.01
Or use stochastic gradient descent solvers; see the sketch below.
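One example of the stochastic-gradient route (scikit-learn's SGDClassifier with hinge loss optimizes a linear SVM objective; a sketch, not something the slides prescribe):

import numpy as np
from sklearn.linear_model import SGDClassifier

# Many examples, few features: the regime where linear solvers shine
rng = np.random.RandomState(0)
X = rng.randn(10000, 20)
y = (X @ rng.randn(20) > 0).astype(int)

# loss="hinge" gives a linear SVM trained by stochastic gradient descent
clf = SGDClassifier(loss="hinge", alpha=1e-4).fit(X, y)
print(clf.score(X, y))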
Resources
LIBSVM: http://www.csie.ntu.edu.tw/~cjlin/libsvm
LIBSVM Tools: http://www.csie.ntu.edu.tw/~cjlin/libsvmtools
Kernel Machines Forum: http://www.kernel-machines.org
Hsu, Chang, and Lin: A Practical Guide to Support Vector Classification
My email: tfwu@ucsd.edu

Acknowledgement
Many slides are from Dr. Chih-Jen Lin, NTU