PyTorch Tutorial
Chongruo Wu
Agenda
1. Popular Frameworks
2. PyTorch Basics
3. Helpful Skills
Popular Deep Learning Frameworks
Imperative: imperative-style programs perform computation as you run them.
Symbolic: define the function first, then compile and run it.
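A minimal sketch of the contrast, using PyTorch for the imperative style and a made-up pseudo-API (sym.*, purely illustrative) for the symbolic style:

import torch

# Imperative: each line runs immediately and produces real values.
a = torch.ones(2, 2)
b = a * 2          # b is computed right here
print(b.sum())     # intermediate results can be inspected at any point

# Symbolic (pseudo-API, for illustration only): build a graph first,
# compile it, and only then feed in data.
# A = sym.variable('A')
# B = A * 2
# f = sym.compile(inputs=[A], outputs=[B])   # nothing computed yet
# result = f(data)                           # computation happens here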
Gluon: a new MXNet interface to accelerate research.
Language bindings: Torch (Lua); MXNet (C++, Python, R, Julia, Perl, Scala)
Credits: Stanford cs231n; MXNet Tutorial, CVPR 2017; MXNet Online Document, https://goo.gl/UZ2byD
PyTorch
Stanford cs231n.
PyTorch Tensors
https://transfer.d2.mpi-inf.mpg.de/rshetty/hlcv/Pytorch_tutorial.pdf
Stanford cs231n.
Variable
The autograd package provides automatic differentiation for all operations on Tensors.
"autograd.Variable is the central class of the package. It wraps a Tensor, and supports nearly all of the operations defined on it. Once you finish your computation you can call .backward() and have all the gradients computed automatically."
PyTorch Tutorial, www.pytorch.org
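A minimal sketch of the Variable workflow quoted above. (Since PyTorch 0.4, Variable was merged into Tensor, but the class still works.)

import torch
from torch.autograd import Variable  # merged into Tensor since PyTorch 0.4

x = Variable(torch.ones(2, 2), requires_grad=True)
y = (x * 3).sum()   # build the computation
y.backward()        # all gradients computed automatically
print(x.grad)       # dy/dx = 3 for every element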
Stanford cs231n.
Module, single layer
Other layers: Dropout, Linear, Normalization layers
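A sketch of instantiating single layers as Modules; the channel and feature sizes here are illustrative:

import torch
import torch.nn as nn

conv = nn.Conv2d(in_channels=3, out_channels=16, kernel_size=3, padding=1)
drop = nn.Dropout(p=0.5)
fc   = nn.Linear(in_features=16, out_features=10)
bn   = nn.BatchNorm2d(num_features=16)

x = torch.randn(1, 3, 32, 32)   # (batch, channels, height, width)
out = bn(conv(x))               # each layer is a callable Module
print(out.shape)                # torch.Size([1, 16, 32, 32])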
Module, network
PyTorch, Zero to All (HKUST)
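A minimal sketch of a whole network as an nn.Module subclass; the layer sizes are illustrative:

import torch.nn as nn
import torch.nn.functional as F

class Net(nn.Module):
    def __init__(self):
        super(Net, self).__init__()
        self.fc1 = nn.Linear(784, 128)
        self.fc2 = nn.Linear(128, 10)

    def forward(self, x):
        x = F.relu(self.fc1(x))   # define the forward pass imperatively
        return self.fc2(x)

net = Net()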
Module, sub-network
http://book.paddlepaddle.org/03.image_classification/
Module, sub-network
https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix
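A sketch of the sub-network pattern used in code bases like the linked CycleGAN/pix2pix repo: define a reusable block, then compose blocks into a larger Module. conv_block and SmallNet are hypothetical names, not taken from that code.

import torch.nn as nn

def conv_block(in_ch, out_ch):
    # a reusable sub-network: conv -> batchnorm -> relu
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
        nn.BatchNorm2d(out_ch),
        nn.ReLU(inplace=True),
    )

class SmallNet(nn.Module):
    def __init__(self):
        super(SmallNet, self).__init__()
        self.features = nn.Sequential(
            conv_block(3, 32),
            conv_block(32, 64),   # sub-networks compose like layers
        )

    def forward(self, x):
        return self.features(x)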
Module
Stanford cs231n.
When starting a new project
1. Data preparation (processing, format)
2. Model design (pretrained, or design your own model)
3. Training strategy
Train a simple Network
Stanford cs231n.
1. Forward: compute the output of each layer
2. Backward: compute the gradients
3. Update: update the parameters with the computed gradients (see the sketch below)
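A minimal loop showing the three steps; the model and data here are dummies (a single nn.Linear layer and a random batch), standing in for a real network:

import torch
import torch.nn as nn

model = nn.Linear(10, 2)                 # hypothetical model
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

x = torch.randn(8, 10)                   # dummy batch
target = torch.randint(0, 2, (8,))       # dummy labels

for step in range(100):
    output = model(x)                    # 1. forward
    loss = criterion(output, target)
    optimizer.zero_grad()
    loss.backward()                      # 2. backward
    optimizer.step()                     # 3. update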
MNIST example
https://goo.gl/mQEw15
MNIST example
Data Loading
https://goo.gl/mQEw15
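A data-loading sketch in the spirit of the linked MNIST example, using torchvision; the './data' path is illustrative and the normalization constants are the commonly used MNIST mean/std:

import torch
from torchvision import datasets, transforms

train_loader = torch.utils.data.DataLoader(
    datasets.MNIST('./data', train=True, download=True,
                   transform=transforms.Compose([
                       transforms.ToTensor(),
                       transforms.Normalize((0.1307,), (0.3081,)),
                   ])),
    batch_size=64, shuffle=True)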
MNIST example
Define Network
https://goo.gl/mQEw15
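A small CNN of the kind used in the linked example; the exact architecture here is a sketch, not a copy:

import torch.nn as nn
import torch.nn.functional as F

class Net(nn.Module):
    def __init__(self):
        super(Net, self).__init__()
        self.conv1 = nn.Conv2d(1, 10, kernel_size=5)
        self.conv2 = nn.Conv2d(10, 20, kernel_size=5)
        self.fc1 = nn.Linear(320, 50)
        self.fc2 = nn.Linear(50, 10)

    def forward(self, x):
        x = F.relu(F.max_pool2d(self.conv1(x), 2))   # 28x28 -> 12x12
        x = F.relu(F.max_pool2d(self.conv2(x), 2))   # 12x12 -> 4x4
        x = x.view(-1, 320)                          # flatten: 20*4*4 = 320
        x = F.relu(self.fc1(x))
        return F.log_softmax(self.fc2(x), dim=1)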
MNIST example
Training
https://goo.gl/mQEw15
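A training sketch that pairs with the network above; Net, train_loader, and the SGD hyperparameters are assumptions carried over from the previous sketches:

import torch.nn.functional as F
import torch.optim as optim

model = Net()
optimizer = optim.SGD(model.parameters(), lr=0.01, momentum=0.5)

def train(epoch):
    model.train()                           # enable training-mode behavior
    for batch_idx, (data, target) in enumerate(train_loader):
        optimizer.zero_grad()
        output = model(data)
        loss = F.nll_loss(output, target)   # pairs with log_softmax output
        loss.backward()
        optimizer.step()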
MNIST example
Inference
eval() mode changes the behavior of:
* Dropout layers (disabled at inference)
* BatchNorm layers (use running statistics instead of batch statistics)
https://goo.gl/mQEw15
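A sketch of inference with eval() mode; test_loader is an assumed loader built like train_loader above:

def test():
    model.eval()                             # Dropout off, BatchNorm uses running stats
    correct = 0
    for data, target in test_loader:         # test_loader: assumed
        output = model(data)
        pred = output.max(1)[1]              # index of the max log-probability
        correct += pred.eq(target).sum().item()
    print('Accuracy: {}/{}'.format(correct, len(test_loader.dataset)))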
When starting a new project
1. Data preparation (processing, format)
2. Model design (pretrained, or design your own model)
3. Training strategy
Data Loading
PyTorch, Zero to All (HKUST); Stanford cs231n; https://goo.gl/mQEw15
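A minimal custom Dataset/DataLoader sketch of the kind covered by the sources above; MyDataset and its tensors are hypothetical:

import torch
from torch.utils.data import Dataset, DataLoader

class MyDataset(Dataset):
    # hypothetical dataset wrapping a tensor of inputs and labels
    def __init__(self, inputs, labels):
        self.inputs = inputs
        self.labels = labels

    def __len__(self):
        return len(self.inputs)              # number of samples

    def __getitem__(self, idx):
        return self.inputs[idx], self.labels[idx]

dataset = MyDataset(torch.randn(100, 10), torch.randint(0, 2, (100,)))
loader = DataLoader(dataset, batch_size=32, shuffle=True, num_workers=2)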
Data Processing
https://goo.gl/mQEw15
Data Processing
Pix2pix Code
https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix
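A sketch of a typical torchvision transform pipeline of the resize/crop/flip/normalize kind used in pix2pix-style training; the exact sizes and constants are illustrative, not copied from that repo:

from torchvision import transforms

transform = transforms.Compose([
    transforms.Resize(286),
    transforms.RandomCrop(256),
    transforms.RandomHorizontalFlip(),
    transforms.ToTensor(),
    transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5)),  # map to [-1, 1]
])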
When starting a new project
1. Data preparation (processing, format)
2. Model design (pretrained, or design your own model)
3. Training strategy (learning rate)
Learning Rate Scheduler
Stanford cs231n.
Learning Rate Scheduler
torch.optim.lr_scheduler
● StepLR: LR is decayed by gamma every step_size epochs
● MultiStepLR: LR is decayed by gamma once the number of epochs reaches one of the milestones
● ExponentialLR
● CosineAnnealingLR
● ReduceLROnPlateau
https://github.com/Jiaming-Liu/pytorch-lr-scheduler
http://pytorch.org/docs/master/optim.html#how-to-adjust-learning-rate
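A usage sketch for StepLR; model and the train function are assumed to exist (e.g., from the MNIST sketches above):

import torch.optim as optim
from torch.optim import lr_scheduler

optimizer = optim.SGD(model.parameters(), lr=0.1)
scheduler = lr_scheduler.StepLR(optimizer, step_size=30, gamma=0.1)  # LR *= 0.1 every 30 epochs

for epoch in range(100):
    train(epoch)        # assumed training function
    scheduler.step()    # advance the schedule once per epoch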
Pretrained Model
Load Model
args.resume is the path to the trained model.
Define the model before loading its parameters.
https://gist.github.com/panovr/2977d9f26866b05583b0c40d88a315bf
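A sketch of the define-then-load pattern; Net, args, and the checkpoint keys ('state_dict', 'epoch') follow a common convention and are assumptions here, not taken from the linked gist:

import torch

model = Net()                              # define the architecture first
if args.resume:                            # args.resume: checkpoint path (assumed argparse flag)
    checkpoint = torch.load(args.resume)
    model.load_state_dict(checkpoint['state_dict'])
    start_epoch = checkpoint['epoch']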
Weights Initialization
net.apply(weights_init_normal)
https://goo.gl/bqeW1K
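A sketch of what weights_init_normal might look like, in the style of the linked code; the distribution parameters are illustrative, and nn.init.normal_ requires PyTorch >= 0.4:

import torch.nn as nn

def weights_init_normal(m):
    # called on every submodule by net.apply()
    if isinstance(m, nn.Conv2d):
        nn.init.normal_(m.weight.data, 0.0, 0.02)
    elif isinstance(m, nn.BatchNorm2d):
        nn.init.normal_(m.weight.data, 1.0, 0.02)
        nn.init.constant_(m.bias.data, 0.0)

net.apply(weights_init_normal)   # net: an existing nn.Module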
Hooks
Hooks let you inspect or modify a layer's output and gradients.
You need to write a hook function and register it on the module.
Hooks (Forward)
http://pytorch.org/tutorials/beginner/former_torchies/nn_tutorial.html
Hooks (Backward)
http://pytorch.org/tutorials/beginner/former_torchies/nn_tutorial.html
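A sketch of registering forward and backward hooks on a module; printing shapes is just one way to inspect:

import torch
import torch.nn as nn

model = nn.Linear(4, 2)

def forward_hook(module, input, output):
    print('forward:', output.shape)            # inspect the layer's output

def backward_hook(module, grad_input, grad_output):
    print('backward:', grad_output[0].shape)   # inspect gradients w.r.t. output

h1 = model.register_forward_hook(forward_hook)
h2 = model.register_backward_hook(backward_hook)  # newer releases prefer register_full_backward_hook

out = model(torch.randn(3, 4))
out.sum().backward()
h1.remove(); h2.remove()                       # hooks can be removed when done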
cudnn.benchmark flag
https://goo.gl/5gzj8F
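Enabling the flag is one line; it lets cuDNN benchmark and pick the fastest convolution algorithms, which helps when input sizes do not change between iterations:

import torch.backends.cudnn as cudnn

cudnn.benchmark = True   # autotune conv algorithms for the observed input sizes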
Visualization, PyTorch Visdom
https://github.com/facebookresearch/visdom
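A minimal Visdom sketch, assuming a Visdom server is running locally (python -m visdom.server):

import visdom
import numpy as np

vis = visdom.Visdom()                    # connects to the local server
vis.line(X=np.arange(10), Y=np.random.rand(10),
         opts=dict(title='training loss'))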
Visualization, TensorBoard
https://www.tensorflow.org/get_started/summaries_and_tensorboard
Other Resources
Official examples: https://goo.gl/Q6Z2k8
Official documents: https://goo.gl/gecKC4
pix2pix code: https://github.com/phillipi/pix2pix
PyTorch, Zero to All (HKUST): https://goo.gl/S3vEUN
Thank You