Lecture Paola Object Detection

Uploaded by

Milad24

CS6501: Deep Learning for Visual Recognition

Object Detection:
RCNN, Fast-RCNN, Faster-RCNN
Today’s Class

• Object Detection

• The RCNN Object Detector (2014)

• The Fast RCNN Object Detector (2015)
• The Faster RCNN Object Detector (2015)

• YOLO (CVPR 2016)

• SSD (ECCV 2016)
Object Detection

deer

cat
Object Detection

Fully Connected: 4096 to k
Class Scores
Deer: 0.9
Cat: 0.05
Umbrella: 0.01
…

Fully Connected: 4096 to 4
Box Coordinates
(x, y, w, h)
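
The two fully connected heads on this slide can be sketched in a few lines. This is a minimal illustration, not the lecture's implementation: the random weights, the softmax on the class head, and the name detection_heads are assumptions made for the example.

```python
import numpy as np

def detection_heads(features, w_cls, b_cls, w_box, b_box):
    """Apply two fully connected heads to one 4096-d feature vector."""
    logits = features @ w_cls + b_cls          # shape (k,)
    exp = np.exp(logits - logits.max())        # numerically stable softmax
    class_scores = exp / exp.sum()             # k class scores that sum to 1
    box = features @ w_box + b_box             # shape (4,): x, y, w, h
    return class_scores, box

# Hypothetical setup: k = 3 classes (e.g. deer, cat, umbrella), random weights.
rng = np.random.default_rng(0)
k = 3
features = rng.standard_normal(4096)
w_cls = rng.standard_normal((4096, k)) * 0.01
w_box = rng.standard_normal((4096, 4)) * 0.01
class_scores, box = detection_heads(features, w_cls, np.zeros(k), w_box, np.zeros(4))
```

A single pair of heads like this predicts exactly one box, which is why the following slides ask what happens when the image contains several objects.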
Object Detection

4096 Deer: (x, y, w, h)

Cat: (x, y, w, h)
Object Detection

Penguin: (x, y, w, h)
4096 Penguin: (x, y, w, h)
Penguin: (x, y, w, h)
Penguin: (x, y, w, h)
…
Object Detection as Classification

CNN: deer? cat? background?
Object Detection as Classification with Sliding Window

CNN: deer? cat? background?
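
A sliding window can be sketched as a loop that enumerates candidate boxes, each of which would be cropped and passed to the CNN classifier. The function name, window size, and stride below are illustrative assumptions.

```python
def sliding_windows(image_w, image_h, win_w, win_h, stride):
    """Return (x, y, w, h) boxes that cover the image on a regular grid."""
    boxes = []
    for y in range(0, image_h - win_h + 1, stride):
        for x in range(0, image_w - win_w + 1, stride):
            boxes.append((x, y, win_w, win_h))
    return boxes

# Toy example: an 8 x 6 image, 4 x 4 windows, stride 2.
boxes = sliding_windows(image_w=8, image_h=6, win_w=4, win_h=4, stride=2)
print(len(boxes))  # 6 windows at one scale and one aspect ratio
```

The count multiplies with every extra window size and aspect ratio, and each window costs one CNN evaluation. That cost is the motivation for classifying a smaller set of box proposals instead.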
Object Detection as Classification with Box Proposals
RCNN

https://people.eecs.berkeley.edu/~rbg/papers/r-cnn-cvpr.pdf
Rich feature hierarchies for accurate object detection and semantic segmentation.
Girshick et al. CVPR 2014.
RCNN
First stage: generate category-independent region proposals.
• 2000 region proposals for every image

Selective Search: combine the strength of both an exhaustive search and segmentation.
Uijlings et al. IJCV 2013.
RCNN
First stage: generate category-independent region proposals.
• 2000 region proposals for every image

Second stage: extract a fixed-length feature vector from each region.
• a 4096-dimensional feature vector from each region proposal
• Arbitrary rectangles? A fixed size input? Warp each region to 227 x 227.
• CNN: 5 conv layers + 2 fully connected layers
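
The warp step can be sketched as a crop followed by a resize to the fixed 227 x 227 input. This sketch assumes nearest-neighbor sampling and does not add the context padding used in the R-CNN paper; warp_region is a hypothetical name.

```python
import numpy as np

def warp_region(image, box, out_size=227):
    """Crop box = (x, y, w, h) from an H x W x C image and resize it to out_size x out_size."""
    x, y, w, h = box
    crop = image[y:y + h, x:x + w]
    # Nearest-neighbor sampling: map each output pixel back to a source pixel.
    rows = np.arange(out_size) * h // out_size
    cols = np.arange(out_size) * w // out_size
    return crop[rows][:, cols]

# Toy example: a 50 x 30 proposal in a 100 x 80 image becomes 227 x 227.
image = np.zeros((100, 80, 3), dtype=np.uint8)
warped = warp_region(image, box=(10, 20, 50, 30))
print(warped.shape)  # (227, 227, 3)
```

Warping ignores the aspect ratio of the proposal, so the region is stretched to fit the square input the CNN expects.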
RCNN
First stage: generate category-independent region proposals.
• 2000 region proposals for every image

Second stage: extract a fixed-length feature vector from each region.
• a 4096-dimensional feature vector from each region proposal

Third stage: a set of class-specific linear SVMs.
• object category: linear SVM on the feature vector (people? horse? background?)
• object location: bounding box regression from the proposal location (x, y, w, h)
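
The bounding box regression step can be sketched with the parameterization from the R-CNN paper (Girshick et al. CVPR 2014): the proposal is given by its center, width, and height, and the regressor predicts four offsets. The function name and the example numbers are illustrative assumptions.

```python
import math

def apply_box_deltas(proposal, deltas):
    """Refine a proposal (center x, center y, w, h) with predicted offsets (dx, dy, dw, dh)."""
    px, py, pw, ph = proposal
    dx, dy, dw, dh = deltas
    gx = pw * dx + px          # shift the center in units of the proposal width
    gy = ph * dy + py          # shift the center in units of the proposal height
    gw = pw * math.exp(dw)     # scale the width in log space
    gh = ph * math.exp(dh)     # scale the height in log space
    return (gx, gy, gw, gh)

# Toy example: move the center by (+2, -2) and double the height.
refined = apply_box_deltas((50.0, 40.0, 20.0, 10.0), (0.1, -0.2, 0.0, math.log(2.0)))
```

Predicting offsets relative to the proposal size, rather than absolute pixel coordinates, keeps the regression targets on a similar scale for small and large proposals.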
RCNN
• Simple and scalable.
• Improves mAP.
• A multistage pipeline.
• Training is expensive in space and time (features are extracted from each region proposal in each image and written to disk).
• Object detection is slow.

Fast-RCNN
?
Fast-RCNN

Idea: No need to recompute features for every box independently

https://arxiv.org/abs/1504.08083
Fast R-CNN. Girshick. ICCV 2015.
Fast-RCNN

Process the whole image with several convolutional (conv) and max pooling layers to produce a conv feature map.

A region of interest (RoI) pooling layer extracts a fixed-length feature vector from the region feature map.

The feature vector feeds two output layers:
• FC + softmax: K + 1 categories
• FC + regressor: four real-valued numbers for each of the K object classes.
…
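
RoI pooling can be sketched as max pooling over a grid of bins laid across the region, so that any region size yields the same output size. This is a minimal sketch under stated assumptions: the 7 x 7 output grid, integer RoI coordinates given in feature-map cells, and the name roi_max_pool are choices made for the example.

```python
import numpy as np

def roi_max_pool(feature_map, roi, out_h=7, out_w=7):
    """Max-pool an RoI (x0, y0, x1, y1), in feature-map cells, into an out_h x out_w grid."""
    x0, y0, x1, y1 = roi
    region = feature_map[y0:y1, x0:x1]
    h, w = region.shape[:2]
    # Bin edges split the region into roughly equal sub-windows.
    ys = np.linspace(0, h, out_h + 1).astype(int)
    xs = np.linspace(0, w, out_w + 1).astype(int)
    pooled = np.zeros((out_h, out_w) + region.shape[2:], dtype=feature_map.dtype)
    for i in range(out_h):
        for j in range(out_w):
            # Make sure every bin spans at least one cell.
            y_end = max(ys[i + 1], ys[i] + 1)
            x_end = max(xs[j + 1], xs[j] + 1)
            pooled[i, j] = region[ys[i]:y_end, xs[j]:x_end].max(axis=(0, 1))
    return pooled

# Toy example: two RoIs of different sizes give the same output shape.
feature_map = np.arange(14 * 14, dtype=float).reshape(14, 14)
pooled_full = roi_max_pool(feature_map, roi=(0, 0, 14, 14))
pooled_small = roi_max_pool(feature_map, roi=(2, 3, 12, 9))
```

Because the conv feature map is computed once per image, each additional region only costs this pooling step plus the fully connected layers.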
RCNN vs Fast-RCNN

Figure adapted from: http://cs231n.stanford.edu/slides/2017/cs231n_2017_lecture11.pdf

RCNN
• Simple and scalable.
• Improves mAP.
• A multistage pipeline.
• Training is expensive in space and time (features are extracted from each region proposal in each image and written to disk).
• Object detection is slow.

Fast-RCNN
• Higher mAP.
• Single stage, end-to-end training.
• No disk storage is required for feature caching.
• Proposals are the computational bottleneck in detection systems.

Faster-RCNN
?
Faster-RCNN

Idea: Integrate the Bounding Box Proposals as part of the CNN predictions

https://arxiv.org/abs/1506.01497
Ren et al. NIPS 2015.
Faster-RCNN
Region Proposal Networks:
• Shared conv layers produce a feature map.
• A sliding window (nxn conv layer) runs over the feature map.
• Each window position has k anchor boxes.
• cls layer (1x1 conv layer): 2k scores (object or not object).
• reg layer (1x1 conv layer): 4k coordinates (bounding box proposal).
• The RPN and Fast-RCNN share the conv layers.
…
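
The anchor boxes and the 2k / 4k output sizes can be sketched as follows. The 3 scales and 3 aspect ratios (k = 9) follow the Faster R-CNN paper rather than these slides, and the feature-map size and the name make_anchors are hypothetical.

```python
import numpy as np

def make_anchors(scales=(128, 256, 512), ratios=(0.5, 1.0, 2.0)):
    """Return k = len(scales) * len(ratios) anchors as (w, h) pairs, centered on the window."""
    anchors = []
    for s in scales:
        for r in ratios:
            # Keep the area at s * s while setting h / w = r.
            w = s / np.sqrt(r)
            h = s * np.sqrt(r)
            anchors.append((w, h))
    return np.array(anchors)

anchors = make_anchors()
k = len(anchors)                        # 9 anchors per window position
feat_h, feat_w = 38, 50                 # hypothetical feature-map size
cls_shape = (feat_h, feat_w, 2 * k)     # object / not object per anchor
reg_shape = (feat_h, feat_w, 4 * k)     # four box coordinates per anchor
total_anchors = feat_h * feat_w * k
```

Every window position reuses the same k anchors, so the two 1x1 conv layers only need 2k and 4k output channels regardless of the image size.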

RCNN
• Simple and scalable.
• Improves mAP.
• A multistage pipeline.
• Training is expensive in space and time (features are extracted from each region proposal in each image and written to disk).
• Object detection is slow.

Fast-RCNN
• Higher mAP.
• Single stage, end-to-end training.
• No disk storage is required for feature caching.
• Proposals are the computational bottleneck in detection systems.

Faster-RCNN
• Compute proposals with a deep convolutional neural network: the Region Proposal Network (RPN).
• Merge RPN and Fast R-CNN into a single network, enabling nearly cost-free region proposals.
YOLO- You Only Look Once

Idea: No bounding box proposal.

A single regression problem,
straight from image pixels to
bounding box coordinates and
class probabilities.

• extremely fast
• reason globally
• learn generalizable representations

https://arxiv.org/abs/1506.02640
Redmon et al. CVPR 2016.
YOLO- You Only Look Once

Divide the image into 7x7 cells.

Each cell trains a detector.
The detector needs to predict the object’s class distributions.
The detector has 2 bounding-box predictors to predict
bounding-boxes and confidence scores.
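
The size of the YOLO prediction follows from the grid layout above. The sketch below assumes 20 object classes, as in the PASCAL VOC setup of the YOLO paper, a value the slide does not state; each bounding-box predictor outputs x, y, w, h and one confidence score.

```python
def yolo_output_shape(grid=7, boxes_per_cell=2, num_classes=20):
    """Return the shape of the YOLO prediction tensor for one image."""
    # Each box predictor outputs x, y, w, h and one confidence score.
    per_cell = boxes_per_cell * 5 + num_classes
    return (grid, grid, per_cell)

shape = yolo_output_shape()
print(shape)  # (7, 7, 30)
```

The class distribution is predicted once per cell, not once per box, which is why the 20 class values are added a single time.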
SSD: Single Shot Detector

Idea: Similar to YOLO, but denser grid map, multiscale grid maps. + Data augmentation + Hard negative mining + Other design choices in the network.

Liu et al. ECCV 2016.

Questions?
