Bounding box annotations and object orientation

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.

Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

getstream.io

featured

InfluxDB – Built for High-Performance Time Series Workloads

InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

www.influxdata.com

featured

darknet

1 62 22,192 4.8 C

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet ) (by AlexeyAB)

One thing I'd like to point out: YOLOv5 may not be what you think it is. Note that a company took the the YOLO name, released something called YOLOv5 just days (weeks?) after YOLOv4 was announced by AlexeyAB. In the end, YOLOv4 is both faster and more precise than YOLOv5. You can find some of the details on the YOLOv5 shenanigans here: https://github.com/AlexeyAB/darknet/issues/5920
Stream

getstream.io featured

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
Yet-Another-EfficientDet-Pytorch

2 1 5,259 0.0 Jupyter Notebook

The pytorch re-implement of the official efficientdet with SOTA performance in real time and pretrained weights.

However, there are papers on oriented object detectors (see https://arxiv.org/pdf/1911.07732.pdf) for example. In that paper, they do achieve better results using oriented bounding boxes. If you want to go down that route, I would suggest using the EfficientDet model, because the PyTorch code that you'll find for it is quite easy to understand and modify. For example, I've taken https://github.com/zylo117/Yet-Another-EfficientDet-Pytorch, and modified it to include a "thing-ness" logit, and this was pretty easy to do. Classic EfficientDet models only include logits (aka output neurons that get softmax-ed) for each class, and if any one of these class neurons is greater than 0.5, then it is considered "a thing". Anyway - that's digression, but my point is that I've thought about adding oriented box support to an EfficientDet model, and it didn't seem to be too hard, although I haven't actually done it. If I was to start now, I would probably go with https://github.com/rwightman/efficientdet-pytorch, since Ross Wightman's models are becoming a de-facto standard in the PyTorch world for all things image-related.
efficientdet-pytorch

3 1 1,642 3.3 Python

A PyTorch impl of EfficientDet faithful to the original Google impl w/ ported weights

However, there are papers on oriented object detectors (see https://arxiv.org/pdf/1911.07732.pdf) for example. In that paper, they do achieve better results using oriented bounding boxes. If you want to go down that route, I would suggest using the EfficientDet model, because the PyTorch code that you'll find for it is quite easy to understand and modify. For example, I've taken https://github.com/zylo117/Yet-Another-EfficientDet-Pytorch, and modified it to include a "thing-ness" logit, and this was pretty easy to do. Classic EfficientDet models only include logits (aka output neurons that get softmax-ed) for each class, and if any one of these class neurons is greater than 0.5, then it is considered "a thing". Anyway - that's digression, but my point is that I've thought about adding oriented box support to an EfficientDet model, and it didn't seem to be too hard, although I haven't actually done it. If I was to start now, I would probably go with https://github.com/rwightman/efficientdet-pytorch, since Ross Wightman's models are becoming a de-facto standard in the PyTorch world for all things image-related.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Slowdown / normalization on the Front Lines

1 project | /r/singularity | 30 May 2023
Lion, a new Optimizer from Google, provides 3-5x speedup compared to AdamW

1 project | news.ycombinator.com | 21 Feb 2023
How do I increase the accuracy of small objects when training an object detector?

1 project | /r/learnmachinelearning | 6 Jun 2022
[R] Google AI Introduces Two New Families of Neural Networks Called ‘EfficientNetV2’ and ‘CoAtNet’ For Image Recognition

2 projects | /r/MachineLearning | 18 Sep 2021
Google AI Introduces Two New Families of Neural Networks Called ‘EfficientNetV2’ and ‘CoAtNet’ For Image Recognition

1 project | /r/neuralnetworks | 18 Sep 2021

Bounding box annotations and object orientation

This page summarizes the projects mentioned and recommended in the original post on /r/computervision
hardware-buttons linkedin-bot template-engine-js
Post date: 26 Aug 2021

darknet

Stream

Yet-Another-EfficientDet-Pytorch

efficientdet-pytorch

Related posts

Slowdown / normalization on the Front Lines

Lion, a new Optimizer from Google, provides 3-5x speedup compared to AdamW

How do I increase the accuracy of small objects when training an object detector?

[R] Google AI Introduces Two New Families of Neural Networks Called ‘EfficientNetV2’ and ‘CoAtNet’ For Image Recognition

Google AI Introduces Two New Families of Neural Networks Called ‘EfficientNetV2’ and ‘CoAtNet’ For Image Recognition

Did you know that C is
the 6th most popular programming language
based on number of references?

Bounding box annotations and object orientation

This page summarizes the projects mentioned and recommended in the original post on /r/computervision hardware-buttons linkedin-bot template-engine-js Post date: 26 Aug 2021

darknet

Stream

Yet-Another-EfficientDet-Pytorch

efficientdet-pytorch

Related posts

Slowdown / normalization on the Front Lines

Lion, a new Optimizer from Google, provides 3-5x speedup compared to AdamW

How do I increase the accuracy of small objects when training an object detector?

[R] Google AI Introduces Two New Families of Neural Networks Called ‘EfficientNetV2’ and ‘CoAtNet’ For Image Recognition

Google AI Introduces Two New Families of Neural Networks Called ‘EfficientNetV2’ and ‘CoAtNet’ For Image Recognition

Did you know that C is the 6th most popular programming language based on number of references?

This page summarizes the projects mentioned and recommended in the original post on /r/computervision
hardware-buttons linkedin-bot template-engine-js
Post date: 26 Aug 2021

Did you know that C is
the 6th most popular programming language
based on number of references?