Highest Voted 'yolo' Questions - Artificial Intelligence Stack Exchange

11

votes

3 answers

Is it difficult to learn the rotated bounding box for a (rotated) object?

I have checked out many methods and papers, like YOLO, SSD, etc., with good results in detecting a rectangular box around an object, However, I could not find any paper that shows a method that learns a rotated bounding box. Is it difficult to learn…

asked Jan 11 '19 at 15:00

Ankish Bansal

253
1
2
8

9

votes

1 answer

In YOLO, what exactly do the values associated with each anchor box represent?

I'm going through Andrew NG's course, which talks about YOLO, but he doesn't go into the implementation details of anchor boxes. After having looked through the code, each anchor box is represented by two values, but what exactly are these values…

neural-networks convolutional-neural-networks computer-vision yolo

asked Jan 24 '18 at 01:46

moondra

209
2
4

7

votes

2 answers

What's the role of bounding boxes in object detection?

I'm quite new to the field of computer vision and was wondering what are the purposes of having the boundary boxes in object detection. Obviously, it shows where the detected object is, and using a classifier can only classify one object per image,…

machine-learning convolutional-neural-networks object-detection yolo bounding-box

asked Jan 25 '19 at 06:39

Cody Chung

173
5

6

votes

0 answers

What are the differences between Yolo v1 and CenterNet?

I recently read a new paper (late 2019) about a one-shot object detector called CenterNet. Apart from this, I'm using Yolo (V3) one-shot detector, and what surprised me is the close similarity between Yolo V1 and CenterNet. First, both frameworks…

deep-learning comparison object-detection yolo center-net

asked Oct 24 '19 at 15:25

Louis Lac

308
2
9

5

votes

1 answer

In YOLO, when is $\mathbb{1}_{i j}^{\mathrm{obj}} = 1$, and what are the ground-truth labels for $x_i$ and $y_i$?

I'm trying to implement a custom version of the YOLO neural network. Originally, it was described in the paper You Only Look Once: Unified, Real-Time Object Detection (2016). I have some problems understanding the loss function they used. Basic…

deep-learning convolutional-neural-networks papers object-recognition yolo

asked Mar 05 '18 at 20:22

Andrew

276
2
5

4

votes

1 answer

What is a unified neural network model?

In many articles (for example, in the YOLO paper, this paper or this one), I see the term "unified" being used. I was wondering what the meaning of "unified" in this case is.

neural-networks deep-learning terminology architecture yolo

asked Dec 27 '20 at 20:13

Reactionic

63
3

3

votes

1 answer

YOLO - are the anchor boxes used only in training?

another question in YOLO. I've red about how YOLO adjusts anchor boxes by offsets to create the final bounding boxes. What I do not understand, is when YOLO does it. Is it being done only during the training process, or also during the common use of…

convolutional-neural-networks yolo bounding-box

asked Jan 31 '22 at 12:48

Igor

181
10

3

votes

1 answer

What are the main differences between YOLOv3 and RetinaNet object detection algorithms?

I am looking at a certain project that compares performance on a certain dataset for an object detection problem using YOLOv3 and RetinaNet (or the "SSD_ResNet50_FPN" from TF Model Zoo). Both YOLOv3 and RetinaNet seem to have similar features like…

computer-vision tensorflow object-detection yolo single-shot-multibox-detector

asked Mar 02 '21 at 20:50

Ananda

148
9

3

votes

1 answer

How to treat (label and process) edge case inputs in machine learning?

In every computer vision project, I struggle with labeling guidelines for border cases. Benchmark datasets don't have this problem, because they are 'cleaned', but in real life unsure cases often constitute the majority of data. Is 15% of a cat's…

computer-vision classification datasets object-detection yolo

asked Feb 03 '21 at 14:43

Huxwell

101
4

3

votes

0 answers

How are Ground truth provided to each Pyramid map in RetinaNet or YOLOv3 Paper? How is the mapping of Feature Pyramids done to Ground Truth

SO the YOLO V3 and RetinaNet both uses the Feature pyramids which look something like this: (except b and e which have one output) I'm just confuse how the predictions and training is done? Do we have to give EACH feature map a different Y label? IF…

deep-learning computer-vision object-detection object-recognition yolo

asked Jan 31 '21 at 11:14

Deshwal

253
1
10

3

votes

0 answers

If random rotations are included in the data augmentation process, how are the new bounding boxes calculated?

When studying bounding box-based detectors, it's not clear to me if data augmentation includes adding random rotations. If random rotations are added, how is the new bounding box calculated?

deep-learning object-detection object-recognition yolo data-augmentation

asked Nov 15 '20 at 22:22

FourierFlux

783
1
4
14

3

votes

2 answers

Calculation of FPS on object detection task

How to calculate mean speed in FPS for an object detection model like YOLOv3 or YOLOv3-Tiny? Different object detection models are often presented on charts like this: I am using the DarkNet framework in my project and I want to create similar…

convolutional-neural-networks object-detection yolo

asked Jan 04 '20 at 15:50

Panicum

141
1
7

3

votes

1 answer

How can I incrementally train a Yolo model without catastrophic forgetting?

I have successfully trained a Yolo model to recognize k classes. Now I want to train by adding k+1 class to the pre-trained weights (k classes) without forgetting previous k classes. Ideally, I want to keep adding classes and train over the previous…

convolutional-neural-networks training yolo incremental-learning catastrophic-forgetting

asked Jul 16 '19 at 13:30

Troy

73
4

2

votes

3 answers

Would YOLO be able to detect objects in "different" positions?

I have the following question about You Only Look Once (YOLO) algorithm, for object detection. I have to develop a neural network to recognize web components in web applications - for example, login forms, text boxes, and so on. In this context, I…

convolutional-neural-networks reference-request datasets object-detection yolo

asked Aug 20 '18 at 13:40

giada

21
3

2

votes

2 answers

What YOLO algorithm can I use for images with noise as I will implement it in real time?

I want to detect drivers with or without seatbelts at crossroads. For that, as it is real-time, I am going to use the YOLO algorithm/model. For training data sets (the images) I need to collect, I placed a camera. By recording it and collecting…

machine-learning deep-learning tensorflow yolo

asked Jul 01 '18 at 15:18

ťęjă RAVURI

21
1

Questions tagged [yolo]