Questions tagged [faster-r-cnn]

For questions related to the Faster R-CNN model, which was proposed in "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks" (2015) by Shaoqing Ren et al. and published at NeurIPS. Faster R-CNN is an improved version of Fast R-CNN, which, in turn, is an improved version of R-CNN.

8 questions

2 answers

Is there an argument against using the (reviewed) predictions of a model as ground truth to further train exactly this model?

I plan to use my predictions as ground truth to continue training my model. These predictions are of course reviewed during this process. Is there an argument against that (reinforcement of slight mistakes/overfitting etc.)? Here my specific use…

asked May 23 '22 at 12:40

thzu

1 answer

How does the region proposal method work in Fast R-CNN?

I read so many articles and the Fast R-CNN paper, but I'm still confused about how the region proposal method works in Fast R-CNN. As you can see in the image below, they say they used a proposal method, but it is not specified how it works. What…

computer-vision object-detection r-cnn selective-search faster-r-cnn

asked May 10 '20 at 12:01

ozoubia

0 answers

Mask R-CNN: how is the inference done?

According to the Mask R-CNN paper and the picture below (taken from the paper), the mask branch is computed in parallel with the bbox classification and regression branches. However, in the paper they write that inference is done differently from…

deep-learning image-segmentation instance-segmentation mask-rcnn faster-r-cnn

asked Feb 16 '22 at 09:45

orbit

0 answers

Why are the learned offsets of anchor boxes in the RCNN object detection models scale invariant?

In the original RCNN paper (https://arxiv.org/pdf/1311.2524.pdf) and continued in later RCNN papers such as faster RCNN (https://arxiv.org/pdf/1506.01497.pdf) the learned offsets of the anchor boxes are scale-invariant. For example the learned…

object-detection r-cnn faster-r-cnn

asked Jan 15 '22 at 16:10

phil

0 answers

In anchor based object detection, why don't the anchors share the same weights?

After reading about YOLO V3 and Faster R-CNN, I don't understand why the weights for the regression head aren't the same across all boxes of the same size. Given that the backbone of these systems is fully convolutional, the location of the…

computer-vision object-detection object-recognition yolo faster-r-cnn

asked Sep 18 '21 at 23:47

FourierFlux

1 answer

In Faster R-CNN, how can I get the predicted bounding box given the neural network's output?

The RPN loss in the Faster R-CNN paper is $$ L(\{p_i\}, \{t_i\}) = \frac{1}{N_{cls}} \sum_{i} L_{cls}(p_i,p_i^*) + \lambda \frac{1}{N_{reg}} \sum_i p_i^* L_{reg}(t_i, t_i^*) $$ For regression problems, we have the following parametrization $$t_x=\frac{x -…

convolutional-neural-networks papers object-detection r-cnn faster-r-cnn

asked Dec 04 '19 at 06:38

user31844
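For the question above about recovering the predicted box from the network's output, a minimal sketch may help. It assumes the parametrization given in the Faster R-CNN paper, where `t_x = (x - x_a) / w_a`, `t_y = (y - y_a) / h_a`, `t_w = log(w / w_a)` and `t_h = log(h / h_a)`, with boxes in center format. The function names `encode_box` and `decode_box` are hypothetical and chosen only for illustration.

```python
import math

def encode_box(box, anchor):
    """Regression targets (t_x, t_y, t_w, t_h) for a box relative to an anchor.

    Both box and anchor are (x_center, y_center, width, height).
    """
    x, y, w, h = box
    x_a, y_a, w_a, h_a = anchor
    return ((x - x_a) / w_a,
            (y - y_a) / h_a,
            math.log(w / w_a),
            math.log(h / h_a))

def decode_box(t, anchor):
    """Invert encode_box: turn network outputs back into a box.

    t is (t_x, t_y, t_w, t_h); the result is (x_center, y_center, width, height).
    """
    t_x, t_y, t_w, t_h = t
    x_a, y_a, w_a, h_a = anchor
    x = t_x * w_a + x_a          # shift the center by a fraction of the anchor width
    y = t_y * h_a + y_a          # shift the center by a fraction of the anchor height
    w = w_a * math.exp(t_w)      # rescale width in log space
    h = h_a * math.exp(t_h)      # rescale height in log space
    return x, y, w, h
```

Decoding is just the algebraic inverse of the encoding, so an output of all zeros reproduces the anchor itself, and encoding followed by decoding should return the original box up to floating-point error.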

1 answer

Confusion about faster RCNN neither object nor background label

I am trying to construct a Faster R-CNN from scratch using Keras. I am generating the tensor which contains whether the anchor at each location corresponds to object or background or neither for training the RPN. The output tensor for the RPN is…

tensorflow python keras faster-r-cnn

asked Jun 22 '21 at 13:54

Abhisek Dash
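On the "neither object nor background" label, one common way to represent it is a three-valued label per anchor, where the "neither" anchors are simply masked out of the RPN loss. The sketch below is illustrative only: it assumes the IoU thresholds from the Faster R-CNN paper (above 0.7 is positive, below 0.3 for every ground-truth box is negative), corner-format boxes, and the label values 1, 0 and -1, which are a convention chosen here. It also leaves out the paper's extra rule that the highest-IoU anchor for each ground-truth box is marked positive. `iou` and `label_anchors` are hypothetical names.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def label_anchors(anchors, gt_boxes, pos_thresh=0.7, neg_thresh=0.3):
    """Return one label per anchor: 1 = object, 0 = background, -1 = ignore."""
    labels = []
    for anchor in anchors:
        # Best overlap of this anchor with any ground-truth box.
        best = max((iou(anchor, gt) for gt in gt_boxes), default=0.0)
        if best > pos_thresh:
            labels.append(1)
        elif best < neg_thresh:
            labels.append(0)
        else:
            labels.append(-1)  # neither: excluded from the loss
    return labels
```

With this convention the network's output shape does not need a third class; anchors labeled -1 just contribute nothing to the classification or regression terms.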

0 answers

How to design training loop in RPN?

I have a short question. I understand the concept of RPN, but one small detail keeps me from implementing it. How should I design the training loop given that I have to use only a subset of anchor boxes (128 positive and 128 negative)? In other…

faster-r-cnn

asked Apr 27 '21 at 15:11

Bogdan Duduc
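For the question above on using only a subset of anchors, one possible approach is to pick the subset per image at each training step and compute the loss only over the selected indices. The sketch below assumes anchors have already been labeled 1 (positive), 0 (negative) or -1 (ignored), and follows the paper's description of sampling 256 anchors with up to a 1:1 positive-to-negative ratio, padding with negatives when there are fewer than 128 positives. The function name `sample_anchor_minibatch` and its parameters are hypothetical.

```python
import random

def sample_anchor_minibatch(labels, batch_size=256, max_pos_fraction=0.5, rng=None):
    """Pick anchor indices for one image's RPN loss.

    labels: sequence of 1 (positive), 0 (negative), -1 (ignored).
    Returns (positive_indices, negative_indices).
    """
    rng = rng or random
    pos = [i for i, label in enumerate(labels) if label == 1]
    neg = [i for i, label in enumerate(labels) if label == 0]

    # At most half of the batch is positive; negatives fill the rest.
    num_pos = min(len(pos), int(batch_size * max_pos_fraction))
    num_neg = min(len(neg), batch_size - num_pos)

    return rng.sample(pos, num_pos), rng.sample(neg, num_neg)
```

In a training loop, the network would still be run on the whole image; the sampled indices are only used to select which anchors' predictions enter the classification loss, and the positive indices alone enter the regression loss.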