Questions tagged [faster-rcnn]
The faster-rcnn tag has no summary.
48 questions
0 votes
1 answer
68 views
MaskRCNN.train gives 'list index out of range'
I have been trying to use MaskRCNN with a Resnet backbone on the DeepFashion2 Dataset for instance segmentation. The custom configurations are as follows: ...
1 vote
1 answer
59 views
How are OCR training datasets constructed?
For the sake of concreteness: let's suppose that the word "OCR" refers to any OCR system build on an R-CNN architecture. Similarly, in aims of simplicity, let's declare that we are ...
2 votes
0 answers
82 views
Which model is used for document extraction (CamScanner, Microsoft Lens etc)
I want to start a small project where I'd create a model(s) that would extract document from a picture and rescale it, something like CamScanner or Microsoft Lens apps do. I've gathered a small ...
1 vote
0 answers
179 views
Understanding how anchors are created in a regional proposal network
I understand that in Faster R-CNN, the image is fed into a pre-trained CNN (such as VG16). So say I have a 37x50x512 feature map. Firstly, I assume that each feature map (37x50x1) is fed into the RPN? ...
2 votes
1 answer
837 views
Count repeating "objects" in a picture
This is my first data-science project and I would love to get some guidance to know how to get started. My problem is the following: I want to count objects that are in a picture. This picture has a ...
0 votes
1 answer
2k views
How to interpret fast-rcnn metrics?
I'm following this tutorial to fine tune Faster RCNN model, during training process a lot of statistics are produced however I don't know how to interpret them. what are major characteristics to look ...
1 vote
0 answers
144 views
Region Proposal Network - How to subsampling 256 fg/bg anchors
I am trying to understand training process of RPN. I have problem with creating mini batches of 256 anchors. If features map has shape 18x25=450 and every position has 9 anchors it is 4050 potential ...
1 vote
2 answers
399 views
Training Object Detection model on just 10 images
I am trying to train an object detection model using Mask-RCNN with Resnet50 as backbone. I am using the pre-trained models from PyTorch's Torchvision library. I have only 10 images that I can use to ...