Questions tagged [semantic-segmentation]
25 questions
4
votes
1 answer
How does Mask R-CNN automatically output a different number of objects on the image?
Recently, I was reading Pytorch's official tutorial about Mask R-CNN.
When I run the code on colab, it turned out that it automatically outputs a different number of channels during prediction. If the image has 2 people on it, it would output a mask…

dato nefaridze
- 862
- 6
- 20
3
votes
3 answers
If I trained a model to perform semantic segmentation on images with only one object, would it also work on images with multiple objects?
I'm working on semantic segmentation tasks in the medical space using the U-Net. Let's say that I train a U-Net model on medical images with the goal of segmenting out, say, ligaments, from a medical image. If I train that model on images that…

cmed123
- 131
- 1
2
votes
0 answers
Why CNN inference works on larger images
I have been reading up on 'regular' CNN's such as Mask R-CNN, and as far as I understand it they rely on a fully connected layer in the end to classify pixels. FCN's (such as U-Net) which do not use these layers are able to effectively process…

Cartman123
- 121
- 2
2
votes
0 answers
What is meant by Hinton when he refers to "Part-Whole Hierarchies" in his GLOM framework
I was recently reading Hinton's GLOM idea How to represent part-whole hierarchies in a neural network, and I am simply unsure about what exactly he means when he says parsing images into "part-whole hierarchies".
Moreover, wouldn't semantic…

ElectricMinimum58
- 21
- 1
2
votes
1 answer
What does the "number of channels" correspond to in U-Net?
I'm studying the U-Net CNN architecture. I'm new to CNNs and am confused regarding the "number of channels".
Referring to the U-Net diagram, the input image is convolved with a 3x3 mask which generates a 570x570 output. This output image is then…

Izzo
- 123
- 1
- 5
2
votes
0 answers
Semantic segmentation - background or ignore for non-target classes?
I am training a deep learning model for semantic segmentation. I am using the cityscapes dataset for training/evaluation.
In cityscapes, there are 34 classes, and of which, we consider only 19 classes and the rest of the classes are ignored. For…

147956
- 33
- 2
1
vote
0 answers
Resizing segmentation masks
I am trying to run a semantic segmentation model.
The problem is, my data has different resolution for every image and for corresponding segmentation map (image and corresponding segmentation map have same aspect ratio. But aspect ratio changes for…

programmer_04_03
- 53
- 4
1
vote
1 answer
How to identify and diferentiate several edge lines of an object?
I want to create an AI to detect and identify certain edge lines on my image. The input image is a locker key, and I want to know the exact position of certain edges.
Sample input image:
Sample output image. As output i have each red line…

Lluis C
- 111
- 3
1
vote
1 answer
How does the classification head of EfficientDet work?
EfficientDet outputs classes and bounding boxes. My question is about both but specifically I am interested in the class prediction net part. In the paper's diagram it shows 2 conv layers. I don't understand the code and how it works. And what's the…

ML Dev
- 11
- 3
1
vote
3 answers
What is the difference (if any) between semantic segmentation and multi-class, mutually exclusive classification?
Multi-class classification is simply assigning all data points into one of up to any finite number of mutually exclusive labels. I am new to the field(s) of AI/ML and I keep hearing people use the term "semantic segmentation."
I want to "translate"…

brethvoice
- 115
- 7
1
vote
0 answers
Pixel values of segmap in multi-class semantic segmentation
I'm preparing a dataset for a multiclass semantic segmentation using U-Net like architecture. To be precise, I've got it ready but a question came to my mind. How does pixel values of a segmentation map influence the training?
Also, is it better to…

Nuwanda
- 11
- 2
1
vote
1 answer
Why does my model not improve when training with mini-batch gradient descent, while it does with Adam?
I am currently experimenting with the U-Net. I am doing semantic segmentation on the 2018 Data Science Bowl dataset from Kaggle without any data augmentation.
In my experiments, I am trying different hyper-parameters, like using Adam, mini-batch GD…

Bert Gayus
- 545
- 3
- 12
0
votes
0 answers
Batch Normalization Layer is not learning the data semantics of a dataset comprised of datasets from different sources
I have built a dataset for image segmentation that is comprised of datasets from several different sources.
Almost all of my models have problems with learning the correct parameters of the batchnormalization layer, the networks are very deep and it…

user199590
- 125
- 6
0
votes
0 answers
how to preprocess satellite imagery for semantic segmentation?
I have to train binary semantic segmentation in Python(using Tenforflow, rasterio, geopandas, sh). And I have raster .tif images and vector .shp images which is mask. I know that I should divide each images into sub-images(tiles). But I need some…

NailaBagir
- 1
- 1
0
votes
0 answers
How to compare word segmentation methods?
I am comparing a few methods of word segmentation in artificial language without dictionary and "golden" segmentation.
Let's say, idolikecats is splitted by three different algorithms to i do like cats, ido li kecats and ido lik cat s.
Is there a…

dobrowol
- 1
- 1