Questions tagged [u-net]
For questions related to the U-net, a neural network proposed in "U-Net: Convolutional Networks for Biomedical Image Segmentation" (2015) by Olaf Ronneberger et al. for semantic segmentation.
51 questions
0 votes
0 answers
26 views
neural network for learning the permeter of objects from binary images
I have a binary image that is the Canny output of an image containing some objects and my goal is to learn to recognise the corners of the objects and measure the perimeters. Example: The goal is to ...
0 votes
0 answers
24 views
How to compute local text orientation and interline distance for curved baselines in ARU-Net (Algorithm S.1)?
I'm implementing Algorithm S.1 (Pixel Ground Truth Generation) from the ARU-Net paper by Grüner et al., but I'm having trouble understanding a few key steps in the context of non-linear (curved) ...
0 votes
1 answer
153 views
Low validation loss from the first epoch?
The initial validation loss is low from the first epoch and then decreases slightly. What does this actually mean? Does it indicate that the model can effectively and quickly identify patterns for ...
1 vote
0 answers
36 views
Adding extra parameters to the inputs of a network?
I have a bunch of images of cells from a brightfield microscope that I want to segment. Cells have different morphologies, shapes, etc. depending on cell lines. I also have some metadata associated ...
2 votes
0 answers
94 views
Why does UNet often output noisy pattern in blank/homogeneous region?
I am recently implementing DDPM model from scratch, and I discovered that UNet often tends to give noisy output in blank region. Here is an example with FashionMNIST, my DDPM seems to generate OK ...
2 votes
1 answer
266 views
How do I input multi-channel Numpy array to U-net for semantic segmentation
I had lidar 3D point cloud data from semantckitti. I want to perform Semantic Segmentation on the data using U-Net. I converted the 3d point cloud data into 2D using spherical conversion and saved the ...
0 votes
1 answer
135 views
How to remove random noise from an image (denoising)?
When adding noise to an image, for instance, is the noise added evenly random (equally likely values within some range), or random but following some distribution (like the normal distribution)? Then,...
3 votes
1 answer
578 views
How to perform latent space Interpolation between two images?
I have a variational convolutional autoencoder that has trained on 2 images and outputs a linear interpolation (inserted at the bottleneck stage) between those 2 input images. However, the result ...
0 votes
1 answer
129 views
3D Unet gives "output size is too small" error [closed]
I wrote simple 3D-Unet arch in pytorch to do segmentation on 3D images. ...
0 votes
1 answer
384 views
Image segmentation with varying resolution
I am looking to create a model that is able to perform binary segmentation of images with varying resolutions. For model should be able to classify tree or not tree regardless of the resolution of the ...
0 votes
1 answer
137 views
Which models can be applied recursively?
I come from a math background, so I am not up-to-date with machine learning literature. For the purpose of learning dynamics, I would like to train a model to minimize the following loss: $$\mathcal{L}...
3 votes
1 answer
8k views
What is the role of skip connections in U-Net?
I was able to find that the skip connections used in U-Net help to recover fine grained details in the prediction, however I do not understand what is meant by this. Besides, I was wondering what ...
5 votes
3 answers
14k views
Why diffusion model always use U-Net?
I want to know why diffusion models always use U-Net. In my opinion, they use U-Net because you can see features of different resolutions and skip connection is good to add detail of images. But I am ...
3 votes
2 answers
441 views
Learning an identity function with convolutional networks
I am trying to train networks to achieve what I expected to be a trivial task: learn the identity mapping. However, this is very hard to achieve, and the optimization is hard. Moreover, I don't want ...
0 votes
1 answer
369 views
U-Net Maxpooling vs Convolution
Hello I'm implementing a CycleGAN and most of the other implementations I've seen on the internet use Convolution with stride 2 instead of a Maxpoolinglayer for downsample. On to my question, why ...