Questions tagged [image]
The image tag has no summary.
87 questions
0 votes
1 answer
30 views
Why is an image made of many images?
I downloaded an image and displayed it using Python. image = imread('spheres.jpg'); print(image.shape)  # (398, 429, 3) As we can see, the image shape has 3 dimensions....
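A minimal sketch of what that third dimension holds, using a synthetic NumPy array as a stand-in for spheres.jpg (an assumption, since the file itself is not available here). For a color image, the last axis usually indexes the color channels, so the array can be read as three stacked 2D images; the channel order depends on the library that loaded it.

```python
import numpy as np

# Synthetic stand-in for imread('spheres.jpg'): height 398, width 429, 3 channels.
image = np.zeros((398, 429, 3), dtype=np.uint8)
image[..., 0] = 255  # fill only the first channel so the planes differ

# Slice along the last axis to get one 2D image per channel.
channels = [image[..., c] for c in range(image.shape[2])]
shapes = [ch.shape for ch in channels]

print(image.shape)  # shape of the full array
print(shapes)       # shape of each single-channel image
```

Each element of `channels` is an ordinary 2D grayscale image of the same height and width as the original.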
1 vote
1 answer
537 views
Downsampling an image in the right way
I was not sure where to ask this question, but SE Data Science seems to be the best place for it. I tried to build a CNN-based super-resolution model. Unfortunately, I have only high-res images but ...
0 votes
0 answers
42 views
How to get the wattage recorded by a DSLR from an image?
I am doing a physics lab where I need to take photos of a light whose intensity changes. I have the photos, but I need to understand how to get the average wattage from the images. There ...
1 vote
1 answer
83 views
How to use additional features in image captioning?
I have the following question: is it possible to train a model based on the Transformer architecture to use additional attributes to generate a caption for an image? For example, I have a dataset with ...
6 votes
2 answers
2k views
How to remove hotspots from a given image using Python and OpenCV?
In the picture below there are some regions which are very bright (i.e. more white). Some bright regions are wide and some are narrow or thin. The red box covers one such wide bright spot, and blue ...
0 votes
1 answer
774 views
OpenCV add/subtract functions produce different results from numpy array add/subtract
I'm trying to brighten and dim an image using OpenCV with two approaches. Approach 1: Used OpenCV's add and subtract functions to ...
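One well-known source of such differences, which may or may not be the cause in this truncated question, is overflow handling on 8-bit images: OpenCV's add and subtract saturate results to the range 0 to 255, while plain NumPy uint8 arithmetic wraps around modulo 256. The sketch below reproduces both behaviours with NumPy alone; the clipping step is an emulation of saturating arithmetic, not a call into OpenCV, and the sample pixel values are made up for illustration.

```python
import numpy as np

# Hypothetical 8-bit pixel values and a brightness offset of 100.
pixels = np.array([10, 200, 250], dtype=np.uint8)
delta = np.full_like(pixels, 100)

# Plain uint8 array arithmetic wraps around modulo 256.
wrapped_add = pixels + delta
wrapped_sub = pixels - delta

# Saturating arithmetic: widen the type, compute, clip to [0, 255], narrow back.
wide = pixels.astype(np.int16)
saturated_add = np.clip(wide + 100, 0, 255).astype(np.uint8)
saturated_sub = np.clip(wide - 100, 0, 255).astype(np.uint8)

print("wrapped:  ", wrapped_add, wrapped_sub)
print("saturated:", saturated_add, saturated_sub)
```

Bright pixels that overflow under wraparound turn dark instead of staying white, which is why the two approaches can produce visibly different images.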
1 vote
1 answer
906 views
Is it possible to reverse the layers of a convolutional neural network?
From my understanding, a convolutional neural network typically takes a matrix (e.g. an image) as input, and its output is either an integer or a vector of integers in regression and in classification a ...
1 vote
1 answer
309 views
Autoencoder: How should hidden layer be used?
I'm building a variational autoencoder to generate faces. I'm using grayscale images of size 30x30. I started with a very simple model: Input Layer: 900 nodes, values 0-1; Latent Space: 10 nodes ...