Adam optimizer goes haywire after 200k batches, training loss grows
Yes. This is a known problem with Adam. The update equations for Adam are:

```
t <- t + 1
lr_t <- learning_rate * sqrt(1 - beta2^t) / (1 - beta1^t)
m_t <- beta1 * m_{t-1} + (1 - beta1) * g
v_t <- beta2 * v_{t-1} + (1 - beta2) * g * g
variable <- variable - lr_t * m_t / (sqrt(v_t) + epsilon)
```
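For reference, here is a minimal NumPy sketch of a single Adam step following the equations above. The function name `adam_update`, its signature, and the way state is passed around are illustrative, not TensorFlow's actual API; the default hyperparameters roughly match TensorFlow's defaults.

```python
import numpy as np

def adam_update(variable, g, m, v, t, learning_rate=0.001,
                beta1=0.9, beta2=0.999, epsilon=1e-8):
    """One Adam step per the pseudocode above (illustrative sketch only)."""
    t += 1
    # bias-corrected step size
    lr_t = learning_rate * np.sqrt(1 - beta2**t) / (1 - beta1**t)
    # exponential moving averages of the gradient and squared gradient
    m = beta1 * m + (1 - beta1) * g
    v = beta2 * v + (1 - beta2) * g * g
    # parameter update; epsilon guards against division by a tiny sqrt(v)
    variable = variable - lr_t * m / (np.sqrt(v) + epsilon)
    return variable, m, v, t
```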