deep-learning – Page 2

How to format the image data for training/prediction when images are different in size?

January 17, 2023 by Tarik

You didn’t say what architecture you’re talking about. Since you said you want to classify images, I’m assuming it’s a partly convolutional, partly fully connected network like AlexNet, GoogLeNet, etc. In general, the answer to your question depends on the network type you are working with. If, for example, your network only contains convolutional units …

What’s the difference between “hidden” and “output” in PyTorch LSTM?

January 9, 2023 by Tarik

I made a diagram. The names follow the PyTorch docs, although I renamed num_layers to w. output comprises all the hidden states in the last layer (“last” depth-wise, not time-wise). (h_n, c_n) comprises the hidden states after the last timestep, t = n, so you could potentially feed them into another LSTM. The batch dimension …
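The relationship between output and h_n described in this excerpt can be illustrated with a small pure-Python sketch. It is not PyTorch code: the function name toy_stacked_rnn and the tanh update are hypothetical stand-ins for the real LSTM cell, the batch dimension is dropped, and the cell state c_n is omitted. It only mirrors the indexing: output collects the last layer's hidden state at every timestep, while h_n collects every layer's hidden state after the final timestep.

```python
import math

def toy_stacked_rnn(inputs, num_layers):
    """Toy stacked recurrent net over one scalar sequence (no batch dimension).

    The tanh update below is a hypothetical stand-in for a real LSTM cell;
    only the bookkeeping of `output` and `h_n` is the point here.
    """
    h = [0.0] * num_layers  # current hidden state of each layer
    output = []             # last layer's hidden state at every timestep
    for x in inputs:
        layer_in = x
        for layer in range(num_layers):
            h[layer] = math.tanh(0.5 * layer_in + 0.5 * h[layer])
            layer_in = h[layer]  # each layer feeds the layer above it
        output.append(h[-1])
    h_n = list(h)  # every layer's hidden state after the final timestep
    return output, h_n

output, h_n = toy_stacked_rnn([0.1, -0.4, 0.8, 0.3], num_layers=3)
print(len(output))            # one entry per timestep
print(len(h_n))               # one entry per layer
print(output[-1] == h_n[-1])  # both are the last layer at the last timestep
```

Under these assumptions, the two results overlap in exactly one place: the last layer's state at the last timestep appears both as output[-1] and as h_n[-1].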

Why do we “pack” the sequences in PyTorch?

November 17, 2022 by Tarik

I stumbled upon this problem too, and below is what I figured out. When training an RNN (LSTM, GRU, or vanilla RNN), it is difficult to batch variable-length sequences. For example, if the lengths of the sequences in a batch of size 8 are [4,6,8,5,4,3,7,8], you will pad all the sequences and that will result …
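As a rough illustration of the padding cost for the example batch above, the following sketch counts timesteps only. It assumes every sequence is padded to the longest length in the batch; the variable names are illustrative and no PyTorch call is involved.

```python
# Sequence lengths from the example batch of size 8.
lengths = [4, 6, 8, 5, 4, 3, 7, 8]

max_len = max(lengths)                    # every sequence is padded to this length
padded_steps = max_len * len(lengths)     # timesteps processed after padding
real_steps = sum(lengths)                 # timesteps that carry actual data
wasted_steps = padded_steps - real_steps  # timesteps spent on padding only

print(padded_steps, real_steps, wasted_steps)  # 64 45 19
```

Under that assumption, 19 of the 64 timesteps in the padded batch hold no real data.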