I am currently training a ResNet18 model with a custom optimizer in PyTorch.
I am using CrossEntropyLoss() and the ResNet18 model from PyTorch. In tensorflow the outputs are of the desired shape, but in pytorch it is necessary to find the argmax of the predicted labels in order to find the accuracy.
If my batch size = 64 with the resnet model, why is the predicted label of shape [64, 1000]?
What do the 1000 values correspond to?