Questions tagged [pytorch]

Ask Question

Pytorch is an open source library for Tensors and Dynamic neural networks in Python with strong GPU acceleration. For details, see https://pytorch.org.

708 questions

0 votes

0 answers

6 views

Sequence generation model produces incorrect, but coherent outputs

My model takes in an image of a handwritten equation and converts it into its LaTeX representation. In order to do this, it uses a ResNet50 pre-trained model for feature extraction and a transformer ...

alt_zancudo

asked Nov 12 at 17:27

7 votes

1 answer

112 views

LSTM feature scaling with windowing?

Beginner ML practitioner here. I'm trying to do some time series forecasting on a fairly high resolution dataset that stretches over a long period of time. The values vary pretty widely over time: to ...

Seth

asked Sep 1 at 22:33

2 votes

0 answers

40 views

Fine-tuning YOLO: Directly cloning and modifying the GitHub repo vs. using Transformers library and Hugging Face — pros and cons?

I’m planning to fine-tune a YOLO model for a custom object detection task. There seem to be two main approaches: Clone the official YOLO GitHub repository (e.g., YOLOv5 or YOLOv8), adjust the codebase ...

chhu

asked Aug 10 at 16:26

3 votes

1 answer

47 views

Single nn.Embedding instance vs mulitple nn.Embedding instances

I am trying to determine if using multiple instances of nn.Embedding() has any value over using a single instance in training a model. As an example, let's say I ...

CCranney

asked Jul 30 at 21:51

0 votes

0 answers

16 views

terrible performance on CIFA10 using SWIN model

I am trying to apply the idea from Embedding Deep Networks into Visual Explanations and see if it works on Transformers. The performance is terrible because the accuracy hasn't passed 10%. Can someone ...

Simple

asked Jul 11 at 22:18

3 votes

1 answer

54 views

Model seems to peek into target sequence and cheat during training despite using masking

I am using CNN-transformer hybrid architecture to detect handwritten equation and convert them to LaTex strings. All target sequences (the actual LaTex representation of a handwritten equation) are ...

alt_zancudo

asked Jul 2 at 8:54

0 votes

1 answer

57 views

when testing with shuffled data, accuracy is high, but when testing with unshuffled data, accuracy is low

To be clear, I shuffled my data when I trained it. It is only the testing data that I modified to be unshuffled, and found that accuracy tanks. (i also used the same data for training and for testing)

Oyomot

asked Jun 30 at 3:54

0 votes

1 answer

37 views

Why is my upscaling gan not working?

I have been trying to code an upscaling gan but while the code run, I pretty much always end up with terrible result when the gan doesn't collapse, collapse which happen often. I previously tried to ...

Freeziey

asked May 14 at 10:49

15 30 50 per page

2 3 4 5

…

48 Next

Stack Exchange Network

Questions tagged [pytorch]

Sequence generation model produces incorrect, but coherent outputs

LSTM feature scaling with windowing?

Fine-tuning YOLO: Directly cloning and modifying the GitHub repo vs. using Transformers library and Hugging Face — pros and cons?

Single nn.Embedding instance vs mulitple nn.Embedding instances

terrible performance on CIFA10 using SWIN model

Model seems to peek into target sequence and cheat during training despite using masking

when testing with shuffled data, accuracy is high, but when testing with unshuffled data, accuracy is low

Why is my upscaling gan not working?

Hot Network Questions