Skip to main content

Questions tagged [masking]

4 votes
0 answers
29 views

I have a large dataset (~10M points) in python and I want to filter it using a large number of different custom masks, as part of calculations to create a new but related dataset. Because the dataset ...
quail's user avatar
  • 41
0 votes
0 answers
50 views

I have been thinking about the Masking in the Self attention of the decoder in the context of the training for a long time and doesn't really make sense to me. I have browsed through a lot of sources ...
struggling_student's user avatar
1 vote
0 answers
225 views

When I generate a src_mask like this ...
kot's user avatar
  • 11
1 vote
1 answer
168 views

Suppose I am working with a Masked Language Model to pre-train on a specific dataset. In that dataset, most sequences have a particular token of a high frequency ...
neel g's user avatar
  • 227
2 votes
1 answer
768 views

I have a question about the decoder transformer feed forward during training. Let's pick an example: input data "i love the sun" traduction i want to ...
erre4's user avatar
  • 95