Return to Answer

added 24 characters in body

edited May 17, 2023 at 13:27

95.8k
23
246
405

Verify1. Verify that your code is bug free

For2. For the love of all that is good, scale your data

Crawl3. Crawl Before You Walk; Walk Before You Run

Neural4. Neural Network Training Is Like Lock Picking

Non5. Non-convex optimization is hard

Regularization6. Regularization

Keep7. Keep a Logbook of Experiments

fixed grammar

edit approved Aug 31, 2022 at 15:20

Dmytro Savochkin

193
1
4

To achieve state of the art, or even merely good, results, you have to have to have set up all of the parts configured to work well together. Setting up a neural network configuration that actually learns is a lot like picking a lock: all of the pieces have to be lined up just right. Just as it is not sufficient to have a single tumbler in the right place, neither is it sufficient to have only the architecture, or only the optimizer, set up correctly.

added 188 characters in body

edited Feb 28, 2022 at 13:52

95.8k
23
246
405

Variables are created but never used (usually because of copy-paste errors);
Expressions for gradient updates are incorrect;
Weight updates are not applied;
Loss functions are not measured on the correct scale (for example, cross-entropy loss can be expressed in terms of probability or logits)
The loss is not appropriate for the task (for example, using categorical cross-entropy loss for a regression task).
Dropout is used during testing, instead of only being used for training.
Make sure you're minimizing the loss function $L(x)$, instead of minimizing $-L(x)$.

Make sure your loss is computed correctly.

added 231 characters in body

edited Jan 30, 2022 at 13:56

95.8k
23
246
405

Loading

Fix broken link

edit approved Jan 9, 2022 at 16:20

1

Loading

added 283 characters in body

edited Dec 1, 2021 at 14:53

95.8k
23
246
405

Loading

added 247 characters in body

edited Nov 7, 2021 at 17:00

95.8k
23
246
405

Loading

added 111 characters in body

edited Sep 12, 2021 at 1:20

95.8k
23
246
405

Loading

added 138 characters in body

edited Jul 8, 2021 at 15:08

95.8k
23
246
405

Loading

added 6 characters in body

edited Jun 29, 2021 at 22:30

95.8k
23
246
405

Loading

added 294 characters in body

edited Jun 4, 2021 at 13:39

95.8k
23
246
405

Loading

added 92 characters in body

edited Apr 9, 2021 at 16:36

95.8k
23
246
405

Loading

added 698 characters in body

edited Mar 31, 2021 at 18:37

95.8k
23
246
405

Loading

added 110 characters in body

edited Oct 13, 2020 at 14:13

95.8k
23
246
405

Loading

added 261 characters in body

edited Jul 7, 2020 at 5:09

95.8k
23
246
405

Loading

redirect broken link (to archive.org version)

edited Mar 12, 2020 at 13:37

13.3k
3
43
77

Loading

added 65 characters in body

edited Dec 17, 2019 at 19:27

95.8k
23
246
405

Loading

added 119 characters in body

edited Oct 29, 2019 at 2:55

95.8k
23
246
405

Loading

added 433 characters in body

edited Apr 15, 2019 at 14:52

95.8k
23
246
405

Loading

added 147 characters in body

edited Dec 4, 2018 at 17:09

95.8k
23
246
405

Loading

added 174 characters in body

edited Dec 2, 2018 at 21:46

95.8k
23
246
405

Loading

added 247 characters in body

edited Dec 2, 2018 at 19:27

95.8k
23
246
405

Loading

added 120 characters in body

edited Oct 29, 2018 at 18:57

95.8k
23
246
405

Loading

added 160 characters in body

edited Sep 29, 2018 at 17:12

95.8k
23
246
405

Loading

added 162 characters in body

edited Sep 10, 2018 at 18:10

95.8k
23
246
405

Loading

1