Timeline for Formulation of a reward structure
Current License: CC BY-SA 4.0
5 events
| when | what | by | license | comment | |
|---|---|---|---|---|---|
| Nov 27, 2019 at 7:34 | comment | added | chink | Hi, I have added a separate question with more details. | |
| Nov 26, 2019 at 15:14 | comment | added | chink | Sure, I will post a new question with more details. Thank you! | |
| Nov 26, 2019 at 15:13 | comment | added | Neil Slater | @cvg: I cannot tell from your description. You should ask a new question and give a lot more details of your environment, including your top-level goals for the task. | |
| Nov 26, 2019 at 14:42 | comment | added | chink | I am training an agent for a control problem. If the action taken causes the new state to be within limits/boundaries, I call it a good action and give a very high positive reward; if the action taken causes the new state to be out of bounds, I give a negative reward based on how bad the new state is. Is this kind of reward formulation recommended for my use case, or not? | |
| Nov 26, 2019 at 13:07 | history | answered | Neil Slater | CC BY-SA 4.0 |
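
The reward scheme described in the Nov 26, 2019 at 14:42 comment could be sketched roughly as follows. This is only an illustration under stated assumptions: the state is a single scalar, "how bad" is taken to mean the distance outside the allowed range, and the function name, the bonus value, and the penalty scale are all hypothetical, not taken from the original discussion.

```python
def bounded_state_reward(state, low, high, in_bounds_reward=10.0, penalty_scale=1.0):
    """Hypothetical reward for a scalar state with allowed range [low, high].

    Returns a fixed positive bonus when the new state is within bounds,
    otherwise a negative reward proportional to the size of the violation.
    """
    if low <= state <= high:
        # New state is inside the limits: fixed positive reward (assumed value).
        return in_bounds_reward
    # Distance by which the state falls outside the allowed range.
    violation = (low - state) if state < low else (state - high)
    # Larger violations are penalised more heavily.
    return -penalty_scale * violation
```

Whether such a formulation suits a given task depends on the environment and the top-level goals, which is why the reply in the timeline asks for more details.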