Skip to main content

Timeline for Formulation of a reward structure

Current License: CC BY-SA 4.0

5 events
when toggle format what by license comment
Nov 27, 2019 at 7:34 comment added chink Hi, I have added a separate question with more details.
Nov 26, 2019 at 15:14 comment added chink sure, will have a new question with more details. thank you !!
Nov 26, 2019 at 15:13 comment added Neil Slater @cvg: I cannot tell from your description.You should ask a new question and give a lot more details of your environment including your top-level goals for the task.
Nov 26, 2019 at 14:42 comment added chink I am training agent for a control problem, if the action taken causes the new state to be in limits/boundaries , i am referring it as a good action and giving a very high positive reward, if the action taken causes the new state to be out of bounds i am giving a negative reward based on how bad the new state is. Is it recommended to have this kind of reward formulation for my use case or its not recommended?
Nov 26, 2019 at 13:07 history answered Neil Slater CC BY-SA 4.0