Timeline for Reinforcement Learning - What is the logic behind actor-critic methods? Why use a critic?
Current License: CC BY-SA 4.0
2 events
| when | what | | by | license | comment |
|---|---|---|---|---|---|
| Dec 18, 2018 at 16:29 | vote | accept | Gulzar | | |
| Dec 18, 2018 at 2:58 | history | answered | shimao | CC BY-SA 4.0 | |