Skip to main content

Questions tagged [reinforcement-learning]

2 votes
2 answers
267 views

In fields such as game theory and reinforcement learning, it is standard to consider the regret-minimization strategy. I don't get the motivation for the definition. Yes, doing your best under worst-...
Amit Keinan's user avatar
1 vote
0 answers
59 views

I think R1 was born as an exploration of the AGI approach, and the design of the whole scheme is in line with Professor Sutton's philosophy of "searching + learning to find a meta-approach that ...
Wyatt's user avatar
  • 111
1 vote
1 answer
76 views

I am studying from the MARL textbook by Albrecht, Christianos and Schäfer. They define a stochastic game in Sec 3.3 as the multi-agent version of an MDP. In Fig 3.3 (pg 50) they give an intuition for ...
Arvind Raghavan's user avatar
1 vote
1 answer
60 views

Though the reward was assigned by the environment, the once the policy $\pi$ was fixed, the probability of the action on the states $\pi(a|s)$ could be assigned. However, this meant given different ...
ShoutOutAndCalculate's user avatar
0 votes
1 answer
146 views

In my PhD, I will work with ML models. However, I will only use ready-made models as a tool, but I want to delve deeper into Artificial Intelligence not just to use ready-made models, but to ...
Everson's user avatar
1 vote
0 answers
140 views

Google DeepMind recently published a new paper which describes how they used a reinforcement learning to discover faster sorting algorithms. A summary of the paper is here and the paper is here. It ...
equis's user avatar
  • 11
0 votes
2 answers
163 views

I am new to reinforcement learning, and recently came across the following issue. When implementing a multi-armed bandit algorithm, we assume we have k machines with reward probabilities [p_1,..., p_k]...
alextan's user avatar
1 vote
0 answers
164 views

There are various meta-learning algorithms in RL that are proposed for settings when we have a (deep) neural network and the policy (or the value function) are parameterized as such. Can these methods ...
Perissiane's user avatar
0 votes
1 answer
119 views

I read this article, mentioning that either here, or StackOverflow would be the best places to ask generic machine learning questions, however, if the question isn't programming specific with a ...
wildcat89's user avatar
  • 103
1 vote
0 answers
80 views

I'm writing an AI based on reinforcement learning to play Connect 4. That's my second bot and attempt to RNN and AI (first was copy a code of snake RNN AI from youtube) and I'm looking for some ...
Saguro's user avatar
  • 11
1 vote
0 answers
38 views

Is there any free/open-source environment, tasks, or dataset for evaluating deep RL algorithms in terms of safety? all available environments (like openAI's) are environments for simple games. These ...
mac179's user avatar
  • 161
1 vote
0 answers
62 views

I am currently a Ph.D. student in the computer science department, I was given the subject of Deep RL for Healthcare. However, after lots of research on the internet, I could not find any free dataset ...
mac179's user avatar
  • 161
2 votes
0 answers
55 views

Recently, I had an idea of a novel Deep RL algorithm that might perform better than existing algorithms such as DQN, TRPO, PPO, etc. However, I do not know of a website or a research paper that might ...
mac179's user avatar
  • 161
2 votes
1 answer
108 views

Note: I consider myself to be a beginner in the field of Deep RL. Deep RL has proven tremendous success in recent years like playing atari and beating go champion. Therefore, considerable interest for ...
mac179's user avatar
  • 161
4 votes
2 answers
2k views

I was watching a video on Reinforcement Learning by Andrew Ng, and at about minute 23 of the video he mentions that we can represent the Bellman equation as a linear system of equations. I am talking ...
krishnab's user avatar
  • 171

15 30 50 per page
1
2 3 4 5
7