Questions tagged [hierarchical-rl]

For questions related to hierarchical reinforcement learning algorithms.

2 votes
0 answers
349 views

Hierarchical Reinforcement Learning is well suited to many complex problems, such as robotic manipulation. Sticking to this example, the basic idea is that instead of learning a sequence of robotic ...
Rexcirus • 1,339
0 votes
1 answer
151 views

I'd like to design a deep learning architecture in which the output of a primary neural network $M_{\theta}$ determines which neural network $N^i_{\alpha}$ in a set of secondary networks $\mathcal{N}$ ...
Wowee • 1
1 vote
1 answer
98 views

I'm thinking of implementing the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation". In this paper, the authors used some custom object ...
noob_rl • 13
2 votes
0 answers
66 views

I've been researching different frameworks for hierarchical RL (mainly options, HAMs, and MAXQ) and noticed that both options and HAMs have names that relate to how they function. I can't seem to find ...
Scorks • 21
1 vote
0 answers
71 views

I want to try a hierarchical reinforcement learning (HRL) approach to hard logical problems with combinatorial complexity, i.e. games like chess or Rubik's cube. The majority of HRL papers I have ...
Tomasz Odrzygozdz
2 votes
0 answers
71 views

Consider a problem where the agent must learn to control a hierarchy of agents acting against another such agent in a competitive environment. The agents on each team need to learn to cooperate in order ...
iceburger • 121
7 votes
1 answer
7k views

According to a lecture (week 10) about Reinforcement Learning [1], the concept of an option allows searching the state space of an agent much faster. The lecture was hard to follow because many new ...
5 votes
1 answer
230 views

I have a system (like a bank) that people (customers) enter according to a Poisson process, so the time between the arrivals of two consecutive customers is a random variable....
Amin • 481
2 votes
2 answers
707 views

I have a steady hex-map and turn-based wargame featuring WWII carrier battles. On a given turn, a player may choose to perform a large number of actions. Actions can be of many different types, and ...
Carrier Battles
4 votes
1 answer
251 views

I read some light material earlier about the possibility of building a hierarchy of agents, where the agents at the leaves solve primitive tasks while higher-level agents are optimized for ...
A.Rashad • 251