This repository has been archived by the owner on Dec 11, 2022. It is now read-only.

Example/preset using a CompoundActionSpace #148

Open
jamescasbon opened this issue Dec 10, 2018 · 6 comments
Labels
priority/p3 enhancements not currently in focus or low impact bugs
Comments

@jamescasbon
Contributor

Hi,

I see the StarCraft example can use the CompoundActionSpace, but are there any examples of how the agent should be configured? It's unclear to me which presets should support this action space.

thanks!

@zach-nervana
Contributor

The parts of the network architecture which deal specifically with the action spaces are called heads. It appears that the only head which supports CompoundActionSpace is PolicyHead. This head is used by the actor critic agent by default so I would expect that it should work. @galleibo-intel may know of other heads which are already compatible.
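To make the head idea above concrete, here is a minimal sketch (plain Python, not Coach's actual API) of how a policy head over a compound action space can emit one categorical distribution per sub-space and sample one sub-action from each:

```python
# Illustrative sketch only: names and shapes are assumptions, not Coach's
# PolicyHead implementation. The idea: for a compound action space, the
# head outputs one softmax distribution per sub-action space, and a full
# action is a tuple with one component drawn from each distribution.
import math
import random

def softmax(logits):
    """Convert raw scores into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def compound_policy_head(per_subspace_logits):
    """One softmax per sub-action space."""
    return [softmax(logits) for logits in per_subspace_logits]

def sample_compound_action(distributions, rng=random):
    """Sample each sub-action independently from its distribution."""
    action = []
    for probs in distributions:
        r = rng.random()
        cum = 0.0
        for idx, p in enumerate(probs):
            cum += p
            if r <= cum:
                action.append(idx)
                break
        else:  # guard against floating-point round-off
            action.append(len(probs) - 1)
    return tuple(action)

# Example: a 3-way sub-space (e.g. an action type) and a 5-way sub-space
# (e.g. a target) produce a 2-component compound action.
dists = compound_policy_head([[0.1, 2.0, 0.5], [0.0, 0.0, 1.0, 0.0, 0.0]])
action = sample_compound_action(dists)
```

The per-sub-space factorization is the key point: the network needs one output per sub-space, not one output over the (possibly huge) cross product of all sub-spaces.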

Is there a particular configuration you are interested in?

@gal-leibovich
Contributor

Yep. At the moment, only PolicyHead supports CompoundActionSpace, and it is not in use in any of our existing presets. It is currently merely infrastructure for allowing future extensibility. In StarCraft, the goal is to allow the use of the full action space (compared to what is used in StarCraft's presets: the X,Y coordinates to move the troops to).
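The structure being described can be sketched as follows. This is a toy illustration of the idea, with class names that merely echo Coach's; it is not the library's implementation, and the sub-space choices (a 10-way discrete function id plus an 84x84 coordinate box) are invented StarCraft-flavored examples:

```python
# Toy sketch of a compound action space: a compound action carries one
# sub-action per member sub-space. Class names echo Coach's but this is
# NOT rl_coach's actual code or constructor signatures.
import random

class DiscreteActionSpace:
    def __init__(self, num_actions):
        self.num_actions = num_actions
    def sample(self, rng):
        return rng.randrange(self.num_actions)

class BoxActionSpace:
    """Continuous sub-space, e.g. X,Y screen coordinates."""
    def __init__(self, low, high):
        self.low, self.high = low, high
    def sample(self, rng):
        return [lo + rng.random() * (hi - lo)
                for lo, hi in zip(self.low, self.high)]

class CompoundActionSpace:
    """A compound action is a tuple: one sub-action per sub-space."""
    def __init__(self, sub_spaces):
        self.sub_spaces = sub_spaces
    def sample(self, rng):
        return tuple(s.sample(rng) for s in self.sub_spaces)

rng = random.Random(0)
# Full-action-space flavor: choose a function id, then coordinates for it.
space = CompoundActionSpace([
    DiscreteActionSpace(10),
    BoxActionSpace([0.0, 0.0], [84.0, 84.0]),
])
act = space.sample(rng)
```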

@jamescasbon
Contributor Author

Thank you, that's enough for me to get on with this. I hadn't quite got that this was the head part of the net, but it's obvious in hindsight.

@jamescasbon
Contributor Author

Spoke too soon....

I tried:

```python
agent_params = ActorCriticAgentParameters()
agent_params.exploration = {CompoundActionSpace: CategoricalParameters()}
```

  1. Which exploration policies should support compound action spaces?
  2. The ActorCriticAgent inherits from PolicyOptimizationAgent and therefore throws from here https://github.com/NervanaSystems/coach/blob/master/rl_coach/agents/policy_optimization_agent.py#L161 in choose_action.

Would adding random_action support for this space to the actor critic agent be sufficient to get this to work?
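The random_action idea suggested above can be sketched like this. A minimal illustration under assumed names (sub_space_sizes, epsilon, and the function names are all hypothetical, not Coach's API): a uniform random compound action samples each discrete sub-space independently, and an epsilon-greedy wrapper falls back to it during exploration:

```python
# Hedged sketch of the suggestion in this comment, not rl_coach code.
# All names here (sub_space_sizes, epsilon, choose_action) are invented
# for illustration.
import random

def random_compound_action(sub_space_sizes, rng=random):
    """Uniform random compound action: one independent draw per
    discrete sub-space."""
    return tuple(rng.randrange(n) for n in sub_space_sizes)

def choose_action(greedy_action, sub_space_sizes, epsilon, rng=random):
    """Epsilon-greedy over a compound space: with probability epsilon,
    sample every sub-action uniformly; otherwise return the policy's
    own action."""
    if rng.random() < epsilon:
        return random_compound_action(sub_space_sizes, rng)
    return greedy_action

# With epsilon=0 the policy's action is always returned unchanged.
```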

@jamescasbon jamescasbon reopened this Dec 11, 2018
@ryanpeach
Contributor

Also looking forward to this working.

@scttl scttl added this to To do in Coach Dev Jan 11, 2019
@galnov galnov added the priority/p3 enhancements not currently in focus or low impact bugs label Jan 16, 2019
@scttl scttl moved this from Requires Grooming to Groomed but Not Started in Coach Dev Jan 17, 2019
@rmitsch

rmitsch commented Apr 29, 2020

What's the current status on this? Which agent(s)/agent configurations support CompoundActionSpace?

6 participants