A3C presets for ControlSuite #146

KeAWang · 2018-12-10T00:57:33Z

Following the settings in rl_coach/presets/Mujoco_PPO.py and rl_coach/presets/ControlSuite_DDPG.py, I'm creating my own presets to run A3C on ControlSuite with the following preset ControlSuite_A3C.py file:

from rl_coach.agents.actor_critic_agent import ActorCriticAgentParameters
from rl_coach.base_parameters import VisualizationParameters, PresetValidationParameters
from rl_coach.core_types import TrainingSteps, EnvironmentEpisodes, EnvironmentSteps
from rl_coach.environments.control_suite_environment import ControlSuiteEnvironmentParameters, control_suite_envs
from rl_coach.environments.environment import SingleLevelSelection
from rl_coach.filters.filter import InputFilter
from rl_coach.filters.observation.observation_normalization_filter import ObservationNormalizationFilter
from rl_coach.filters.reward.reward_rescale_filter import RewardRescaleFilter
from rl_coach.graph_managers.basic_rl_graph_manager import BasicRLGraphManager
from rl_coach.graph_managers.graph_manager import ScheduleParameters

####################
# Graph Scheduling #
####################
schedule_params = ScheduleParameters()
schedule_params.improve_steps = TrainingSteps(10000000000)
schedule_params.steps_between_evaluation_periods = EnvironmentEpisodes(10)
schedule_params.evaluation_steps = EnvironmentEpisodes(1)
schedule_params.heatup_steps = EnvironmentSteps(0)

#########
# Agent #
#########
agent_params = ActorCriticAgentParameters()
agent_params.algorithm.apply_gradients_every_x_episodes = 1
agent_params.algorithm.num_steps_between_gradient_updates = 10000000
agent_params.algorithm.beta_entropy = 0.0001
agent_params.network_wrappers['main'].learning_rate = 0.00001

agent_params.input_filter = InputFilter()
agent_params.input_filter.add_reward_filter('rescale', RewardRescaleFilter(1/20.))
agent_params.input_filter.add_observation_filter('observation', 'normalize', ObservationNormalizationFilter())

###############
# Environment #
###############
env_params = ControlSuiteEnvironmentParameters(level=SingleLevelSelection(control_suite_envs))

########
# Test #
########
preset_validation_params = PresetValidationParameters()
preset_validation_params.trace_test_levels = ['cartpole:swingup', 'hopper:hop']
preset_validation_params.test = True
preset_validation_params.min_reward_threshold = 400
preset_validation_params.max_episodes_to_achieve_reward = 1000
preset_validation_params.num_workers = 8

graph_manager = BasicRLGraphManager(agent_params=agent_params, env_params=env_params,
                                    schedule_params=schedule_params, vis_params=VisualizationParameters(),
                                    preset_validation_params=preset_validation_params)

However, running python rl_coach/coach.py -p ControlSuite_A3C -lvl cartpole:swingup in the command line keeps returning the error ValueError: The key for the input embedder (observation) must match one of the following keys: dict_keys(['measurements', 'action', 'goal']).

I'm not trying to access the keys of the input embedder in my preset above at all so I'm not sure why I get this error. How can I fix this issue?

The text was updated successfully, but these errors were encountered:

ujjawalchugh97 · 2019-09-16T06:08:57Z

@KeAWang I'm facing the same issue on a different preset. Were you able to solve this?

ujjawalchugh97 · 2019-09-16T06:11:05Z

@scttl any update on this?

KeAWang changed the title ~~A3C presets~~ Dec 10, 2018

scttl added this to To do in Coach Dev Jan 11, 2019

galnov added the priority/p2 questions needing answered or medium impact bugs label Jan 16, 2019

scttl moved this from Requires Grooming to Groomed but Not Started in Coach Dev Jan 17, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A3C presets for ControlSuite #146

A3C presets for ControlSuite #146

KeAWang commented Dec 10, 2018 •

edited

Loading

ujjawalchugh97 commented Sep 16, 2019

ujjawalchugh97 commented Sep 16, 2019

A3C presets for ControlSuite #146

A3C presets for ControlSuite #146

Comments

KeAWang commented Dec 10, 2018 • edited Loading

ujjawalchugh97 commented Sep 16, 2019

ujjawalchugh97 commented Sep 16, 2019

KeAWang commented Dec 10, 2018 •

edited

Loading