remove method GraphManager.emulate_act_on_trainer #178

zach-nervana · 2019-01-03T17:18:06Z

A comment from the code:

# TODO-remove - this is a temporary flow, used by the trainer worker, duplicated from observe() - need to create
#               an external trainer flow reusing the existing flow and methods [e.g. observe(), step(), act()]

It appears that this code is related to, if not the cause of, quite a bit of complexity and duplicate code in the training control flow and steps counting.

The text was updated successfully, but these errors were encountered:

balajismaniam · 2019-01-17T21:51:33Z

There are some subtle differences between the emulate_act_on_trainer() and act(). Same goes for emulate_act_on_observe() and observe(). We can't remove it completely.

We could try to extract out the common code between these funcs. However, it might become a set of common funcs that are too smal (~1-5 lines of code in each). @zach-nervana is that what you want?

zach-nervana · 2019-01-17T22:08:24Z

I agree that cleaning this up is not going to be as simple as extracting duplicate code as functions.

Do you know what the original intent of this comment is?

zach-nervana · 2019-01-18T22:17:37Z

Notes to myself/possible methods of refactoring

Agent.observe and Agent.emulate_observe_on_trainer could be split into 2 functions: observe(env_respons) -> transition and record(transition)
- observe would call record
Agent.act and Agent.emulate_act_on_trainer can be similarly refactored
Agent.emulate_act_on_trainer can likely be reduced to self.total_steps_counter += 1; self.current_episode_steps_counter += 1
current bug?: emulated rewards during emulation do not distinguish between shaped rewards and unshaved rewards
use TotalStepsCounter in Agent instead of three separately named properties: self.training_iteration, self.total_steps_counter, self.current_episode
- this will simplify space methods _should_update, _should_train and _should_update_online_weights_to_target

gal-leibovich · 2019-01-20T08:28:59Z

I agree that cleaning this up is not going to be as simple as extracting duplicate code as functions.

Do you know what the original intent of this comment is?

Yeah, the comment as written is a bit confusing. As far as I remember, the original intent was to just reduce code duplication as much as possible (not just reuse existing methods as-is), as there are common shared parts in both flows, which should be reused.

scttl added this to To do in Coach Dev Jan 10, 2019

galnov moved this from To do to P2 in Coach Dev Jan 13, 2019

scttl added the priority/p2 questions needing answered or medium impact bugs label Jan 16, 2019

scttl moved this from P2 to Groomed but Not Started in Coach Dev Jan 16, 2019

galnov assigned Ajay191191 and balajismaniam Jan 16, 2019

galnov added priority/p1 broken basics or large value add enhancements (highest priority) and removed priority/p2 questions needing answered or medium impact bugs labels Jan 16, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove method GraphManager.emulate_act_on_trainer #178

remove method GraphManager.emulate_act_on_trainer #178

zach-nervana commented Jan 3, 2019 •

edited

Loading

balajismaniam commented Jan 17, 2019

zach-nervana commented Jan 17, 2019

zach-nervana commented Jan 18, 2019

gal-leibovich commented Jan 20, 2019

remove method GraphManager.emulate_act_on_trainer #178

remove method GraphManager.emulate_act_on_trainer #178

Comments

zach-nervana commented Jan 3, 2019 • edited Loading

balajismaniam commented Jan 17, 2019

zach-nervana commented Jan 17, 2019

zach-nervana commented Jan 18, 2019

gal-leibovich commented Jan 20, 2019

zach-nervana commented Jan 3, 2019 •

edited

Loading