Checkpointing Features #177

bbalaji-ucsd · 2019-01-02T16:33:11Z

Right now Coach saves checkpoints every X seconds. It would be great if I can save checkpoints every X iterations or save checkpoints if it reaches an evaluation reward threshold.

It will also help if I can save checkpoints as a protobuf file, so I can use the saved model independent of Coach.

safrooze · 2019-01-03T22:56:38Z

So you want to save a checkpoint at a certain reward threshold but continue training? Is this in case you notice overfit later on and can go back to an earlier set of parameters?

bbalaji-ucsd · 2019-01-03T23:32:06Z

Yes, to both questions. Although flexibility to stop training after a reward threshold also helps. In the latter case, it would help to save a checkpoint at the end of training.

safrooze · 2019-01-04T02:36:13Z

There already is ability to stop training when a specific reward is achieved: "-asc" or "--apply_stop_condition" will stop early if a specific reward is hit and saves a checkpoint at the end.

bbalaji-ucsd · 2019-01-04T23:07:09Z

Ok, that solves one of the problems then :)

scttl added this to To do in Coach Dev Jan 10, 2019

galnov moved this from To do to P2 in Coach Dev Jan 13, 2019

galnov moved this from P2 to P3 in Coach Dev Jan 13, 2019

balajismaniam added the priority/p3 enhancements not currently in focus or low impact bugs label Jan 16, 2019

balajismaniam moved this from P3 to Groomed but Not Started in Coach Dev Jan 16, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Checkpointing Features #177

Checkpointing Features #177

bbalaji-ucsd commented Jan 2, 2019

safrooze commented Jan 3, 2019

bbalaji-ucsd commented Jan 3, 2019

safrooze commented Jan 4, 2019

bbalaji-ucsd commented Jan 4, 2019

Checkpointing Features #177

Checkpointing Features #177

Comments

bbalaji-ucsd commented Jan 2, 2019

safrooze commented Jan 3, 2019

bbalaji-ucsd commented Jan 3, 2019

safrooze commented Jan 4, 2019

bbalaji-ucsd commented Jan 4, 2019