This repository has been archived by the owner on Dec 11, 2022. It is now read-only.

When resuming from a checkpoint, the naming should continue from there. #248

Open
redknightlois opened this issue Mar 13, 2019 · 0 comments

@redknightlois
Contributor

During training, machines sometimes get rebooted. When training automatically resumes from a checkpoint, the next checkpoint no longer continues the numbering from where the previous run left off. Continuing the numbering would make it easier to keep track of the actual number of training iterations in cases of force majeure, such as Windows rebooting itself for updates, power losses, etc.
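
To illustrate the requested behavior, here is a minimal sketch of how the next checkpoint index could be derived from the files already on disk. It is not Coach's actual implementation: the file naming scheme `<index>_Step-<steps>.ckpt...` and the function names `next_checkpoint_index` and `next_checkpoint_index_in_dir` are assumptions made for this example.

```python
import re
from pathlib import Path

# Assumed (hypothetical) naming scheme: "<index>_Step-<steps>.ckpt<optional suffix>",
# e.g. "3_Step-400.ckpt.index". Adjust the pattern to the real file names.
_CKPT_PATTERN = re.compile(r"^(\d+)_Step-(\d+)\.ckpt")


def next_checkpoint_index(filenames):
    """Return the index the next checkpoint should use, given existing file names."""
    indices = []
    for name in filenames:
        match = _CKPT_PATTERN.match(name)
        if match:
            indices.append(int(match.group(1)))
    # Continue after the highest existing index; start at 0 if none were found.
    return max(indices) + 1 if indices else 0


def next_checkpoint_index_in_dir(checkpoint_dir):
    """Scan a directory and return the next checkpoint index (0 if it is missing)."""
    directory = Path(checkpoint_dir)
    if not directory.is_dir():
        return 0
    return next_checkpoint_index(p.name for p in directory.iterdir())
```

On resume, the checkpoint counter would be initialized with the value returned by `next_checkpoint_index_in_dir` instead of starting again from zero, so files written after a reboot would not reuse earlier indices.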

@scttl scttl added this to Requires Grooming in Coach Dev via automation Mar 14, 2019