Skip to content

Unable to resume pretraining #7878

Apr 23, 2021 · 2 comments · 8 replies
Discussion options

You must be logged in to vote

As I suspected it looks like the resume path is wrong. When I print it out it contains only the full path to the folder without the model.bin file. so when it gets to the line 99 in pretrain.py it fails because its trying to open a directory as a file instead of an actual file.

with resume_path.open("rb") as file_:

If I have to specify an actual file in the resume path, then the documentation is a bit misleading because it doesn't actually state if a file is required to be part of the path and also the command requires that we specify the --epoch-resume parameter with a number which is kind of redundant if we supplied the model.bin file from which to resume in --resume-path.

When I specif…

Replies: 2 comments 8 replies

Comment options

You must be logged in to vote
6 replies
@adrianeboyd
Comment options

@erotavlas
Comment options

@erotavlas
Comment options

@polm
Comment options

@erotavlas
Comment options

Comment options

You must be logged in to vote
2 replies
@polm
Comment options

@polm
Comment options

Answer selected by polm
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
windows Issues related to Windows feat / config Feature: Training config
3 participants
Converted from issue

This discussion was converted from issue #7877 on April 24, 2021 08:13.