Unable to resume pretraining #7878
-
I was running pretraining on a machine that crashed so I wanted to resume where the training left off. However when I attempt to do so I get this error in spacy 3.0.5
So then I tried to move the output files to another directory in C:\temp folder, but I got the same error. The last file that was written is
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 8 replies
-
The permissions issue doesn't have anything to do with spaCy, not sure what's up with that. What versions of spaCy are you actually using? First you say you're using 3.0.5, then you say you upgraded to 3.0.0, so I'm not sure what's going on.
|
Beta Was this translation helpful? Give feedback.
-
As I suspected it looks like the resume path is wrong. When I print it out it contains only the full path to the folder without the model.bin file. so when it gets to the line 99 in
If I have to specify an actual file in the resume path, then the documentation is a bit misleading because it doesn't actually state if a file is required to be part of the path and also the command requires that we specify the --epoch-resume parameter with a number which is kind of redundant if we supplied the model.bin file from which to resume in --resume-path. When I specify the Some clarification on the correct usage of this command is definitely needed. |
Beta Was this translation helpful? Give feedback.
As I suspected it looks like the resume path is wrong. When I print it out it contains only the full path to the folder without the model.bin file. so when it gets to the line 99 in
pretrain.py
it fails because its trying to open a directory as a file instead of an actual file.with resume_path.open("rb") as file_:
If I have to specify an actual file in the resume path, then the documentation is a bit misleading because it doesn't actually state if a file is required to be part of the path and also the command requires that we specify the --epoch-resume parameter with a number which is kind of redundant if we supplied the model.bin file from which to resume in --resume-path.
When I specif…