You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ClientError: Failed to invoke sagemaker:CreateHyperParameterTuningJob. Error Details: Only the following fields in TrainingJobDefinition are allowed to change
#3693
Closed
timxieICN opened this issue
Mar 2, 2023
· 2 comments
I already installed the latest version of sagemaker==2.135.0 and boto3==1.26.81. However, I'm still having trouble passing the custom environment variable from Estimator to HyperparameterTuner on my custom ECR image
Now it's failing in the parameter validation:
ClientError: Failed to invoke sagemaker:CreateHyperParameterTuningJob. Error Details: Only the following fields in TrainingJobDefinition are allowed to change: [algorithmSpecification, inputDataConfig, outputDataConfig, staticHyperParameters, roleArn, resourceConfig, stoppingCondition, vpcConfig, enableManagedSpotTraining, checkpointConfig].
To reproduce
A clear, step-by-step set of instructions to reproduce the bug.
I also tested it using a pre-built image provided by AWS and the errors are gone.
So it's very possible that the previous PR fix does not handle custom images well.
OK - it turns out the issue is due to the use of warm_start_config in HyperparameterTuner. Due to the upgraded version of sagemaker and boto3, the previous training job cannot be used as warm start configurations. When I start the HPO jobs from refresh, the errors are gone.
Describe the bug
It's related to several other bug reports:
HyperparameterTuner
not keeping estimator environment variables #3598I already installed the latest version of
sagemaker==2.135.0
andboto3==1.26.81
. However, I'm still having trouble passing the custom environment variable fromEstimator
toHyperparameterTuner
on my custom ECR imageNow it's failing in the parameter validation:
To reproduce
A clear, step-by-step set of instructions to reproduce the bug.
Screenshots or logs
System information
A description of your system. Please provide:
Python Package
The text was updated successfully, but these errors were encountered: