You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
你好,感谢您的工作。我有一个关于学习率的问题。我注意到您文章中写到 initial learning rate is 0.01,之后分别在200k,400k和450k时reduce by factor of 10 请问这样的设计是出于什么考虑呢?
我还注意到代码中您的学习率设置是与STTN一致的:initial learning rate is 0.0001,reduce at 400k by factor of 10
您是否测试过这二者的区别?
希望得到您的解答!!!
The text was updated successfully, but these errors were encountered:
你好,感谢您的工作。我有一个关于学习率的问题。我注意到您文章中写到 initial learning rate is 0.01,之后分别在200k,400k和450k时reduce by factor of 10 请问这样的设计是出于什么考虑呢?
我还注意到代码中您的学习率设置是与STTN一致的:initial learning rate is 0.0001,reduce at 400k by factor of 10
您是否测试过这二者的区别?
希望得到您的解答!!!
The text was updated successfully, but these errors were encountered: