Skip to content

Commit

Permalink
Fix exercise 6.1 description, it's confusing
Browse files Browse the repository at this point in the history
gamma is discount rate, not learning rate
these are different things
learning rate is alpha, there is no learning rate in Policy Evaluation
  • Loading branch information
q0o0p committed Jun 8, 2019
1 parent 2d2f924 commit 628b829
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions labs/notebooks/reinforcement_learning/exercises_1_4.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,7 @@
"for i in range(20):\n",
" print(state_value_function)\n",
" \n",
" state_value_function=#TODO: Implement the Policy Evaluation Update with a Learning Rate of 0.1\n",
" state_value_function=#TODO: Implement the Policy Evaluation Update with a Discount Rate of 0.1\n",
"print(state_value_function)"
]
},
Expand All @@ -54,7 +54,7 @@
"metadata": {},
"outputs": [],
"source": [
"solution=#TODO: Implement the linear programming solution with a learning rate of 0.1\n",
"solution=#TODO: Implement the linear programming solution with a discount rate of 0.1\n",
"print(solution)"
]
},
Expand Down

0 comments on commit 628b829

Please sign in to comment.