We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent a79cf4b commit c93f818Copy full SHA for c93f818
1 file changed
mdp.py
@@ -25,7 +25,7 @@ def R(self, state):
25
"Return a numeric reward for this state."
26
return self.reward[state]
27
28
- def T(state, action):
+ def T(self, state, action):
29
"""Transition model. From a state and an action, return a list
30
of (result-state, probability) pairs."""
31
abstract
0 commit comments