S
- the state type.A
- the action type.public interface MarkovDecisionProcess<S,A extends Action>
Modifier and Type | Method and Description |
---|---|
java.util.Set<A> |
actions(S s)
Get the set of actions for state s.
|
S |
getInitialState()
Get the initial state s0 for this instance of a Markov
decision process.
|
double |
reward(S s)
Get the reward associated with being in state s.
|
java.util.Set<S> |
states()
Get the set of states associated with the Markov decision process.
|
double |
transitionProbability(S sDelta,
S s,
A a)
Return the probability of going from state s using action a to s' based
on the underlying transition model P(s' | s, a).
|
java.util.Set<S> states()
S getInitialState()
java.util.Set<A> actions(S s)
s
- the state.double transitionProbability(S sDelta, S s, A a)
sDelta
- the state s' being transitioned to.s
- the state s being transitions from.a
- the action used to move from state s to s'.double reward(S s)
s
- the state whose award is sought.