Type Parameters:
S - the state type.
A - the action type.
public class MDP<S,A extends Action> extends java.lang.Object implements MarkovDecisionProcess<S,A>
Constructor and Description |
---|
MDP(java.util.Set<S> states, S initialState, ActionsFunction<S,A> actionsFunction, TransitionProbabilityFunction<S,A> transitionProbabilityFunction, RewardFunction<S> rewardFunction) |
Modifier and Type | Method and Description |
---|---|
java.util.Set<A> | actions(S s): Get the set of actions for state s. |
S | getInitialState(): Get the initial state s0 for this instance of a Markov decision process. |
double | reward(S s): Get the reward associated with being in state s. |
java.util.Set<S> | states(): Get the set of states associated with the Markov decision process. |
double | transitionProbability(S sDelta, S s, A a): Return the probability of going from state s using action a to s' based on the underlying transition model P(s' \| s, a). |
public MDP(java.util.Set<S> states, S initialState, ActionsFunction<S,A> actionsFunction, TransitionProbabilityFunction<S,A> transitionProbabilityFunction, RewardFunction<S> rewardFunction)
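The constructor takes the state set, the initial state, and three function objects. The following is a minimal sketch of how such an instance might be assembled for a two-state toy problem. The nested interfaces and the `MDP` class inside it are simplified stand-ins written for this example, so their method names (`actions`, `probability`, `reward`) are assumptions rather than the library's confirmed API, and the `Move` enum and `buildToyMdp` helper are hypothetical.

```java
import java.util.Arrays;
import java.util.EnumSet;
import java.util.LinkedHashSet;
import java.util.Set;

class MdpConstructionSketch {
    // Stand-ins for the types named in the constructor signature.
    // Their single-method shapes are assumptions made for this sketch.
    interface Action {}
    interface ActionsFunction<S, A extends Action> { Set<A> actions(S s); }
    interface TransitionProbabilityFunction<S, A extends Action> {
        double probability(S sDelta, S s, A a);
    }
    interface RewardFunction<S> { double reward(S s); }

    // Hypothetical action type for a two-state toy problem.
    enum Move implements Action { STAY, GO }

    // Minimal stand-in mirroring the documented constructor and accessors.
    static class MDP<S, A extends Action> {
        private final Set<S> states;
        private final S initialState;
        private final ActionsFunction<S, A> actionsFunction;
        private final TransitionProbabilityFunction<S, A> transitionProbabilityFunction;
        private final RewardFunction<S> rewardFunction;

        MDP(Set<S> states, S initialState,
            ActionsFunction<S, A> actionsFunction,
            TransitionProbabilityFunction<S, A> transitionProbabilityFunction,
            RewardFunction<S> rewardFunction) {
            this.states = states;
            this.initialState = initialState;
            this.actionsFunction = actionsFunction;
            this.transitionProbabilityFunction = transitionProbabilityFunction;
            this.rewardFunction = rewardFunction;
        }

        Set<S> states() { return states; }
        S getInitialState() { return initialState; }
        Set<A> actions(S s) { return actionsFunction.actions(s); }
        double transitionProbability(S sDelta, S s, A a) {
            return transitionProbabilityFunction.probability(sDelta, s, a);
        }
        double reward(S s) { return rewardFunction.reward(s); }
    }

    static MDP<String, Move> buildToyMdp() {
        Set<String> states = new LinkedHashSet<>(Arrays.asList("A", "B"));
        // Both actions are available in every state.
        ActionsFunction<String, Move> actions = s -> EnumSet.allOf(Move.class);
        // STAY keeps the current state; GO switches state with probability 0.8.
        TransitionProbabilityFunction<String, Move> transitions = (sDelta, s, a) -> {
            boolean sameState = sDelta.equals(s);
            if (a == Move.STAY) {
                return sameState ? 1.0 : 0.0;
            }
            return sameState ? 0.2 : 0.8;
        };
        // Only state "B" carries a reward.
        RewardFunction<String> rewards = s -> s.equals("B") ? 1.0 : 0.0;
        return new MDP<>(states, "A", actions, transitions, rewards);
    }

    public static void main(String[] args) {
        MDP<String, Move> mdp = buildToyMdp();
        System.out.println(mdp.getInitialState());
        System.out.println(mdp.transitionProbability("B", "A", Move.GO));
        System.out.println(mdp.reward("B"));
    }
}
```

Passing the three functions as separate objects keeps the model definition declarative: the class itself only stores them and delegates each query.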
public java.util.Set<S> states()
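Description copied from interface: MarkovDecisionProcess
Get the set of states associated with the Markov decision process.
Specified by: states in interface MarkovDecisionProcess<S,A extends Action>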
public S getInitialState()
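Description copied from interface: MarkovDecisionProcess
Get the initial state s0 for this instance of a Markov decision process.
Specified by: getInitialState in interface MarkovDecisionProcess<S,A extends Action>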
public java.util.Set<A> actions(S s)
Description copied from interface: MarkovDecisionProcess
Get the set of actions for state s.
Specified by: actions in interface MarkovDecisionProcess<S,A extends Action>
Parameters:
s - the state.
public double transitionProbability(S sDelta, S s, A a)
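Description copied from interface: MarkovDecisionProcess
Return the probability of going from state s using action a to s' based on the underlying transition model P(s' | s, a).
Specified by: transitionProbability in interface MarkovDecisionProcess<S,A extends Action>
Parameters:
sDelta - the state s' being transitioned to.
s - the state s being transitioned from.
a - the action used to move from state s to s'.
public double reward(S s)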
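Description copied from interface: MarkovDecisionProcess
Get the reward associated with being in state s.
Specified by: reward in interface MarkovDecisionProcess<S,A extends Action>
Parameters:
s - the state whose reward is sought.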
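As an illustration of how the five accessors fit together, the sketch below computes the expected utility of an action as the sum over s' of P(s' | s, a) * U(s'), and then one Bellman-style backup R(s) + gamma * max over a of that sum. The `MarkovDecisionProcess` interface here is a local stand-in that mirrors only the methods listed on this page. The utility map, the discount `gamma`, the `Move` enum, and the `toyMdp` helper are assumptions introduced for the example and are not part of the documented class.

```java
import java.util.Arrays;
import java.util.EnumSet;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;

class BellmanBackupSketch {
    interface Action {}

    // Local stand-in mirroring the five documented methods.
    interface MarkovDecisionProcess<S, A extends Action> {
        Set<S> states();
        S getInitialState();
        Set<A> actions(S s);
        double transitionProbability(S sDelta, S s, A a);
        double reward(S s);
    }

    // Hypothetical action type for the toy problem.
    enum Move implements Action { STAY, GO }

    // Sum over all successor states s' of P(s' | s, a) * U(s').
    static <S, A extends Action> double expectedUtility(
            MarkovDecisionProcess<S, A> mdp, S s, A a, Map<S, Double> utility) {
        double sum = 0.0;
        for (S sDelta : mdp.states()) {
            sum += mdp.transitionProbability(sDelta, s, a) * utility.get(sDelta);
        }
        return sum;
    }

    // One backup for state s: R(s) + gamma * max_a expectedUtility(s, a).
    static <S, A extends Action> double backup(
            MarkovDecisionProcess<S, A> mdp, S s, Map<S, Double> utility, double gamma) {
        Set<A> available = mdp.actions(s);
        if (available.isEmpty()) {
            // A state with no actions contributes only its own reward.
            return mdp.reward(s);
        }
        double best = Double.NEGATIVE_INFINITY;
        for (A a : available) {
            best = Math.max(best, expectedUtility(mdp, s, a, utility));
        }
        return mdp.reward(s) + gamma * best;
    }

    // Two states; GO switches state with probability 0.8, STAY never moves.
    static MarkovDecisionProcess<String, Move> toyMdp() {
        final Set<String> states = new LinkedHashSet<>(Arrays.asList("A", "B"));
        return new MarkovDecisionProcess<String, Move>() {
            @Override public Set<String> states() { return states; }
            @Override public String getInitialState() { return "A"; }
            @Override public Set<Move> actions(String s) { return EnumSet.allOf(Move.class); }
            @Override public double transitionProbability(String sDelta, String s, Move a) {
                boolean sameState = sDelta.equals(s);
                if (a == Move.STAY) {
                    return sameState ? 1.0 : 0.0;
                }
                return sameState ? 0.2 : 0.8;
            }
            @Override public double reward(String s) { return s.equals("B") ? 1.0 : 0.0; }
        };
    }

    public static void main(String[] args) {
        MarkovDecisionProcess<String, Move> mdp = toyMdp();
        Map<String, Double> utility = new HashMap<>();
        utility.put("A", 0.0);
        utility.put("B", 1.0);
        System.out.println(backup(mdp, mdp.getInitialState(), utility, 0.9));
    }
}
```

Note the argument order of `transitionProbability`: the destination state s' comes first, followed by the source state and the action, matching P(s' | s, a).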