S - the state type.A - the action type.public class MDP<S,A extends Action> extends java.lang.Object implements MarkovDecisionProcess<S,A>
| Constructor and Description |
|---|
MDP(java.util.Set<S> states,
S initialState,
ActionsFunction<S,A> actionsFunction,
TransitionProbabilityFunction<S,A> transitionProbabilityFunction,
RewardFunction<S> rewardFunction) |
| Modifier and Type | Method and Description |
|---|---|
java.util.Set<A> |
actions(S s)
Get the set of actions for state s.
|
S |
getInitialState()
Get the initial state s0 for this instance of a Markov
decision process.
|
double |
reward(S s)
Get the reward associated with being in state s.
|
java.util.Set<S> |
states()
Get the set of states associated with the Markov decision process.
|
double |
transitionProbability(S sDelta,
S s,
A a)
Return the probability of going from state s using action a to s' based
on the underlying transition model P(s' | s, a).
|
public MDP(java.util.Set<S> states, S initialState, ActionsFunction<S,A> actionsFunction, TransitionProbabilityFunction<S,A> transitionProbabilityFunction, RewardFunction<S> rewardFunction)
public java.util.Set<S> states()
MarkovDecisionProcessstates in interface MarkovDecisionProcess<S,A extends Action>public S getInitialState()
MarkovDecisionProcessgetInitialState in interface MarkovDecisionProcess<S,A extends Action>public java.util.Set<A> actions(S s)
MarkovDecisionProcessactions in interface MarkovDecisionProcess<S,A extends Action>s - the state.public double transitionProbability(S sDelta, S s, A a)
MarkovDecisionProcesstransitionProbability in interface MarkovDecisionProcess<S,A extends Action>sDelta - the state s' being transitioned to.s - the state s being transitions from.a - the action used to move from state s to s'.public double reward(S s)
MarkovDecisionProcessreward in interface MarkovDecisionProcess<S,A extends Action>s - the state whose award is sought.