And here's a somewhat non-trivial quiz.
For the state B3, calculate the value,
assuming that we have a value function as shown over here
and all the open states have an assumed value of 0,
because we're still in the beginning of our value update.
What would be our very first value for B3
that we compute based on the values shown over here?
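
As a reminder of the mechanics, a single value update for one state can be sketched as below. This is only a minimal sketch: the function name, the state names, the probabilities, the step cost, and the values are all hypothetical placeholders, not the numbers from the grid shown in the quiz. It assumes the usual backup, where each action is scored by the step cost plus the discounted expected value of the next state, and the best action wins.

```python
def bellman_backup(action_outcomes, values, step_cost=0.0, gamma=1.0):
    """Return the one-step backed-up value of a state.

    action_outcomes: dict mapping an action name to a list of
        (probability, next_state) pairs (hypothetical structure).
    values: dict mapping each state name to its current value estimate.
    step_cost: reward added for taking any action (negative for a cost).
    gamma: discount factor.
    """
    best = float("-inf")
    for outcomes in action_outcomes.values():
        # Expected value of the next state under this action.
        expected = sum(p * values[s] for p, s in outcomes)
        # Keep the best action's score.
        best = max(best, step_cost + gamma * expected)
    return best


# Hypothetical placeholder numbers, NOT the quiz grid.
values = {"goal": 10.0, "trap": -10.0, "open": 0.0}
action_outcomes = {
    "toward_goal": [(0.5, "goal"), (0.5, "open")],
    "toward_trap": [(0.5, "trap"), (0.5, "open")],
}
example_value = bellman_backup(action_outcomes, values, step_cost=-1.0)
print(example_value)
```

To work the quiz with this sketch, you would replace the placeholder dictionaries with the values, transition probabilities, and step cost given in the lecture.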