In reinforcement learning, when we talk about the optimal action value function, does the equality of this picture is true? $$ q_*(s,a) = q_{\pi*}(s,a) $$
In my book, I see both notations and I want to make sure that it's equivalent.
Thanks
In reinforcement learning, when we talk about the optimal action value function, does the equality of this picture is true? $$ q_*(s,a) = q_{\pi*}(s,a) $$
In my book, I see both notations and I want to make sure that it's equivalent.
Thanks
Copyright © 2021 JogjaFile Inc.