Bootstrapping in Q-learning Dueling Network using mean of V(s') instead of argmax Q(s',a)

by /u/RjRdrG in /r/MachineLearning

Upvotes: 1

Favorite this post:

Mark as read:

Your rating:

Add this post to a custom list