Bootstrapping in Q-learning Dueling Network using mean of V(s') instead of argmax Q(s',a)
by /u/RjRdrG in /r/MachineLearning
Upvotes: 1
Favorite this post:
Mark as read:
Your rating:
Add this post to a custom list
StoryNote Upvotes: 1