StoryNote
Log in
|
Sign up
Bootstrapping in Q-learning Dueling Network using mean of V(s') instead of argmax Q(s',a)
by
/u/RjRdrG
in
/r/MachineLearning
Read on Reddit
Upvotes:
1
Favorite this post:
Mark as read:
Your rating:
--
10
9
8
7
6
5
4
3
2
1
0
Add this post to a custom list