StoryNote
Log in
|
Sign up
Whisper-Medusa: Using multiple Decoding Heads to Achieve 1.5X Speedup
by
/u/AI_inator
in
/r/github
Read on Reddit
Upvotes:
1
Favorite this post:
Mark as read:
Your rating:
--
10
9
8
7
6
5
4
3
2
1
0
Add this post to a custom list