StoryNote logo

Whisper-Medusa: Using multiple Decoding Heads to Achieve 1.5X Speedup

by /u/AI_inator in /r/github

Upvotes: 1

Favorite this post:
Mark as read:
Your rating:
Add this post to a custom list

StoryNote©

Reddit is a registered trademark of Reddit, Inc. Use of this trademark on our website does not imply any affiliation with or endorsement by Reddit, Inc.