tinyBenchmarks: Revolutionizing LLM Evaluation with 100-Example Curated Sets, Reducing Costs by Over 98% While Maintaining High Accuracy [Colab Notebook Included]