/u/StartledWatermelon's posts
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling, Brown et al. 2024 [Given sufficient number of attempts, smaller models can reach parity with larger models in solving tasks. Pareto frontier for compute cost varies from task to task]
Mark as read: Add to a list
Mark as read: Add to a list
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold, Setlur et al. 2024
Mark as read: Add to a list