/r/mlscaling
Mark as read: Add to a list
"Character.AI CEO Noam Shazeer [and some staff] returns to Google as the tech giant invests in the AI company" (2nd Inflection-style acquihire as scaling shakeout continues)
Mark as read: Add to a list
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling, Brown et al. 2024 [Given sufficient number of attempts, smaller models can reach parity with larger models in solving tasks. Pareto frontier for compute cost varies from task to task]
Mark as read: Add to a list
Mark as read: Add to a list
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold, Setlur et al. 2024
Mark as read: Add to a list