Scaling laws for reward model overoptimization October 19, 2022 by OpenAI OpenAI Blog Previous Post NVIDIA, Oracle CEOs in Fireside Chat Light Pathways to Enterprise AI Next Post Design patterns for serial inference on Amazon SageMaker