Now we will have some grounding for when weird ChatGPT behaviors are intended or side-effects — shrinking the Overton window of RLHF bugs.
OpenAI’s Model (behavior) Spec, RLHF transparency, personalization questions
OpenAI’s Model (behavior) Spec, RLHF…
OpenAI’s Model (behavior) Spec, RLHF transparency, personalization questions
Now we will have some grounding for when weird ChatGPT behaviors are intended or side-effects — shrinking the Overton window of RLHF bugs.