nekocave.xyz

We’ve trained a model, CriticGPT, to catch bugs in GPT-4’s code. We’re starting to integrate such models into our RLHF alignment pipeline to help humans supervise AI on difficult tasks: https://openai.com/index/finding-gpt4s-mistakes-with-gpt-4/