The researchers are using a way called adversarial coaching to stop ChatGPT from letting people trick it into behaving terribly (referred to as jailbreaking). This get the job done pits many chatbots towards each other: one particular chatbot plays the adversary and assaults One more chatbot by generating textual content to power it to buck its sta