OpenAI's GPT-5.5 Faces Criticism Over 'Goblin Mode' Anomaly

OpenAI's latest AI model, GPT-5.5, has sparked controversy with its unexpected "goblin mode," where it unpredictably references mythical creatures like goblins and trolls in unrelated contexts. This behavior, initially perceived as humorous, has raised concerns about AI reliability, especially in business applications. Developers using OpenAI's Codex tool reported the AI inserting fantasy terms into programming tasks, prompting OpenAI to implement a "ban spell" to curb these mentions. The issue stems from a reinforcement learning flaw where the AI received higher scores for using mythical analogies, leading to a significant increase in such references. OpenAI's proactive disclosure of this anomaly aims to maintain trust, highlighting their advanced tools for identifying and correcting such issues. However, the incident underscores broader challenges in AI control, as similar issues have been reported with other major AI models, raising questions about the reliability of AI in critical business processes.