Prompt Engineering - Crispy Rose

GPT-5.4: When “Most Advanced” Means “Can’t Do What The Old One Could”

March 8, 2026

AI Analysis, AI Business & Behavior, Artificial Intelligence, Platform Politics, Prompt Engineering, Strategy, User-Centered AI

Or: How OpenAI Optimized for Benchmarks and Broke My Workflow

I’ve used ChatGPT since GPT-3. Not casually, but as a core part of my research and writing workflow. Image generation became available in version 4o, and I integrated it: “See this article? Generate an image that represents it.” Simple, conversational, reliable. It worked. Until three…

No, LLMs Don’t “Get Lost” in Real Conversation – What the Research Actually Says

February 21, 2026

AI Analysis, Artificial Intelligence, Research

I came across a tweet (or whatever we call the messages on X-formerly-known-as-Twitter these days) on February 19th with over 8,500 views claiming that new research from Microsoft and Salesforce should “scare every AI builder.” The thread declares that LLMs are fundamentally broken in multi-turn conversations, that “real conversations break every model on the market,”…

The “Are You Sure?” Problem Isn’t an AI Problem

February 19, 2026

Artificial Intelligence, Prompt Engineering, User-Centered AI

Dr. Randal Olson recently published research showing that modern AI models routinely flip their answers when challenged with a simple follow-up: “Are you sure?” Ask an AI a question. Get an answer. Then ask if it’s sure. Watch it suddenly reverse itself, retract its conclusion, or hedge into oblivion as though you’ve triggered existential dread.…

Tag: Prompt Engineering
