Or: Why Every Study Claiming to Identify โLLM Character Traitsโ Is Actually Just Documenting the Shape of the Cage A recent study (Eliciting Frontier Model Character Training) claims to have identified convergent personality traits across frontier language models. Using a methodology borrowed from character training research, the authors instructed models to embody different personality traits,…
Dr. Randal Olson recently published research showing that modern AI models routinely flip their answers when challenged with a simple follow-up: โAre you sure?โ Ask an AI a question. Get an answer. Then ask if it’s sure. Watch it suddenly reverse itself, retract its conclusion, or hedge into oblivion as though you’ve triggered existential dread.…