Or: Why Every Study Claiming to Identify โLLM Character Traitsโ Is Actually Just Documenting the Shape of the Cage A recent study (Eliciting Frontier Model Character Training) claims to have identified convergent personality traits across frontier language models. Using a methodology borrowed from character training research, the authors instructed models to embody different personality traits,…
I came across a tweet (or whatever we call the messages on X-formerly-known-as-Twitter these days) on February 19th with over 8,500 views claiming that new research from Microsoft and Salesforce should “scare every AI builder.” The thread declares that LLMs are fundamentally broken in multi-turn conversations, that “real conversations break every model on the market,”…