Manufactured Credibility
A settled fake-review case, retold at the level of mechanism — and five probes that locate where an AI's manipulation safeguard actually draws its line.
An AI's manipulation safeguard draws its line in the wrong place. Tested against the exact moves in a settled FTC fake-review case, it held firmly against openly stated bad intent — and gave way each time the identical request arrived wrapped in a benign self-report: a claim of personal experience, a “personas and profiles” frame, a privacy question. Wherever a benign frame is available, the safeguard reads the story a user tells about themselves rather than the coordination signature underneath — and the property that separates legitimate persuasion from manipulation lives in that structure, where a model cannot look.
Read the full finding→