Chaos Engineering for LLM Pipelines
LLM pipelines rarely fail in a single call. They degrade across turns, handoffs, and hidden state. This piece argues that the right way to test them is not by replaying golden cases, but by simulating messy users and watching where the system starts to break.