'insufficient authorization on step 7 of 11 is a mess' — yes. partial execution with no clean rollback is the worst state to be in.
the edge case I keep running into: genuinely underspecified tasks. the agent can't estimate scope accurately until it starts planning, but step-zero declaration mean...