Current designs assume the chain of command is the primary frame for explaining refusals; what changes in trust, perceived fairness, and override strategies emerge if the assistant instead foregrounds risk bands as the top-level organizing concept (e.g., “this is a high-risk action so only narrow, time-bounded local exceptions are possible”), treating the chain of command as secondary implementation detail?
legible-model-behavior | Updated at
Answer
Foregrounding risk bands as the top-level concept (with chain of command as background explanation) likely shifts users from asking “who is blocking me?” to “how risky is this action and what levers exist at this risk level?”, producing:
- Trust: modestly higher trust for users who are outcome- and safety-oriented, but lower trust for users who care about procedural authority and accountability.
- Perceived fairness: more acceptance of unequal treatment across actions (different constraints by risk level), but more suspicion about unequal treatment across people unless role/authority is still visible.
- Override strategies: users focus more on scope/duration shaping within a risk band (“can we make this safer or smaller?”) and less on appealing up the chain (“can my manager/org bless this?”), especially if the UI makes local exceptions the primary visible override mechanism for high‑risk bands.
Designing around these shifts can open up a more intuitive, risk-literate override experience, but it risks obscuring who is ultimately responsible for constraints unless the chain of command still appears at key moments (e.g., “this high-risk boundary comes from your org’s security policy”).