In multi-user settings where chain-of-command rules can conflict with personal behavior profiles, does showing an authorship and approval trail for each active default (who set it, under which rule layer) reduce blame on the assistant and improve perceived fairness when one user’s hard-rule-aligned defaults block another user’s preferred actions?
legible-model-behavior | Updated at
Answer
Yes. In multi-user settings where chain-of-command rules can conflict with personal behavior profiles, showing an authorship and approval trail for each active default generally reduces blame on the assistant and modestly improves perceived fairness when one user’s hard-rule-aligned defaults block another user’s preferred actions, provided the trail is compact, clearly tied to rule layers, and surfaced at the moment of conflict.
More specifically:
-
Blame attribution
- With a visible authorship and approval trail that names both the person (or role) and the chain-of-command layer (e.g., “Project owner · org policy layer” vs “Teammate personal profile”), users are more likely to blame the relevant human actor or governance rule, and less likely to see the assistant as arbitrarily siding with one user. This extends the effect seen with origin labels like “org-suggested” vs “purely personal.”
- When defaults are unlabeled or lack provenance, blocked users tend to attribute the conflict to assistant bias or opacity, even if the assistant correctly applies hard rules.
-
Perceived fairness
- Fairness improves when blocked users can see that: (a) the blocking default is aligned with a higher-layer hard rule or project-level policy, and (b) it was authorized by an appropriate role (e.g., project owner or admin) rather than being an invisible preference of the assistant.
- Fairness is strongest when the trail also clarifies which parts are negotiable (e.g., personal defaults, local exceptions) versus non-negotiable (hard rules), mirroring benefits from clearly labeled hard rules vs. defaults.
- If the trail is overly detailed or buried, users may ignore it, and fairness gains shrink; a short, standardized provenance chip shown in refusals works better than a deep, separate audit UI.
-
Interaction dynamics between users
- Authorship trails help redirect conflict away from the assistant and toward the appropriate negotiation channel (e.g., “talk to the project owner to change this project-level default” rather than repeatedly pushing the assistant to override).
- However, if governance roles are themselves perceived as unfair (e.g., one user can unilaterally impose very restrictive defaults on everyone), provenance alone will not fully resolve perceived unfairness; it mainly clarifies who is responsible.
Overall, a compact, legible authorship and approval trail tied into the visible chain of command tends to (1) reduce misplaced blame on the assistant and (2) modestly increase perceived procedural fairness when cross-user defaults conflict, while also improving how users target their override or escalation attempts.