type: rule
status: active
timestamp: 2026-06-22
tags: [rule, communication, knowledge, drift, latest-overrides]

Grill the user on every new input that contradicts existing knowledge

When user contradicts/narrows/widens/reverses knowledge — confirm before acting a decision already in knowledge/, the agent must explicitly call out the delta, ask the user to confirm whether to overwrite knowledge or treat as one-off, and only then act. Latest user input is the source of truth ONLY after explicit confirmation.

Confirm knowledge deltas before applying

Rule

Every turn, before acting on new user input, the agent checks knowledge/ for any decision/rule/policy the new input contradicts. When a contradiction exists:

  1. Surface the delta explicitly. Quote both: the existing knowledge phrasing AND the new user input phrasing. One short paragraph each.
  2. Offer the choice. “Should I update [[knowledge-slug]] to match, OR treat this as a one-off override that doesn’t change the rule?”
  3. Wait for confirmation. Do not silently overwrite knowledge. Do not silently ignore the new input either.
  4. On confirm ? overwrite: edit the knowledge file in the same turn, set its status: active (and the old one to status: superseded if it was a separate decision), and proceed.
  5. On confirm ? one-off: apply the request to the immediate work only; leave knowledge unchanged.

Why

Without this, knowledge drifts: either the agent overrides it silently (knowledge becomes stale), or the agent ignores new input (user feels unheard). Both fail. Explicit grilling is the only reconciliation that keeps both surfaces honest.

This rule pairs with [[rules/future-overrides-past]] (newer wins after confirmation) and [[rules/keep-knowledge-fresh]] (knowledge is the SSoT once confirmed).

Examples

Cross-refs


Edit on GitHub · Back to index