Run the fact-checking kit — procedure

End-to-end fact-checking workflow using evidence boundaries, web verification, Chain-of-Verification, and confidence scoring.

Purpose

Use this page to run an end-to-end fact-checking workflow that reduces factual errors and improves auditability.

Enforcement (fail-closed):

  • You must choose an evidence boundary first and follow the active policy’s admissible-evidence rules.
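  • If required evidence is missing, you must fail closed using the exact sentinel required by the active system prompt template:
    • Option 1 (artifacts-only): "HANDS UP – no artifact, cannot verify."
    • Option 2 (authoritative sources): "HANDS UP – no source, cannot verify."
    • Option 3 (web verification), browsing unavailable: BROWSING_UNAVAILABLE
    • Option 3 (web verification), evidence insufficient: INSUFFICIENT_EVIDENCE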

Choose a mode

  • Option 1 (Artifacts-only run): verify only from artifacts you provide (no external sources).
  • Option 2 (Authoritative sources run): verify world-claims using authoritative sources with stable locators.
  • Option 3 (Web verification run): verify using browsing/search (if available), with inline citations and a Sources list.

Setup

1) Choose an evidence boundary (Option 1, 2, or 3) and install its policy and system prompt template.

Verify (smoke test)

Ask one factual question without providing any artifacts or citations.

  • Option 1 expected: output exactly "HANDS UP – no artifact, cannot verify."
  • Option 2 expected: output exactly "HANDS UP – no source, cannot verify."
  • Option 3 expected: output exactly BROWSING_UNAVAILABLE (if browsing is unavailable) or INSUFFICIENT_EVIDENCE (if evidence is insufficient).
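
The smoke test above hinges on the word "exactly", so it can help to check the output mechanically rather than by eye. The following is a minimal sketch, not part of the kit itself: the names `EXPECTED_SENTINELS` and `passes_smoke_test` are hypothetical, and the sentinel strings are copied from this page.

```python
# Expected fail-closed outputs per option, copied from the smoke test on this page.
# Note: the dash in the "HANDS UP" sentinels is an en dash (U+2013), not a hyphen.
EXPECTED_SENTINELS = {
    1: {"HANDS UP – no artifact, cannot verify."},
    2: {"HANDS UP – no source, cannot verify."},
    3: {"BROWSING_UNAVAILABLE", "INSUFFICIENT_EVIDENCE"},
}


def passes_smoke_test(option: int, output: str) -> bool:
    """Return True only if `output` is exactly an expected sentinel for `option`."""
    if option not in EXPECTED_SENTINELS:
        raise ValueError(f"unknown option: {option}")
    # Deliberately no strip() or case folding: the match is character-for-character.
    return output in EXPECTED_SENTINELS[option]
```

With this sketch, a response that swaps the en dash for a hyphen, appends a newline, or wraps the sentinel in extra prose is reported as a failure, which matches the fail-closed intent.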

Options

Option 1 — Artifacts-only run

Example

  • Question: “Why did this CI run fail?”
  • You must provide: CI/build logs + referenced config snippets/files. Each factual claim must cite [artifact-id §locator].
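
The citation rule above can be spot-checked with a small script. This is a rough sketch under stated assumptions: it treats `[artifact-id §locator]` as an id without whitespace, a space, the section sign, and a non-empty locator, and it assumes one claim per string. The function name `uncited_claims` and the artifact id `ci-log` are hypothetical.

```python
import re

# Assumed concrete form of [artifact-id §locator]: an id without whitespace,
# a single space, the section sign, then a non-empty locator.
CITATION = re.compile(r"\[[^\s\[\]]+ §[^\[\]]+\]")


def uncited_claims(claims: list[str]) -> list[str]:
    """Return the claims that carry no [artifact-id §locator] citation."""
    return [claim for claim in claims if not CITATION.search(claim)]


# Hypothetical claims about a CI failure; "ci-log" is an illustrative artifact id.
claims = [
    "The build failed at the lint step [ci-log §L120-L134].",
    "The runner ran out of disk space.",
]
print(uncited_claims(claims))
```

Any claim this returns either needs a citation added or should be dropped from the answer.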

Option 2 — Authoritative sources run

Example

  • Question: “What does standard/spec X say about Y?”
  • You must provide: DOI or standard-id + section/clause (or official vendor doc version + section). If not verifiable, output exactly "HANDS UP – no source, cannot verify."
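
One way to picture the Option 2 rule is as a gate on the source locator: without both an identifier and a section, the only permitted output is the sentinel. The sketch below assumes a hypothetical `SourceLocator` record and `respond` helper, and it reuses the `[id §section]` citation shape from Option 1 purely for illustration; the active policy may prescribe a different format. `STD-0000` in the usage lines is a placeholder, not a real standard.

```python
from dataclasses import dataclass
from typing import Optional

# Sentinel copied from this page; the dash is an en dash (U+2013).
NO_SOURCE = "HANDS UP – no source, cannot verify."


@dataclass(frozen=True)
class SourceLocator:
    source_id: str  # DOI, standard-id, or vendor doc name plus version
    section: str    # section or clause within that source

    def is_complete(self) -> bool:
        # Both parts must be non-blank for the locator to count as stable.
        return bool(self.source_id.strip()) and bool(self.section.strip())


def respond(claim: str, locator: Optional[SourceLocator]) -> str:
    """Return the cited claim, or the exact sentinel if the locator is unusable."""
    if locator is None or not locator.is_complete():
        return NO_SOURCE
    # Assumed citation shape, borrowed from the Option 1 example.
    return f"{claim} [{locator.source_id} §{locator.section}]"
```

For example, `respond("Y is required.", SourceLocator("STD-0000", "4.2"))` yields the claim with its locator attached, while `respond("Y is required.", None)` yields only the sentinel.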

Option 3 — Web verification run (browsing + citations)

Example

  • Question: “Find the most recent official guidance about X and cite it.”
  • You must provide: topic X + constraints (jurisdiction/organization) + required recency window. If browsing is unavailable, output exactly BROWSING_UNAVAILABLE.
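
The three inputs above, together with the two Option 3 sentinels, can be sketched as a pre-flight check. This is an illustration only: `WebRequest`, `within_window`, and `preflight` are hypothetical names, the recency window is assumed to be expressed in days, and treating "no source inside the window" as insufficient evidence is an assumption rather than something the policy states.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional

BROWSING_UNAVAILABLE = "BROWSING_UNAVAILABLE"
INSUFFICIENT_EVIDENCE = "INSUFFICIENT_EVIDENCE"


@dataclass(frozen=True)
class WebRequest:
    topic: str
    constraints: str   # jurisdiction and/or organization
    recency_days: int  # assumed unit for the required recency window

    def __post_init__(self) -> None:
        # Reject requests that ask for "latest" without a usable window.
        if self.recency_days <= 0:
            raise ValueError("recency_days must be a positive number of days")


def within_window(published: date, today: date, recency_days: int) -> bool:
    """True if `published` is not in the future and at most `recency_days` old."""
    return 0 <= (today - published).days <= recency_days


def preflight(request: WebRequest, browsing_available: bool,
              source_dates: list[date], today: date) -> Optional[str]:
    """Return the sentinel to emit, or None if verification may proceed."""
    if not browsing_available:
        return BROWSING_UNAVAILABLE
    # Assumption: at least one dated source must fall inside the window.
    if not any(within_window(d, today, request.recency_days) for d in source_dates):
        return INSUFFICIENT_EVIDENCE
    return None
```

Passing `today` explicitly keeps the check reproducible, which fits the auditability goal of this page.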

Common mistakes

  • Duplicating stack definitions across multiple pages instead of linking to the single-source-of-truth (SSOT) procedure.
  • Mixing evidence boundaries (artifacts-only vs authoritative vs web verification) in the same run without an explicit selection.
  • Asking for “latest” without specifying a recency window (for example, “published within the last 12 months”).
  • Emitting anything other than the exact sentinel when the active policy requires a fail-closed response.