Schema Registry Overview
Schema Registry
Citeable Systems Forensic Schema Registry — the machine-readable contracts that govern crawler output, validation, and audit evidence.
This registry documents the JSON Schema contracts used by the Citeable Systems forensic crawler. Every field in a forensic crawl output, validation report, or audit artifact is governed by one of these schemas.
Registry Statistics
Schema Contracts
8
Total Fields
46
Required Fields
14
Schema Contracts
Each schema below defines a specific audit capability. Click through to the Glossary for the full field-level reference.
No Results
Schema Architecture
The forensic crawler output is governed by a root Crawler Output schema that references specialized sub-schemas for each audit capability:
| Layer | Schema | Scope |
|---|---|---|
| Root | Crawler Output | Master contract — wraps all page and artifact data |
| Discovery | AI Bot Audit | robots.txt analysis for AI crawler access |
| Discovery | AI Bot Manifest | Declarative bot registry used by the audit |
| Discovery | llms.txt Audit | llms.txt and llms-full.txt structural checks |
| Page-Level | Schema Integrity | JSON-LD validation per page |
| Page-Level | RAG Readiness | Retrieval-readiness metrics per page |
| Page-Level | Prompt Injection Scan | Hidden manipulation pattern detection |
| Page-Level | WebMCP Signal | Agent-tool readiness signals |
| Config | Client Config | Client-specific audit parameters |
Registry: Citeable Systems Schema Contracts
Version: 2026-04-30
Authority: repo/schemas/