Substrate
The substrate category collects the platform's foundational mechanism concepts. Each substrate names a structural property the platform relies on — not a specific service or system, but a pattern that composes services, systems, and content into a coherent whole.
The category answers questions like: what makes the platform improve continuously without surrendering data ownership? what makes citations machine-auditable? what makes disclosures structurally compliant? what makes contributor work feed back into model training? The articles here describe the mechanisms; the architecture, services, and systems categories describe how they're realised in concrete components.
Start here
Read compounding-substrate first — it is the canonical pattern PointSav stewards and the frame that makes the other substrates legible. Then read apprenticeship-substrate (how editorial verdicts feed continued pretraining), citation-substrate (how every claim resolves to an authoritative source), and disclosure-substrate (how forward-looking statements remain BCSC-compliant by structure).
Core named substrates
The nine named substrates: each names a structural property the platform depends on.
- compounding-substrate — Five structural properties that make every operational interaction a compounding training event across all tenant deployments.
- apprenticeship-substrate — Routes work through a local SLM, captures signed senior verdicts, and uses the resulting preference pairs as continued-pretraining signal.
- citation-substrate — Platform-wide YAML citation registry with drift detection; makes provenance machine-auditable from regulatory instrument to published claim.
- disclosure-substrate — Version-controlled Markdown with signed authorship chains and cryptographic content hashes produces structurally BCSC-compliant continuous-disclosure records.
- trajectory-substrate — Converts operational work — commits, sessions, operator feedback — into structured JSONL training tuples for continued pretraining.
- language-protocol-substrate — Four adapter families and eighteen genre templates encoding register, brand voice, and target audience as reusable prompt scaffolding.
- design-system-substrate — Self-hosted design-system engine storing tokens and components in the customer's own git repository; W3C DTCG token format; machine-readable MCP endpoint.
- location-intelligence-substrate — Flat-file open-GIS architecture: Apache-licensed open data, Rust-aligned rendering stack, retail co-location analysis as the first deployed surface.
- brief-queue-substrate — Durable file-backed queue that makes idle-shutdown Yo-Yo compute viable without losing apprenticeship corpus capture data.
The compounding Doorman and AI boundary
The single AI gateway that enforces the Ring 3 boundary, routes inference, and accumulates training signal.
- compounding-doorman — The single service mediating every external compute call: sanitise-and-rehydrate discipline, audit ledger, accumulated training signal.
- mcp-substrate-protocol — Every Ring 1 and Ring 2 service exposes a Model Context Protocol server interface as its primary external contract; the Doorman is the MCP gateway.
- adapter-composition — The OS metaphor for AI in PointSav: Doorman as kernel, adapters as processes, service-content as filesystem; composition algebra for per-request intelligence from versioned LoRA layers.
- knowledge-graph-grounded-apprenticeship — The Doorman consults the per-tenant knowledge graph before every inference request; graph and adapter co-evolve as training tuples accumulate.
- single-boundary-compute-discipline — Every AI inference request routes exclusively through the Doorman; bypass is structurally prevented at the kernel level.
Small Language Model stack
How the SLM tier is structured, selected, and trained.
- llm-substrate-decision — The rationale for OLMo 3: the only fully open model family — training data, training code, and checkpoints included — satisfying a Canadian public-company procurement posture.
- four-tier-slm-substrate — A graduated sovereignty path from a lightweight API gateway with no local model up through a domain-specialist service trained on the vendor's aggregated corpus.
- yoyo-compute-substrate — The three-ring compute substrate that lets service-slm spin GPU inference capacity up and down while retaining state and producing an audit ledger of every compute event.
- yo-yo-lora-training-pipeline — The nightly two-phase pipeline: entity extraction for the business DataGraph (Phase 1) and LoRA adapter training against engineering and apprenticeship corpora using QLoRA on a single L4 GPU (Phase 2).
- tui-corpus-producer — Every terminal interaction with service-slm through the operator TUI is a curated training corpus contribution for the per-tenant adapter.
- nightly-datagraph-rebuild — The scheduled process that reconstructs the platform's knowledge graph from canonical flat-file sources each night, producing a fresh queryable substrate from deterministic inputs without AI involvement.
Cryptographic and microkernel primitives
The formal verification and cryptographic foundations beneath every PointSav operating system.
- sel4-microkernel-substrate — The mathematically formally verified seL4 microkernel as L1 kernel for all PointSav operating systems: structurally guaranteed memory isolation, zero buffer overflows, capability-based permissions.
- merkle-proofs-as-substrate-primitive — RFC 9162 Merkle proof primitives as the cryptographic floor of the platform's capability ledger; ledger validity verifiable without trust in any central authority.
- capability-ledger-substrate — The mechanism by which every access-control decision in a Foundry deployment becomes a cryptographically auditable event anchored to a customer-controlled log; extends seL4's formally verified capability model with a Merkle transparency layer.
- system-substrate-doctrine — The kernel-level architecture: a customer-rooted capability ledger that is the audit log, a two-bottoms sovereign OS strategy, and three mechanisms for time-bound capabilities and boot-anywhere recovery.
Sovereignty and customer ownership
What the platform makes freely transferable, customer-owned, and vendor-independent.
- sovereign-ai-commons — PointSav's market positioning as a steward of shared open AI infrastructure for regulated SMBs: five structural properties that large-scale cloud providers cannot offer without dismantling their own billing models.
- knowledge-commons — The economic model that separates what PointSav publishes freely from what it sells: public knowledge artifacts under open licences, paid service at the point of multi-Totebox aggregation.
- customer-owned-graph-ip — The per-tenant knowledge graph and trained adapter weights are the customer's intellectual property, portable and exportable without vendor approval.
- tier-zero-customer-side-sovereign-specialist — A sovereign specialist Totebox deployment running on the customer's own hardware with no required cloud dependency and a 1 GB total footprint.
- substrate-without-inference-base-case — The Totebox Archive remains fully operational and freely transferable even when no AI inference tier is available; the deterministic substrate is the load-bearing foundation.
- substrate-native-compatibility — Structural compatibility with MediaWiki reader conventions while deliberately declining API mimicry, maintaining substrate-native interfaces.
Platform mechanics
Cross-cutting principles that apply across all substrate implementations.
- code-for-machines-first — Every inter-service contract, audit record, configuration, and ontology is machine-readable as a primary surface; human-facing interfaces are skins on machine-first APIs.
- seed-taxonomy-as-smb-bootstrap — Every tenant deployment provisions a four-part seed taxonomy — Archetypes, Chart of Accounts, Domains, Themes — as the knowledge graph bootstrap.
- reverse-flow-substrate — The Doorman gateway and audit ledger that enforce inbound data discipline are planned to also enforce outbound commercial flows — data marketplace and ad exchange — both opt-in per tenant.
See also
- Architecture — cross-cutting platform architecture
- Patterns — named design patterns realised on top of substrates
All 37 articles in this area, A–Z
- The adapter composition algebra2026-05-01
The operating-system metaphor for AI in PointSav — the Doorman as kernel, adapters as processes, service-content as filesystem — and the composition algebra that assembles per-request intelligence from versioned, customer-owned LoRA adapter layers.
- The apprenticeship substrate2026-05-22
The platform mechanism that routes code-shaped and editorial work first through a local Small Language Model, captures a signed senior verdict on every attempt, and uses the resulting preference pairs as continued-pretraining signal — compounding toward task types that need no senior authoring.
- The brief queue substrate2026-05-25
A durable file-backed queue that makes idle-shutdown Yo-Yo compute viable without losing apprenticeship corpus capture data — the durability layer of the three-tier SLM substrate.
- The capability ledger substrate2026-05-22
The Capability Ledger Substrate is the mechanism by which every access-control decision in a platform deployment becomes a cryptographically auditable event anchored to a log the customer controls.
- The citation substrate2026-05-15
The citation system connects every published article to the external authorities it depends on — regulatory instruments, research papers, technical specifications — through a platform-wide YAML registry with drift detection and per-document frontmatter declarations that make provenance machine-auditable from regulatory instrument to published claim.
- Code for machines first2026-05-25
Every inter-service contract, audit record, configuration, and ontology is machine-readable as a primary surface; human-facing interfaces are skins on machine-first APIs.
- The compounding Doorman2026-04-30
The operational pattern at the heart of sovereign AI substrates: a single service that mediates every external compute call, enforces sanitise-and-rehydrate discipline, logs every event to an audit ledger, and accumulates training signal that compounds the substrate over time.
- The compounding substrate2026-05-21
The Compounding Substrate is the architectural pattern PointSav builds and stewards: open platform code, a deterministic data layer that runs without AI, and an optional intelligence layer whose every interaction generates training signal that compounds across all tenant deployments.
- Customer-owned graph IP2026-05-01
The per-tenant knowledge graph and trained adapter weights are the customer's intellectual property, portable and exportable without vendor approval.
- Design-system substrate2026-05-15
The design-system substrate is a self-hosted, customer-owned design-system engine that stores tokens and components in the customer's own Git repository, serves them through a machine-readable MCP endpoint, and uses the W3C DTCG token format to remain editor-agnostic.
- The disclosure substrate2026-05-15
The mechanism that makes a version-controlled Markdown wiki the primary continuous-disclosure record — signed authorship chains, cryptographic content hashes, and planned per-jurisdiction export adapters producing regulator-compliant outputs from a single source held under the issuer's own infrastructure.
- substrate/four-tier-slm-substrate
- substrate/knowledge-commons
- substrate/knowledge-graph-grounded-apprenticeship
- The language-protocol substrate2026-05-15
The editorial infrastructure that encodes register, brand voice, document sub-type, and target audience as reusable prompt scaffolding — four adapter families, eighteen genre templates, a frontmatter validator, and a four-service split that lets a customer replace any single component without touching the rest.
- substrate/llm-substrate-decision
- substrate/location-intelligence-substrate
- substrate/mcp-substrate-protocol
- Merkle proofs as a substrate primitive2026-05-25
Merkle proofs are the cryptographic mechanism that lets the platform substrate guarantee — to any third party, without trust — that a specific record is part of an append-only log and that the log has not been rewritten between two observed points in time.
- The moonshot-toolkit Build Orchestrator2026-05-29
- Nightly Datagraph rebuild2026-05-18
The scheduled process that reconstructs the platform's knowledge graph from canonical flat-file sources each night, producing a fresh queryable substrate from deterministic inputs without AI involvement.
- The organizational knowledge graph — ontological memory for business operations2026-06-09
An organizational knowledge graph that stores structured representations of people, companies, projects, and relationships — providing the persistent semantic memory layer that enables AI inference engines to answer queries about business state without re-reading source documents.
- substrate/reverse-flow-substrate
- Seed taxonomy as SMB bootstrap2026-05-01
Every tenant deployment provisions a four-part seed taxonomy — Archetypes, Chart of Accounts, Domains, Themes — as the knowledge graph bootstrap.
- The seL4 AArch64 QEMU Substrate Target2026-05-29
- substrate/sel4-microkernel-substrate
- Single-boundary compute discipline2026-05-01
Every AI inference request in a platform deployment routes exclusively through the Doorman, with bypass structurally prevented at the kernel level.
- The tiered inference gateway — local-first AI routing2026-06-09
A tiered inference gateway that routes AI requests through a local model first, escalating to remote GPU nodes and external APIs only when the local tier cannot serve — minimizing latency, cost, and data exposure while preserving full capability on demand.
- substrate/sovereign-ai-commons
- Substrate-native compatibility — why the Action API shim was dropped2026-05-25
Establishes structural compatibility with MediaWiki reader and integrator conventions while deliberately declining API mimicry, maintaining substrate-native interfaces that reduce maintenance burden and avoid disclosure obligations tied to compatibility guarantees.
- substrate/substrate-without-inference-base-case
- The system substrate architecture2026-05-15
The kernel-level architecture beneath every PointSav service — a customer-rooted capability ledger that is the audit log, a two-bottoms sovereign OS strategy, and three mechanisms for time-bound capabilities, reproducible verification, and boot-anywhere recovery.
- Tier 0 customer-side sovereign specialist2026-05-01
The Tier 0 Totebox is a sovereign specialist deployment running on the customer's own hardware with no required cloud dependency and a 1 GB total footprint.
- The trajectory substrate2026-05-15
The platform mechanism that converts operational work — commits, sessions, operator feedback — into structured JSONL training tuples, routing them into a continued-pretraining corpus that improves the OLMo base model over time.
- substrate/tui-corpus-producer
- substrate/yo-yo-lora-training-pipeline
- substrate/yoyo-compute-substrate