ontoref-derive: the #[onto_mcp_tool] attribute macro registers MCP tool unit-structs in
the catalog at link time via inventory::submit!. The annotated item is emitted unchanged;
ToolBase/AsyncTool impls stay on the struct. All 34 tools migrated from manual wiring
(net +5: ontoref_list_projects, ontoref_search, ontoref_describe,
ontoref_list_ontology_extensions, ontoref_get_ontology_extension).
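The registration pattern the macro automates can be sketched in plain std Rust. The real macro emits an inventory::submit! per struct so the catalog is assembled at link time; this runtime version only illustrates the shape, and the trait and tool names are simplified stand-ins, not the actual crate API.

```rust
use std::collections::BTreeMap;

// Simplified stand-in for the real ToolBase trait.
trait ToolBase {
    fn name(&self) -> &'static str;
}

// Two tools, as unit structs, like the ones the macro annotates.
struct OntorefSearch;
impl ToolBase for OntorefSearch {
    fn name(&self) -> &'static str { "ontoref_search" }
}

struct OntorefDescribe;
impl ToolBase for OntorefDescribe {
    fn name(&self) -> &'static str { "ontoref_describe" }
}

// The real macro expands to an inventory::submit! per struct, so the
// catalog is collected at link time instead of built by hand at runtime.
fn catalog() -> BTreeMap<&'static str, Box<dyn ToolBase>> {
    let tools: Vec<Box<dyn ToolBase>> =
        vec![Box::new(OntorefSearch), Box::new(OntorefDescribe)];
    tools.into_iter().map(|t| (t.name(), t)).collect()
}
```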
validate modes (ADR-018): reads level_hierarchy from workflow.ncl and checks every
.ncl mode for a declared level, a declared strategy, a coherent delegate chain, and
valid compose extends. mode resolve <id> shows which hierarchy level handles a mode and why.
--self-test generates synthetic fixtures in a temp dir for CI smoke-testing.
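A minimal sketch of the level check, assuming each mode carries an optional level field that must appear in the level_hierarchy list (field and error shapes here are illustrative, not the actual validator):

```rust
// A mode's declared level must appear in the level_hierarchy read from
// workflow.ncl. Field and error shapes here are illustrative only.
fn validate_mode_level(level: Option<&str>, hierarchy: &[&str]) -> Result<(), String> {
    match level {
        None => Err("mode is missing a level declaration".to_string()),
        Some(l) if !hierarchy.contains(&l) => {
            Err(format!("level '{l}' is not declared in level_hierarchy"))
        }
        Some(_) => Ok(()),
    }
}
```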
validate run-cargo: two-step Cargo.toml resolution — workspace layout first
(crates/<check.crate>/Cargo.toml), single-crate fallback by package name or repo
basename. Lets the same ADR constraint shape apply to workspace and single-crate repos.
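The two-step resolution can be sketched as below. The exists predicate stands in for a real filesystem check so the ordering logic is testable in isolation, and the package-name/basename matching of the fallback step is elided:

```rust
use std::path::{Path, PathBuf};

// Two-step Cargo.toml resolution. `exists` stands in for a filesystem
// check; the real tool also matches the fallback manifest by package
// name or repo basename before accepting it.
fn resolve_manifest(
    repo_root: &Path,
    crate_name: &str,
    exists: &dyn Fn(&Path) -> bool,
) -> Option<PathBuf> {
    // Step 1: workspace layout: crates/<check.crate>/Cargo.toml
    let workspace = repo_root.join("crates").join(crate_name).join("Cargo.toml");
    if exists(&workspace) {
        return Some(workspace);
    }
    // Step 2: single-crate fallback: Cargo.toml at the repo root
    let single = repo_root.join("Cargo.toml");
    exists(&single).then(|| single)
}
```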
ontology/schemas/manifest.ncl: registry_topology_type contract — multi-registry
coordination, push targets, participant scopes, per-namespace capability.
reflection/requirements/base.ncl: oras ≥1.2.0, cosign ≥2.0.0, sops ≥3.9.0, age
≥1.1.0, restic declared as Hard/Soft requirements with version_min, check_cmd, and
install_hint (ADR-017 toolchain surface).
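A version_min check of this kind reduces to a dotted-numeric comparison against the output of each tool's check_cmd. A minimal sketch, with pre-release tags and other semver subtleties out of scope:

```rust
// Compares a discovered tool version against a declared version_min,
// e.g. the "oras >= 1.2.0" requirement. Dotted numeric versions only.
fn parse_version(v: &str) -> Vec<u64> {
    v.trim_start_matches('v')
        .split('.')
        .map(|part| {
            part.chars()
                .take_while(|c| c.is_ascii_digit())
                .collect::<String>()
                .parse()
                .unwrap_or(0)
        })
        .collect()
}

fn satisfies_min(found: &str, min: &str) -> bool {
    let (f, m) = (parse_version(found), parse_version(min));
    for i in 0..f.len().max(m.len()) {
        let (a, b) = (*f.get(i).unwrap_or(&0), *m.get(i).unwrap_or(&0));
        if a != b {
            return a > b;
        }
    }
    true // equal versions satisfy the minimum
}
```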
ADR-019: per-file recipient routing for tenant isolation without multi-vault. Schema
additions: sops.recipient_groups + sops.recipient_rules in ontoref-project.ncl.
secrets-bootstrap generates .sops.yaml from project.ncl in declarative mode. Three
new secrets-audit checks: recipient-routing-coherent, recipient-routing-coverage,
no-multi-vault. Adoption templates: single-team/, multi-tenant/, agent-first/.
Integration templates: domain-producer/, mode-producer/, mode-consumer/.
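Deriving .sops.yaml creation_rules from recipient groups and routing rules can be sketched as follows. All field names here are assumptions for illustration, not the actual ontoref-project.ncl schema:

```rust
// Renders .sops.yaml creation_rules from recipient groups and routing
// rules. Field names are illustrative assumptions only.
struct RecipientGroup {
    name: &'static str,
    age_keys: Vec<&'static str>,
}

struct RecipientRule {
    path_regex: &'static str,
    group: &'static str,
}

fn render_sops_yaml(groups: &[RecipientGroup], rules: &[RecipientRule]) -> String {
    let mut out = String::from("creation_rules:\n");
    for rule in rules {
        // Resolve the rule's group to its age recipients.
        let keys = groups
            .iter()
            .find(|g| g.name == rule.group)
            .map(|g| g.age_keys.join(","))
            .unwrap_or_default();
        out.push_str(&format!(
            "  - path_regex: {}\n    age: {}\n",
            rule.path_regex, keys
        ));
    }
    out
}
```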
UI: project_picker surfaces registry badge (⟳ participant) and vault badge
(⛁ vault_id · N, green=declarative / amber=legacy) per project card. Expanded panel
adds collapsible Registry section with namespace, endpoint, and push/pull capability.
manage.html gains Runtime Services card — MCP and GraphQL toggleable without restart
via HTMX POST /ui/manage/services/{service}/toggle.
describe.nu: capabilities JSON includes registry_topology and vault_state per project.
sync.nu: drift check extended to detect missing //! crate-level doc comments on newly registered crates.
qa.ncl: six entries — credential-vault-best-practice (layered data-flow diagram),
credential-vault-templates (paths A/B/C), credential-vault-troubleshooting (15 named
errors), integration-what-and-why (ADR-042 OCI federation), integration-how-to-implement,
integration-troubleshooting.
ontology+reflection: core.ncl + manifest.ncl updated to reflect OCI, MCP, and mode-hierarchy nodes.
Deleted stale presentation assets (2026-02 slides + voice notes).
| id | title | slug | subtitle | excerpt | author | date | published | featured | category | tags | read_time | sort_order | css_class | category_description | category_published |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ai-knowledge-tool-who-keeps-it-alive | AI is a Knowledge Tool. But Who Keeps the Knowledge Alive? | ai-knowledge-tool-who-keeps-it-alive | Talisman is right — and stops exactly where the hard problem begins | Jessica Talisman's KGC 2026 talk is the clearest articulation of why AI strategies fail: organizations invest in data infrastructure and expect reasoning to emerge. She's right. But her solution — build knowledge infrastructure — stops exactly where the hard problem begins: who keeps the knowledge from drifting? | Jesús Pérez | 2026-05-10 | false | false | ontoref | | 7 min read | 1 | category-ontoref | Ontoref — protocol and tooling for structured self-knowledge in software projects | true |
AI is a Knowledge Tool. But Who Keeps the Knowledge Alive?
Jessica Talisman delivered a talk at KGC 2026 called "Stop Betting, Start Building." The data she opens with is brutal:
- 89% of firms report zero productivity impact from AI after three years (NBER, Feb 2026, n=5,937 executives)
- Experienced developers are 19% slower with AI tools, not faster (METR RCT, 2025)
- The average AI-using worker loses a net 14 minutes per week in productivity (Foxit/Sapio, March 2026)
The market is bullish. The evidence is not.
Her diagnosis is correct: AI is a knowledge tool, not a data tool. Organizations are pouring money into data lakes, vector stores, and ETL pipelines and expecting context and reasoning to emerge from raw tokens. It won't. Models were trained on linked data, RDF triples, and controlled vocabularies — the top fifteen C4 training sources are knowledge-graph-heavy. They're then deployed against environments stripped of all that structure, and we wonder why they hallucinate.
Her prescription is also correct: build knowledge infrastructure. Controlled vocabularies first. Taxonomies. Thesauri. Ontologies — formal commitments, classes, properties, constraints. Knowledge graphs on top. And govern them. Forever, not as a project.
She's right. And she stops exactly where the hard problem begins.
The Governance Gap
The sixth step in her stack is "Govern — this is infrastructure, stewardship, versioning, growth, forever." She names the problem. But naming it is not a mechanism.
The real question is: who ensures the ontology doesn't drift from the system it describes?
Software evolves. An architectural decision made in March invalidates a node in the graph by October. A new team joins with different mental models. A library gets replaced and a constraint becomes fictional. Knowledge graphs accumulate debt the same way codebases do — silently, without an alarm.
In the enterprise knowledge graph world, the answer to governance is headcount: hire ontologists, librarians, knowledge engineers. That works at organizational scale with dedicated budgets. It is not a viable answer for a software project, an infrastructure environment, or an individual trying to maintain structured self-knowledge.
What's Actually Missing: the Operational Loop
Every serious ontology without an operational closure layer becomes archaeology within eighteen months. Beautiful, formally correct, and describing a system that no longer exists.
The missing piece is not more knowledge. It's a feedback loop that:
- Observes the current state of the system against its declared intent
- Detects drift before it becomes permanent
- Executes operations that reduce that drift
- Records decisions with lasting architectural weight
- Propagates changes to everything that depends on this knowledge
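The steps above can be sketched as a toy reconciliation loop, with intent and observed state as string maps and drift as any key whose observed value differs from the declared one. This is purely illustrative; ontoref's real loop runs operational DAGs against Nickel-defined state, not this.

```rust
use std::collections::BTreeMap;

// Toy reconciliation loop. Names and shapes are illustrative only.
struct OperationalLoop {
    intent: BTreeMap<String, String>,
    observed: BTreeMap<String, String>,
    log: Vec<String>, // decisions recorded with lasting weight
}

impl OperationalLoop {
    // Observe + detect: which declared keys have drifted?
    fn drift(&self) -> Vec<String> {
        self.intent
            .iter()
            .filter(|(k, v)| self.observed.get(k.as_str()) != Some(v))
            .map(|(k, _)| k.clone())
            .collect()
    }

    // Execute + record; propagation to dependents is elided.
    fn reduce(&mut self) {
        for key in self.drift() {
            let want = self.intent[&key].clone();
            self.observed.insert(key.clone(), want);
            self.log.push(format!("reconciled {key}"));
        }
    }
}
```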
This is the Yang to the ontology's Yin. Talisman describes the Yin in precise detail. The Yang is what makes it not an artifact.
Ontoref: Both Halves
Ontoref is a protocol for structured self-knowledge in software projects. It operates as two coexisting layers that cannot function without each other:
Ontology (what IS): typed nodes and edges representing practices, principles, tensions, capabilities. Not documentation — formal commitments. A Practice node that implements a Principle, enforces a Constraint, enables a Capability, and is in active tension with another Practice. The project as a knowledge graph.
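The typed-edge idea can be rendered as a minimal sketch; the names are illustrative, not the actual ontoref schema:

```rust
// Typed edges from a Practice node; names are illustrative only.
enum Edge {
    Implements(&'static str),    // target Principle
    Enforces(&'static str),      // target Constraint
    Enables(&'static str),       // target Capability
    InTensionWith(&'static str), // target Practice
}

struct Practice {
    id: &'static str,
    edges: Vec<Edge>,
}

// Walking one edge type: which practices is this one in tension with?
fn tensions(p: &Practice) -> Vec<&'static str> {
    p.edges
        .iter()
        .filter_map(|e| match e {
            Edge::InTensionWith(other) => Some(*other),
            _ => None,
        })
        .collect()
}
```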
Reflection (what BECOMES): executable DAGs — operational modes that run against the ontological state, detect drift, report transitions, and propagate changes. The sync diff --docs command that catches when a crate's documentation has drifted from its ontology node. The FSM in state.ncl that tracks where each project dimension is versus where it intends to go. The migrations system that ensures protocol changes reach every consumer project.
The tension between these two layers is not a design flaw — it's named explicitly in the project's own ontology as its core identity:
"Ontology captures what IS. Reflection captures what BECOMES. Both must coexist without one dominating."
Without Reflection, the Ontology crystallizes. It becomes a snapshot of what the project wanted to be at the moment it was written. Without Ontology, Reflection is execution without truth — it knows how to operate but not what it's operating on.
The Accuracy Numbers
Talisman's data on what structured knowledge actually does to AI accuracy is worth repeating:
- Question-answering on enterprise SQL: 16% → 72% when an ontology checks and repairs LLM-generated queries (Allemang & Sequeda, data.world AI Lab, 2024)
- GraphRAG vs. vector RAG: 3.4× accuracy across 43 enterprise queries (Diffbot KG-LM Benchmark, 2023)
- Vector RAG collapses to 0% accuracy past five entities per query. KG-grounded retrieval sustains performance well beyond that
The accuracy gap is not closed by a bigger model. It is closed by a defined schema, an ontology, and a validated query.
But only if the ontology is kept alive. Only if someone — or something — is running the operational loop that keeps it from drifting into fiction.
That's the part Talisman's framework doesn't provide. That's what Reflection is for.
Ontoref is open source. The protocol specification, Nushell automation, and Rust crates are at github.com/jesusperezlorenzo/ontoref.