K · Knowledge Graphs deep dive

Connect claims without losing source, time, or authority

Use competency questions to model the relationships that tables and prompts cannot safely reconstruct on demand.

Core concepts

A graph is not automatically knowledge

Knowledge graph

A governed network of identified entities, relationships, claims, semantics, provenance, and time designed to answer domain questions.

Ontology

A formal domain model of classes/concepts, properties, relationships, constraints, and potentially inference rules.

Property graph

Nodes and relationships with properties, commonly queried with Cypher/Gremlin; practical for traversal and application graphs.

RDF graph

Subject-predicate-object triples identified by IRIs, queried with SPARQL and supported by W3C semantic-web standards.

Competency question

A precise business question the graph must answer, used to justify schema and test usefulness.

Entity resolution

Determining which records refer to the same real entity, with explainable rules, confidence, and human correction.

Provenance

Who asserted or derived a claim, from what evidence, when, with which method/version and confidence.

Inference

A new claim derived from asserted facts and governed rules; it must remain distinguishable and traceable.

Logistics example

From rows to connected impact

Truck → carries → Shipment → serves → Customer → governed by → SLA
Competency questionRequired relationshipsExpected proof
Which closure threatens the highest-value commitment?closure–segment–route–truck–shipment–customer–SLAAnswer traces every edge to source and valid time.
Can inventory be substituted?product–equivalent–inventory–warehouse–contract–customerInference cites equivalence and contract policy.
Who may approve a hazardous-cargo reroute?cargo–risk–policy–jurisdiction–role–personAuthorization reflects current role and policy version.

Keep in V

Canonical definitions, reference codes, semantic contracts, source facts, data quality, ownership, lineage, and permitted use.

Implement in K

Cross-domain identity, relationships, temporal claims, ontology constraints, graph authorization, competency questions, and inference.

Technology map

Graph engines, semantic standards, and platform integration

NeedOpen / specializedDatabricksSnowflakeMicrosoft Fabric / Azure
Property graphNeo4j, Amazon Neptune, JanusGraph, MemgraphGraphFrames on Spark for graph processing; integrate specialized graph DB for transactional traversalModel edges in tables or integrate a graph database/service; relational recursive queries are not a full graph platformAzure Cosmos DB Gremlin or specialized graph; Fabric data as governed source
RDF/ontologyApache Jena, RDF4J, GraphDB, Stardog; RDF/OWL/SHACL/SPARQLStore/process triples in lakehouse or integrate RDF store; Unity Catalog governs source assetsStore triples relationally or integrate semantic store; Semantic Views are not formal OWL ontologiesFabric IQ Ontology (Preview); use external RDF tooling for standards/interoperability needs
Analytical semanticsdbt Semantic Layer, Cube, metric storesUnity Catalog metric viewsSemantic ViewsPower BI semantic models
ValidationSHACL/ShEx, graph unit tests, constraint enginesPipeline expectations plus graph-specific validation codeDMFs/constraints plus graph-specific validationOntology validation capabilities plus external SHACL where standards are required
Agent retrievalGraphRAG patterns, hybrid graph/vector/searchMosaic AI retrieval with governed graph-derived contextCortex Search/Agents with governed graph-derived contextFabric data agents/Foundry with ontology or graph-derived context

Choose by question: semantic models standardize metrics; ontologies formalize domain meaning; knowledge graphs operationalize connected claims. They can complement one another but are not interchangeable labels.

Implementation

How to achieve the K layer

1

Question

Write 5–10 competency questions and prove why joins, search, or the semantic layer alone are insufficient.

2

Identify

Define stable entity identifiers, source-of-truth rules, aliases, matching confidence, and correction workflow.

3

Model

Create the smallest ontology/schema that answers the questions; define time, provenance, cardinality, constraints, and authority.

4

Ingest

Map governed V data to claims; preserve source, transformation, version, time, confidence, and asserted/inferred status.

5

Protect

Authorize nodes, relationships, attributes, queries, exports, and sensitive inferences; prevent cross-boundary leakage.

6

Test

Validate constraints, golden queries, conflicts, stale facts, inference, migrations, performance, and unauthorized paths.

Evidence

  • Competency questions and approved ontology/schema versions
  • Identity-resolution rules, samples, precision/recall, correction log
  • SHACL/constraint and golden-query results
  • Claim provenance, time, authorization, conflict, and inference samples

Acceptance

  • Questions return correct, complete, source-linked answers.
  • Inferred facts are visibly distinct and reproducible.
  • Conflicts and expired claims do not silently become truth.
  • Unauthorized queries and sensitive inference are denied and logged.