Skip to main content
Back to Research

Foundational Research: Ontology-Native AI

An open language model for IES4 data

To our knowledge the first openly published language model fine-tuned for the UK Information Exchange Standard (IES), the 4D ontology behind defence and security data. It turns a plain-English scenario into IES-conformant RDF, and it is released as a research prototype with its dataset and evaluation harness so the field can check the work.

Term conformance

0 → 88.6%

vs the untuned base model

Hallucinated terms

0.937 → 0.010

near-eliminated

Released

Model + data

and full eval harness

The Challenge

The Information Exchange Standard (IES) is the shared 4D vocabulary for UK defence and security data, but authoring IES-conformant RDF by hand is slow and needs scarce ontology expertise. General-purpose models do not fill that gap. Asked to write IES Turtle, even a strong 30B code model invents terms that do not exist in the ontology roughly 94% of the time. In a standards context that is worse than no help at all: invalid data that looks plausible is harder to catch than data that fails outright.

What we built

The core idea is to never trust the model to invent structure. Instead we generate the structure correctly, then teach the model to reproduce it. Every training graph was built programmatically with the telicent-ies-tool and validated twice before training.

ComponentWhat it is
Correct-by-construction data1,589 IES graphs built programmatically with telicent-ies-tool across 14 scenario patterns, paired with natural-language descriptions.
Double validationEvery graph passed both the telicent validation and an independent term-membership check built from the published dstl/IES4 ontology.
Vocabulary and boundariesClass and property definitions from the ontology, plus refusal examples teaching what IES4 cannot express.
Generalisation sliceOntology-conditioned extraction pairs from Text2KGBench, so the model conforms to any ontology it is given, not only IES.
Honest evaluationHeld-out test split by target graph, plus an out-of-distribution set from the real published IES sample data.
Open releaseModel, LoRA adapter, full dataset and evaluation harness, all published for independent reproduction.

The results

Measured on held-out prompts against the untuned base model. The point of the fine-tune is not fluency, which the base model already has, but conformance: using real IES terms, in structurally valid ways.

MetricBase modelFine-tunedMeaning
IES4 term conformance0%88.6%terms used that exist in the ontology
Hallucinated-term rate0.9370.010fraction of ies: terms that are invented
Syntactic validity93.2%95.5%output parses as valid Turtle
Structural conformancen/a0.955subject and object types satisfy domain and range
Namespace fidelityn/a100%uses the namespace the caller specifies
Ontology-conditioned extraction75%91.7%conforms to a supplied non-IES ontology

Out-of-distribution performance, on rich scenarios taken from the real published IES sample data rather than our training patterns, is markedly lower (term conformance around 30%). We report that openly in the model card rather than hide it: it is the honest edge of what a small on-device fine-tune achieves today, and the reason the model is released as a prototype to draft from, not an oracle to trust blind.

Where it fits

The right architecture for standards work is neuro-symbolic: the model drafts, and a symbolic layer verifies. This model pairs directly with our IES-to-HQDM crosswalk and the open-ontologies validation engine, which check conformance deterministically. The language model removes the blank-page problem; the validators guarantee correctness. Neither is asked to do the other's job.

"A general model asked to write a niche standard will confidently make it up. The fix is not a bigger model, it is correct-by-construction data and a validator in the loop. Publish both, with the weak spots named, and the field gets a tool it can actually build on."

Fabio Rovai, Tesseract Academy

This is an independent, self-initiated demonstration built entirely on published open standards and open tools. It is not affiliated with, or endorsed by, the Ministry of Defence, the IES Working Group, Telicent or Dstl. IES4 is Crown Copyright, MIT-licensed; the model weights are MIT; the generalisation data derived from Text2KGBench is attributed under CC-BY-SA-4.0 in the model card.

Model, dataset and evaluation harness

Weights and LoRA adapter, the full training data, and the code that produced every number above.