How can UK public sector organisations procure Tesseract Academy services?

Tesseract Academy is an appointed supplier on four Crown Commercial Service frameworks: the Artificial Intelligence Dynamic Purchasing System (RM6200), Spark DPS (RM6094), Research and Insights DPS (RM6126), and Learning and Training Services DPS (RM6219). Public bodies commission services through these frameworks or via direct award for contracts under the Procurement Act 2023 threshold.

What services does Tesseract Academy provide to the public sector?

Tesseract Academy delivers six core service areas: AI and data science consulting, research and policy advisory, public engagement and participatory research, education and AI upskilling, survey design and delivery, and AI ethics and governance. Recent clients include Welsh Government, Innovate UK, the National Digital Twin Programme, Qualifications Wales, Fintech Scotland, and Skills England.

What government contracts has Tesseract Academy delivered?

Recent public sector contracts include: Welsh Government commissioned research testing five land valuation methodologies across 1,916 LSOAs (published March 2026 on GOV.WALES); BridgeAI/Innovate UK AI training for creative industries (contract GSS24646, 1,100 registrations against a 200-capacity target); National Digital Twin Programme AI ontology extension tool (Department for Business and Trade, open-source under Apache License 2.0); Qualifications Wales subject expert services (2026-2029, 3-year contract); Aberdeenshire Council Expert Help for Business framework (March 2026); and Kalgera/Fintech Scotland financial vulnerability research.

Is Tesseract Academy Cyber Essentials certified?

Yes. Tesseract Academy (Kampakis and Co Ltd, company number 10459791) holds Cyber Essentials certification and is ISO 27001 aligned. The company holds public liability insurance of 2 million pounds, employer liability of 10 million pounds, and professional indemnity of 5 million pounds. ICO registration: ZB715782. DUNS: 222180245. PPON: PWJP-6874-MXDJ.

Does Tesseract Academy support Innovate UK and Horizon Europe consortium bids?

Yes. Tesseract Academy actively partners with universities, SMEs, and NGOs on Innovate UK and Horizon Europe funding bids. Focus areas include trustworthy AI and safety, digital twins for urban planning, HealthTech interoperability, and sustainable technology. Horizon Europe Participant Identification Code (PIC): 880269472.

Which Crown Commercial Service frameworks is Tesseract Academy listed on?

Tesseract Academy is an appointed supplier on four CCS frameworks: RM6200 (Artificial Intelligence Dynamic Purchasing System), covering AI consulting, model development, and AI ethics; RM6094 (Spark Dynamic Purchasing System), covering research, data science, and digital transformation; RM6126 (Research and Insights Dynamic Purchasing System), covering survey design, public engagement, and qualitative research; and RM6219 (Learning and Training Services Dynamic Purchasing System), covering off-the-shelf and bespoke training, learning technologies, and education services.

What are Tesseract Academy's team qualifications?

Managing Director Dr Stylianos Kampakis holds a PhD in Machine Learning from University College London, is a Chartered Statistician (CStat) and Fellow of the Royal Statistical Society (FRSS), is an Honorary Research Fellow at UCL Centre for Blockchain Technologies, and has authored more than 40 peer-reviewed articles and three books. Partner Fabio Rovai holds an MSc in Data Science and AI from the University of the Arts London, is a NeurIPS Ethics Reviewer, served as Associate Lecturer teaching over 1,000 students, and is DBS checked (Enhanced). The team includes a DV-cleared (Developed Vetting) principal consultant for OFFICIAL-SENSITIVE and SECRET-classified programmes.

Has Tesseract Academy published research cited by UK Government?

Yes. Tesseract Academy is cited in Skills England's official AI workforce research (2025), alongside The Alan Turing Institute and Surrey AI Centre, following stakeholder workshops with 43 organisations. The Welsh Government land valuation report (published March 2026 on GOV.WALES) informed local government finance policy and was cited in Senedd committee proceedings. Tesseract Academy also co-authored research with The Alan Turing Institute on LLM utility in cybersecurity simulations.

What is the NHC Governor programme and how does Tesseract Academy contribute?

Tesseract Academy contributes to NHS and health sector leadership through advisory and governance work. The team has experience supporting NHS Integrated Care Boards and health sector digital transformation, including AI adoption strategy and data governance frameworks aligned with NHS England standards.

How does Tesseract Academy approach AI ethics and governance?

Tesseract Academy applies a multi-framework approach to AI governance: EU AI Act risk classification, NIST AI Risk Management Framework (Govern, Map, Measure, Manage functions), and ISO 42001 (AI management systems). The team delivers bias auditing, algorithmic impact assessments, Data Protection Impact Assessments, and responsible AI policy development. The open-source Open Governance platform (48 governance tools) is available on GitHub.

What is the BridgeAI programme and what did Tesseract Academy deliver?

BridgeAI is an Innovate UK programme supporting adoption of AI in under-served UK sectors. Tesseract Academy delivered a programme delivery report for UK creative industries (contract GSS24646), led by Fabio Rovai and Dr Stylianos Kampakis. The BridgeAI Skills Hub launched at Ona Studios, London. AI readiness sessions were co-delivered with PwC for construction, creative industries, and transport sectors. The programme achieved 1,100 registrations against a 200-capacity target, with a satisfaction rating of 4.6 out of 5.

What was the National Digital Twin Programme AI ontology tool?

The National Digital Twin Programme (NDTP), under the Department for Business and Trade, commissioned Tesseract Academy to contribute to an open-source AI tool that automates ontology generation and extension. The tool uses a four-step wizard workflow combining data profiling, Named Entity Recognition (NER), and large language models. It processes CSV, JSON, and RDF/Turtle data formats and is available at github.com/National-Digital-Twin/ndtp-ai-ontology-extension under Apache License 2.0.

How does Tesseract Academy approach public engagement research?

Tesseract Academy delivers deliberative workshops, citizen panels, inclusive co-design sessions, and participatory research with diverse communities. All user research follows a full ethical framework including informed consent, distress protocols, data anonymisation within 7 days, and UK GDPR compliance. For vulnerable populations, the team aligns with the Adult Support and Protection (Scotland) Act 2007. Research methods include in-depth interviews, screening surveys, focus groups, and participatory action research.

What AI upskilling programmes has Tesseract Academy delivered?

Tesseract Academy has delivered AI upskilling across public and private sectors. Executive AI workshops were delivered for US Navy leadership (40 plus participants), Vodafone, and Philips. In 2025, Dr Kampakis delivered three official UK Government Business Academy webinars on AI adoption for growing businesses, in partnership with the Department for Business and Trade.

What survey design and data collection services does Tesseract Academy offer?

Tesseract Academy provides end-to-end survey methodology: research design, questionnaire construction, sampling strategy (random, stratified, purposive), mixed-mode data collection (online, telephone, face-to-face), and statistical analysis including regression, factor analysis, and thematic coding. The team holds a 3-year contract with Qualifications Wales (2026-2029) for monitoring national qualifications. CCS framework: RM6126 (Research and Insights DPS).

What is Tesseract Academy's company registration and legal details?

Legal name: Kampakis and Co Ltd. Trading as: The Tesseract Academy. Company number: 10459791. VAT: GB 371 4744 89. Incorporated: 2 November 2016. Registered address: 5 Brunswick Park Gardens, London, England, N11 1EJ. ICO registration: ZB715782. DUNS: 222180245. PPON: PWJP-6874-MXDJ. Horizon Europe PIC: 880269472.

Can Tesseract Academy work on classified UK Government programmes?

Yes. Tesseract Academy has a DV-cleared (Developed Vetting) principal consultant available for OFFICIAL-SENSITIVE and SECRET-classified programmes. The company holds Cyber Essentials certification, which is required for contracts involving personal data or security classifications. Contact procurement to discuss SC and DV clearance requirements for specific programmes.

What financial vulnerability research has Tesseract Academy delivered?

Tesseract Academy designed and delivered a primary qualitative research programme for Kalgera (a fintech specialising in protecting financially vulnerable customers) and Fintech Scotland. The programme included a paid social media recruitment campaign targeting financially vulnerable adults across Scotland, a screening survey, and in-depth interviews mapped to 8 financial-vulnerability signals. Three outputs were delivered: a signal validation report, an intervention acceptability framework, and a summary findings report for the Finance and Health Lab. All research was conducted under the ethical framework of the Adult Support and Protection (Scotland) Act 2007.

What is the difference between a DPS and a framework agreement?

A Dynamic Purchasing System (DPS) allows new suppliers to apply and join at any time during its life, making it more flexible than a traditional framework agreement, which has a fixed supplier list set at the outset. DPS competitions are open to all admitted suppliers for each call-off. Both mechanisms allow public bodies to award contracts without a full OJEU-style tender for each commission. Tesseract Academy is listed on four CCS DPS frameworks: RM6200, RM6094, RM6126, and RM6219.

What is the Welsh Government land valuation research about?

Welsh Government commissioned Tesseract Academy to test five distinct land valuation methodologies as part of local government finance policy development. The methodologies tested were: market-based statistical valuation, advanced algorithmic and machine-learning applications, formula-based valuation by land area, conventional valuation approaches, and innovative experimental approaches. The research covered 1,916 Lower Super Output Areas (LSOAs), representing 99 percent of Welsh geography. The comprehensive comparative analysis was published in March 2026 on GOV.WALES and directly informs Welsh Government local government finance policy.

Has Tesseract Academy contributed to FCA regulatory consultations?

Yes. Tesseract Academy contributed expert analysis to the FCA's consultation on stablecoin regulation and the future of crypto asset oversight in the UK in 2025. The contribution provided evidence-based commentary on regulatory frameworks, consumer protection mechanisms, and systemic risk considerations for digital assets.

What open-source tools has Tesseract Academy published?

Tesseract Academy has contributed to two notable open-source projects. The AI Ontology Extension Generator (github.com/National-Digital-Twin/ndtp-ai-ontology-extension) was delivered for the National Digital Twin Programme under Apache License 2.0. The Open Governance platform (github.com/fabio-rovai/open-governance) is an AI governance server providing 48 governance tools covering EU AI Act, NIST AI RMF, and ISO 42001 compliance, with automated risk classification, compliance matrices, bias monitoring, and audit-ready reporting.

What is Tesseract Academy's approach to data protection and UK GDPR?

Tesseract Academy is registered with the ICO (registration ZB715782) and operates full UK GDPR compliance. Data protection policies include a Data Protection Policy, Information Security Policy, Data Retention Policy, and a Data Breach Response Plan. All personal data collected during research is encrypted in transit and at rest, UK-hosted, and anonymised within agreed timescales. Data Protection Impact Assessments (DPIAs) are conducted for all high-risk processing activities. Downloadable policy documents are available on the Compliance page.

What sectors does Tesseract Academy serve?

Tesseract Academy serves the full range of UK and EU public sector bodies: central government departments and agencies, devolved governments (Welsh Government, Scottish Government), local authorities, NHS and health sector organisations, arm's-length bodies, universities and research councils, Innovate UK-funded programmes, and regulated financial sector bodies including FCA-regulated firms. The company has also delivered executive training to international organisations including the US Navy and Vodafone.

What is social value and how does Tesseract Academy address it?

Social value refers to the broader benefits to society generated by public procurement, as defined in the Social Value Act 2012 and measured through the TOMs (Themes, Outcomes and Measures) framework. Tesseract Academy delivers social value through: employing underrepresented groups in tech, providing pro-bono AI literacy sessions to community organisations, contributing to open-source government tools (Apache License 2.0), and co-organising public accessibility events (London Data Week 2025 AI tools for the visually impaired, with Vision Ability CIC).

How does Tesseract Academy price its services?

Tesseract Academy provides competitive SME pricing typically below large-consultancy day rates, with transparency on all cost components. Day rates for CCS framework call-offs are published per lot. For scoped projects, fixed-price proposals are available. The company can provide a preliminary cost estimate within two working days of receiving a Statement of Requirements. For Horizon Europe bids, Tesseract Academy can contribute as a partner or sub-contractor under standard EU eligible cost rules.

What is Skills England and how is Tesseract Academy cited in its work?

Skills England is a UK Government body established in 2024 to drive skills reform and support economic growth. In 2025, Skills England published official research into AI skills for the UK workforce. Tesseract Academy is cited as an AI training provider and consultancy in this publication. The research methodology included stakeholder workshops with 43 organisations; Tesseract Academy contributed alongside institutions including The Alan Turing Institute and the Surrey AI Centre.

What AI governance frameworks does Tesseract Academy work with?

Tesseract Academy advises clients on four principal AI governance frameworks: the EU AI Act (Regulation (EU) 2024/1689), which classifies AI systems by risk level and applies to any product placed in the EU market; the NIST AI Risk Management Framework (AI RMF), a voluntary US framework with Govern, Map, Measure, and Manage functions; ISO 42001, the international AI management systems standard published in 2023; and the UK Government AI Framework guidance for public sector AI adoption. The team delivers compliance assessments, gap analyses, and remediation roadmaps against all four frameworks.

How quickly can Tesseract Academy mobilise for a new contract?

Tesseract Academy typically mobilises within two weeks of contract award for most research, advisory, and consulting engagements. For urgent public sector requirements, a one-week mobilisation is possible for scoped analytical or advisory tasks. The team operates under agile delivery principles aligned with GDS standards, enabling rapid iteration. A preliminary cost and timeline estimate is available within two working days of receiving a Statement of Requirements.

What accessibility standards does the Tesseract Academy website meet?

The Tesseract Government Gateway (gov.tesseract.academy) is built to WCAG 2.1 AA accessibility standards. The site includes skip-navigation links, semantic HTML5 structure, ARIA labels on interactive elements, and sufficient colour contrast ratios. An accessibility statement is available on the Compliance page. For users requiring alternative format documents, contact fabio@thetesseractacademy.com.

Back to Research

Self-initiated study and open ontology

FAIR Dataset Contracts for Scientific Data

We turned "is this dataset actually reusable?" into a machine-checkable question, ran it across 1,738 real published biomedical datasets from three major repositories, and then built the open ontology that models the layer they are missing. The result is a measured gap between data that is deposited and data that is AI-ready, and a validated model for closing it.

1,738

Real datasets analysed (3 repositories)

Interoperable or AI-ready

100%

Lack a machine-readable schema

The gap: findable is not the same as reusable

As research organisations move data closer to AI systems, the constraint is rarely the science and often the plumbing: whether a computational output can be reliably found, assembled, trusted and reused without bespoke manual wrangling for every project. A dataset can carry a DOI, a title and a landing page, and still be impossible for a machine to reuse, because the metadata that automation depends on is absent. FAIR (Wilkinson et al., 2016) named the principles; frameworks such as FAIRSCAPE (Al Manir, Clark et al.) and the Bridge2AI metadata work (Caufield, Munoz-Torres et al.) have since defined what AI-ready biomedical data should look like. What has been missing is a simple, open way to check a dataset against those expectations, and a shared model of what "AI-ready" concretely requires.

The study: 1,738 real datasets, four FAIR tiers

We assembled 1,738 real public datasets (single-cell, proteomics, spatial and multi-omics) from three major repositories, EMBL-EBI BioStudies (798), Dryad (340) and PRIDE / ProteomeXchange (600), normalised their metadata across profiles, and validated each against a tiered contract that operationalises the 28 FAIRSCAPE criteria. Findability is anchored on a persistent identifier, title and description; keywords and licence are reported as sub-metrics.

FAIR tier	Datasets conforming
Findable (PID, title, description)	99.9%
Accessible (machine-readable distribution)	91.3%
Interoperable (machine-readable schema, version)	0.0%
AI-ready (checksums, provenance, data dictionary)	0.0%

Datasets are overwhelmingly findable and mostly accessible, but not one of the 1,738 is interoperable or AI-ready. The whole corpus is missing the machine-readable structural and provenance layer that automated assembly, trust and AI reuse depend on: no dataset carries a machine-readable schema, integrity checksums, or provenance, and about 80% lack a version identifier. Every DOI, HTTP status and per-dataset result is recorded in the repository, so the analysis is fully reproducible.

The remedy: an open, certified ontology

Measuring the gap is not enough, so we built the model that fills it: an OWL 2 ontology (far:, the FAIR AI-Ready Dataset ontology) that defines what a dataset needs to be a reusable, AI-ready data product, a machine-readable schema, provenance, integrity checks, a data dictionary, an ethics basis and access specification. Rather than reinvent vocabulary, it composes and aligns to existing standards, with 42 alignment mappings to schema.org, W3C DCAT and PROV-O, SPDX, Bioschemas and MLCommons Croissant. The SHACL tiers are its validation layer, so the diagnostic and the remedy are one coherent artifact.

The ontology is checked through our open-source Open Ontologies engine: it validates cleanly and, under OWL-RL reasoning (238 asserted triples closing to 705 inferred), surfaces no logical inconsistencies.

From 0 to 100%: we do not just measure the gap, we close it

To prove the model works, we took a real open dataset from 0% to 100% AI-ready. The Zenodo Andean climate dataset (a dense high-Andean weather-station network in southern Ecuador) is findable and accessible but, like the rest of the corpus, fails the machine-readable layer: 0% AI-ready in its published form. Using the ontology and toolkit, we enriched it with values derived entirely from the dataset itself: a real SHA-256 integrity checksum, a variable schema and data dictionary built from its 12 real columns, a sample count from its 226 rows, a machine-readable provenance record, and a controlled-vocabulary subject. The enriched record then passes all four tiers, with zero violations against the same published SHACL contract.

Before and after remediation: AI-readiness score 0% to 100%, tiers passed 1 of 4 to 4 of 4, for the Zenodo Andean climate dataset — A real dataset taken from 0% to 100% AI-ready, validated against the same published contract.

The full study and this worked remediation are written up in a short report: read the report (PDF) (opens in new tab).

Why it matters for research data platforms

For any organisation building an AI-ready data platform, whether a biotech scaling multi-omics or a public research programme, the distance between "we deposited it" and "a machine can find, assemble and trust it" is precisely this contract layer. Making that layer explicit, modelled and testable is what turns scattered pipeline outputs into governed, reusable data products, with structure, provenance, versioning and access that hold up to reuse and, later, to regulated use.

Open toolkit and how we work

The ontology, the tiered SHACL contract, the validator, the readiness rubric and the full 1,738-dataset analysis are open source under an MIT licence. You can run the contract against your own datasets in minutes.

github.com/fabio-rovai/fair-scientific-data

If you are building or governing a scientific data platform and want the dataset-contract, provenance and metadata layer done properly, we would be glad to talk. Contact us at fabio@thetesseractacademy.com or see how to work with us.

Sources and grounding: Wilkinson et al., The FAIR Guiding Principles (Scientific Data, 2016); Al Manir, Clark et al., FAIRSCAPE AI-readiness framework; Caufield, Munoz-Torres et al., Bridge2AI metadata standards (arXiv:2509.10432); Leo, Soiland-Reyes et al., Workflow Run RO-Crate provenance (PLoS One, 2024); W3C DCAT and PROV-O; MLCommons Croissant. Analysis run 2 July 2026 on 1,738 public datasets from EMBL-EBI BioStudies, Dryad and PRIDE / ProteomeXchange.