Defence Data Catalogue: You Cannot Exploit What You Cannot Find
Overview
The MOD generates and holds an extraordinary volume of data. Personnel records, logistics databases, intelligence holdings, engineering specifications, operational reports, training records, financial systems, sensor outputs — the list is vast and growing. The problem is not that the data does not exist. The problem is that nobody knows what data exists, where it is, what it contains, who owns it, how current it is, or how to get access to it. Thousands of datasets sit in organisational silos across Defence, invisible to the analysts, AI systems and decision-makers who could exploit them. The Defence Data Catalogue exists to solve that problem.
The Defence Data Catalogue is a metadata catalogue that enables discovery of Defence data assets. Users can search for datasets, understand their content through metadata descriptions, and request access through appropriate channels. The Catalogue is part of the Data Strategy for Defence implementation and addresses data silos by making assets discoverable across organisational boundaries. It integrates with the Defence Information Environment and emerging data platforms, with governance through the Defence Data Office ensuring quality and accessibility.
The Catalogue is operational and evolving, with coverage expanding across the Defence data estate. It integrates with emerging data platforms and services, providing the foundation upon which data-driven decision-making, analytics and AI exploitation depend. Without it, every data request begins with the same question: does this data even exist, and if so, where?
Strategic Purpose and Objectives
Making Defence Data Discoverable and Exploitable
Data exploitation requires knowing what data exists. The Catalogue enables the discovery that is essential for analytics, AI and decision support. Key capabilities include asset discovery and search, metadata management, access request workflows, data quality indicators, and lineage and provenance tracking. These functions transform the Defence data estate from an opaque collection of siloed datasets into a searchable, governed resource that users across Defence can discover and request access to.
The Catalogue is a foundation for data-driven Defence. The Defence AI Strategy depends on accessible training data — machine learning algorithms cannot be developed without curated, labelled datasets, and analysts cannot exploit data they cannot find. By cataloguing what data exists and describing its content, quality and access requirements, the Catalogue removes the first and most fundamental barrier to data exploitation: ignorance of what is available.
The governance dimension is equally important. The Defence Data Office ensures that data assets registered in the Catalogue meet quality standards, that metadata is accurate and current, and that access controls are appropriate. This governance prevents the Catalogue from becoming a dumping ground of poorly described, inaccessible data, ensuring instead that it serves as a reliable, curated directory of Defence data assets.
Budget and Financial Structure
Programme Value
The Data Catalogue is delivered as part of Defence Data Office investments. Technology costs are modest, with the emphasis on governance and metadata curation rather than infrastructure. The programme is integrated with broader data infrastructure investments across Defence Digital. The real cost is not the technology platform but the sustained effort required to register, describe and govern the data assets across Defence.
Budget Division and Holder
Defence Digital provides technology delivery. The Defence Data Office manages governance and curation. Data owners across Defence are responsible for registering their assets. The Chief Data Officer holds budget authority through the Defence Data Office.
Procurement and Acquisition
Acquisition Pipeline
The Data Catalogue is operational with continuous improvement. Coverage is expanding across the Defence data estate as more organisations register their data assets. Integration with emerging data platforms and services is ongoing.
Tender Information
Technology platforms are procured through standard commercial frameworks. The governance function is delivered internally through the Defence Data Office. Platform contracts are managed through standard procurement channels.
Why It Matters
The Defence Data Catalogue matters because data is the foundation of modern military capability, and you cannot exploit what you cannot find. Every ambition in the Defence AI Strategy, every use case in the Data Strategy for Defence, every requirement for data-driven decision-making begins with the same prerequisite: someone needs to find the right data. Without the Catalogue, that search involves phone calls, emails, personal contacts and luck. With it, it involves a search query and a metadata description that tells the user whether the dataset meets their needs before they invest time in obtaining access.
The programme’s significance grows as Defence becomes more data-dependent. As AI systems require training data, as analytics tools require input datasets, as decision-support systems require real-time information feeds, the demand for discoverable, described, accessible data increases exponentially. The Catalogue is the mechanism that matches this demand with the supply of data that already exists across Defence — turning a fragmented, invisible data estate into a discoverable, exploitable resource.
For industry, the Defence Data Catalogue creates opportunity in data management platforms, metadata tooling, data governance solutions, catalogue technology, search and discovery tools, data quality assessment and integration services. Companies with expertise in enterprise data cataloguing, metadata standards and data governance will find sustained demand as the MOD expands coverage across its data estate. The programme also creates demand for consultancy services to help Defence organisations register, describe and govern their data assets.

.png)
.png)
.png)

