
Features
For Data Providers
- Centralised request tracking – Gain full visibility and oversight of dataset requests for better governance and compliance.
- Zero-copy data sharing – Share datasets efficiently without duplicating or moving data.
For Data Consumers
- WOG-wide data discovery – Search and access over 250 datasets across Individual, Business and Sensor domains, with more being added.
- Multiple access methods – Choose how to work with data based on your needs:
  - Direct download of flat files.
  - Databricks integration – Analyse approved datasets and personal data within a secure analytics workbench (up to Confidential (Cloud-Eligible)/Sensitive Normal).
  - APIs – Programmatically retrieve, curate, and download datasets.
  - System connections – Connect directly to DataHive’s API and data warehouse for streamlined integration.
- Pseudonymisation support – Work securely with de-identified data for privacy-preserving analysis.
- Third-party data sharing – Facilitate onward data sharing with non-government entities by appointing GovTech as the implementing agency.
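To illustrate the general idea behind the pseudonymisation support listed above, here is a minimal sketch of keyed-hash pseudonymisation in Python. It is not DataHive's actual method, which this page does not describe. The function name `pseudonymise`, the field names, the sample records and the key are all hypothetical and chosen only for illustration.

```python
import hashlib
import hmac


def pseudonymise(identifier: str, secret_key: bytes) -> str:
    """Return a stable keyed pseudonym for an identifier (hypothetical helper).

    Assumption: HMAC-SHA256 is used here as one common pseudonymisation
    technique; the page does not say which technique DataHive applies.
    """
    # Normalise so that trivially different spellings map to the same token.
    normalised = identifier.strip().upper().encode("utf-8")
    return hmac.new(secret_key, normalised, hashlib.sha256).hexdigest()


# Illustrative key and records only; a real key would be stored and managed securely.
SECRET_KEY = b"demo-key-do-not-use-in-production"
records = [
    {"person_id": "ID-0001", "postal_sector": "12"},
    {"person_id": "ID-0002", "postal_sector": "34"},
    {"person_id": "id-0001 ", "postal_sector": "12"},
]

# Replace the direct identifier with a token and keep the analytical fields.
pseudonymised = [
    {
        "person_token": pseudonymise(r["person_id"], SECRET_KEY),
        "postal_sector": r["postal_sector"],
    }
    for r in records
]
```

Because the same identifier always maps to the same token under a given key, records can still be joined or counted per individual without exposing the original identifier. Anyone holding the key could recompute tokens, so the key itself would need to be protected.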
Roadmap
- Enhance security features to support system reclassification to Confidential (Cloud-Eligible) / Sensitive (High) levels.
- Introduce a low-latency Dataset API Data Product, prioritising high-usage, chargeable datasets for faster access.
- Enable integration with SaaS-based connectors (e.g. Power BI, Tableau, Qlik) to support direct access to data via familiar analytics tools.
- Increase dataset availability by simplifying the process for Data Providing Agencies (DPAs) to share their data catalogues on DataHive.
Techstack
React, TypeScript, Python, Snowflake, TileServer GL, Azure Databricks / Cosmos DB / OpenAI, Terraform, Azure
Last updated 22 May 2025

A Data Platform That Enables Faster, More Efficient Inter-Agency Data Sharing.