Overview
This page provides an overview of the process for publishing Risk Data Library Standard (RDLS) metadata.
Before publishing RDLS metadata, you need to format, package and host your risk datasets so that they can be accessed. For more information, see How to publish risk datasets.
The process for publishing RDLS metadata can be divided into two phases:
Prepare your metadata
You should prepare your metadata in Java Script Object Notation (JSON) format, which is the main metadata format used in data catalogues.
The recommended approach to prepare RDLS metadata is to use the RDLS Metadata Editor, it offers input guidance, validation and export functionality.
It is strongly suggested that you do not author RDLS metadata in JSON format ‘by hand’ as doing so is time-consuming and error-prone. However, if you do choose this approach, you ought to use a text editor with support for JSON formatting and schema validation, such as Visual Studio Code.
If you are exporting existing metadata from a data catalog or database, and you have access to a software developer, the suggested approach is to develop a data pipeline to transform your metadata to RDLS format.
In either case, if your risk datasets use terms from EM-DAT or the UNDRR Hazard Information Profiles, use the mappings in the hazard_type and process_type codelists to identify the equivalent codes in RDLS.
If you are publishing an access-restricted resource, see how to publish an access-restricted resource.
Develop a data pipeline
A data pipeline is a series of processing steps that transforms your existing metadata to RDLS format in bulk. It might be as simple as a Python script for a one-off transformation of simple metadata, or a scheduled pipeline created using dlt and dbt for periodic transformation of complex metadata.
If you plan to develop a data pipeline to transform existing metadata to RDLS format, you first need to identify how your existing metadata ‘maps’ to RDLS - that is, identifying which data elements within your metadata match which RDLS fields and codes. You then need to create a data pipeline that implements your mapping in code. To get help with creating a data pipeline, contact the RDLS team.
You need to ensure that your data is structured and formatted correctly according to the RDLS schema. Once you have prepared your RDLS metadata in JSON format, the next step is to validate it against the RDLS schema. You can validate individual files using the Metadata Editor. Batch validation functionality is not yet available, but is on the development roadmap.
Publish your metadata
The steps involved in publishing your RDLS metadata will depend on the specific data catalog or website to which you are adding your risk datasets.
The Risk Data Library Catalog is the official RDL repository. Datasets can be published here directly from the Metadata Editor. Note that this functionality requires a GitHub account.
The World Bank Data Catalog enables World Bank users to load custom metadata for a dataset. For more information, refer to the internal guidance for World Bank users.
For any data catalog that does not natively support RDLS, attach your RDLS metadata as a custom metadata file.
