21

Algorithms for analyzing, generating and validating metadata in the JSON data format.

 

Main functions
  • Validation of catalog and dataset metadata.
  • Generation of reports on the content and validity of catalog and dataset metadata.
  • Conversion of metadata files to standard format (JSON).
  • Generation of monitoring indicators for catalogs and their datasets.
Description

Python algorithms that validate metadata from data catalogs of the portals linked to Argentina’s national open data portal.


 
This tool was developed by the Argentine Ministry of Modernization’s National Directorate of Public Data and Information, which currently uses it to decentralize and combine data from other government entities and publish it on the country’s main open data website (http://datos.gob.ar).


 
Upon integrating the National Public Administration’s data portals into a network node and combining the most relevant catalogs in a central portal, the challenge of translating various preexisting formats into a common metadata profile arose.


 
Therefore they expanded on the EU’s profile (dcat-ap v1.1) and generated the "pydatajson” library, which automates frequent metadata manipulation tasks.


 
Governments in other countries can adapt it for similar open data frameworks in their countries.

You can see the source code here:

https://github.com/datosgobar/pydatajson

Language
Python
OPEN CODE
Knowledge areas
Science and Technology
Regional Integration
Reform / modernization of the state