A Maturity Model for Urban Dataset Metadata

Authors

  • Mark S Fox, Professor University of Toronto
  • Bart Gajderowicz, Dr University of Toronto
  • Dishu Lyu, Mr University of Toronto

DOI:

https://doi.org/10.2218/ijdc.v19i1.906

Abstract

The rapid increase in published datasets has intensified challenges in sourcing and integrating relevant data for analysis. Persistent obstacles include poor metadata, ineffective presentation, and difficulties in locating and integrating datasets. This paper delves into the intricacies of dataset retrieval, emphasising the pivotal role of metadata in aligning datasets with user queries. Through an exploration of existing literature, it highlights prevailing issues, such as identifying valuable metadata and developing tools to maintain and annotate them effectively. The paper proposes a dataset metadata maturity model, inspired by software engineering frameworks, to guide dataset creators from basic to advanced documentation. The model encompasses seven pivotal dimensions, spanning content to quality information, each stratified across five maturity levels to guide the optimal documentation of datasets, ensuring ease of discovery, accurate relevance assessment, and comprehensive understanding of datasets. This paper also incorporates the maturity model into a data cataloguing tool called CKAN through a custom plugin, CKANext-udc. The plugin introduces custom fields based on different maturity levels, allows for user interface customisation, and integrates with a graph database, converting catalogue data into a knowledge graph based on the Maturity Model ontology.

Downloads

Download data is not yet available.

Author Biographies

  • Mark S Fox, Professor, University of Toronto

    Professor Mark S. Fox, is a Distinguished Professor of Urban Systems Engineering at the University of Toronto, Professor of Industrial Engineering and Computer Science, and Director of the Centre for Social Service Engineering (CSSE). He is also a Fellow of the Association for Artificial Intelligence and Institute for Electrical and Electronic Engineering. His expertise in Artificial Intelligence is in Knowledge Representation and Reasoning, with a focus on Ontologies and Constraint-Directed Reasoning. Of relevance to this project, he was a member of the original Hearsay-II team that developed the blackboard model (Hayes-Roth, Mostow & Fox, 1978) and led the development of the Common Approach project’s Common Impact Data Standard (Fox & Ruff, 2022) (CIDS) which has been adopted by Employment and Social Development Canada for reporting impact by the agencies they fund. He also led the development of the Compass Project Ontology. Dr. Fox is also the editor of the ISO/IEC 5087 city data model series of standards.

  • Bart Gajderowicz, Dr, University of Toronto

    Dr. Bart Gajderowicz is the Executive Director and a Research Associate at the Urban Data Centre. His research goals focus on developing tools and methods for data-driven policymaking in the social service domain. He manages the Urban Data Repository and Catalogue project (CUDR), is a co-author of the Common Impact Data Standard (CIDS), is the lead researcher on SMILE, an explainable AI language model for measuring impact, and the development of tools for data translation, consolidation, and analysis. In 2019, he completed his Ph.D. in industrial engineering from the University of Toronto, creating a high-fidelity simulation environment (BRAMA) for emotion-based reasoning of social services clients and an ontology of social service needs (OSSN).

Published

2025-09-16

Issue

Section

Research Papers