
The SDMX Metadata AI Assistant (MAIA) is a Python-based tool powered by OpenAI GPT, specifically designed to address the challenges of managing and editing statistical metadata. Leveraging AI, SDMX MAIA simplifies metadata workflows by automating key tasks such as checking syntax, consistency checks and formatting adjustments.
The SDMX Metadata AI Assistant (MAIA) is a Python-based tool powered by OpenAI GPT, specifically designed to address the challenges of managing and editing statistical metadata. Leveraging AI, SDMX MAIA simplifies metadata workflows by automating key tasks such as checking syntax, consistency checks and formatting adjustments. MAIA provides an efficient Streamlit app interface to edit, validate, and format metadata automatically, simplifying metadata management within statistical production environments. Fully compliant with the Statistical Data and Metadata eXchange (SDMX) standard, MAIA ensures seamless integration into statistical pipelines, supporting the efficient exchange and harmonisation of metadata.
The SDMX Metadata AI Assistant (MAIA)’s primary use cases include enhancing the quality and readability of metadata. Users can upload their data through SDMX-ML files along with their Data Structure Definitions (DSDs), select attributes for editing and allow MAIA to automatically validate and refine the content. By offering features such as customisable AI assistants, interactive before/after comparisons and dynamic reporting in SDMX-ML format, MAIA empowers statisticians to maintain high-quality metadata while saving time and resources. Whether addressing metadata consistency issues, aligning metadata with statistical frameworks or preparing data for dissemination, MAIA is a robust and user-friendly solution for modern statistical pipelines.
MAIA operates directly on SDMX-ML 2.1 metadata and Data Structure Definition (DSD) files. It uses the structural information in the DSD to ensure attribute alignment and proper parsing.
Through the pysdmx library, MAIA can read, modify, and output SDMX files while maintaining schema validity. This tight integration makes it ideal for use in national and international statistical systems that adopt SDMX standards for metadata exchange.
At its core, MAIA employs OpenAI’s GPT models to analyze and refine SDMX metadata. The engine applies predefined and customizable prompts to:
Each execution generates both before/after metadata views and a performance log capturing timestamps, assistant model, and processing duration.
SDMX MAIA is designed with the following features:
📝 Metadata editing
🤖 Custom AI assistant
🔗 End-to-end SDMX integration
📊 Transparency
🔍 Before/after interactive comparison
🖥️ Streamlit UI
📊 Run status and metrics
This application is fully developed in Python and is built upon four key dependencies:
Using SDMX MAIA involves the following workflow:


Access to MAIA source code and installation instructions is available upon request by sending an email to contact.sdmx.io@bis.org .
An installable package will be made available in the near future.
Users must ensure that no confidential or restricted information is uploaded. Files are processed through the OpenAI API, and institutional data-handling policies apply.
This project is licensed under the Apache 2.0 License.