190 likes | 211 Views
Learn how the Palestinian Central Bureau of Statistics is developing a data interoperability framework for efficient SDGs data dissemination and exchange, focusing on enhancing user access and understanding through interactive techniques and best practices.
E N D
2019 SDMX Global Conference 16 to 19 September 2019 Budapest, Hungary Palestinian Central Bureau of Statistics Data interoperability framework for SDGs data: dissemination and exchange HaithamZeidan Director of Dissemination and Documentation Department
Introduction • Statistics is very important for planning, development, research and analysis processes as it effectively contributes to decision-making in all sectors. • Official statistics needs to be presented in a clear and easily understandable manner by using interactive techniques, best practices and international standards (e.g., The Statistical Data and Metadata Exchange (SDMX)). • The disseminated information must be available in different forms via various media to enable easy access for users with minimum efforts. Even highly accessible published information cannot be fully useful if presented in a manner that is not easily understood • Users need to be provided with additional information (like metadata) to be able to understand and use information.
Background • The SDG information is often represented in variety of (usually incompatible) ways across different systems and organizations • Different organizations often handle data and metadata modeling differently • Prioritizing internal operational needs over data sharing • Having a specific applications in mind, instead of broader use and integration • There is no single “right” way of representing information • Some data structures are better suited for operational processes (e.g., capturing data from a survey or maintaining a civil registration database) • Others are being better suited for data sharing and dissemination (e.g., for the creation of data visualizations) • Challenge: Ensure all SDG data across PCBS are mapped to a common structure
SDG compilation process map Data modeling+ Validation SDMX and DSD
Main data modeling objective • Structure data in a way that enhances data interoperability, while flexibly accommodating the needs and priorities of PCBS • Have a common understanding of how to structure SDG data • Providing a data interoperability framework for SDG data (from collection to dissemination) • Foster efficiency, consistency and high-quality • Enable PCBS to more easily create user applications and data visualizations that directly interact with PCBS’s SDG database • https://indicators.ps/ focus on Data Visualization features, Dynamic Filtering, Export Feature and using a standard format by using long format to integrate it with SDGs common structure format
SDG multi-dimensional data model • PCBS and UNSD have jointly developed a multi-dimensional table “MDT” template for the compilation of statistical data from multiple sources • Highly reusable and conducive to data sharing • Focused on simplicity, so data is easily understood by a wide range of users and applications • Self-contained and stable over time • Based on the multi-dimensional SDMX information model • Incorporating standard definitions and classifications • Extensible to include PCBS-specific needs • Data platform and technology-independent • Already tested in by various PCBS teams and implemented in SDG databasebeing currently developed by PCBS in collaboration with Istat • Next Phase: expand to other statistical programmes within PCBS Like Census 2017 indicator in indicators.ps interactive website
Develop and maintain Multidimensional Data Creator application • Still working to Implement SDG indicator framework for PCBS on central database • The Application and Central database will have these features: • Validate individual sub-indicator templates • Collect and upload data templates • Conduct cross-checks and data integrity validations • Make data available in various formats (including CSV, PX and SDMX) for publication on various PCBS data dissemination platforms and applications • Data Visualization, dynamic filtering and export features
PCBS Platforms and Tools • PX-Web 2019 for data dissemination • Support databases • Data Visualization features • Improvements in output formats (HTML5 table, CSV, Relational table, Json, JSON-stat) • Dynamic Filtering • Export feature to different formats • ArcGIS online data dissemination (http://sdg-pcbs.opendata.arcgis.com) • ArcGIS Hub would be deployed together with ArcGIS Portal • Explore our data for SDGs data, access spatial indicators relating to a particular Goal • Maps and API Explore • Explore the SDGs via Story Maps
PCBS Platforms and Tools • https://indicators.ps • Indicators in particular highlights the results of Census 2017 • Indicators aligns with efforts to expand interactive, technology-driven data dissemination and visualization while advancing data accessibility • Using Long Format • Using Data Visualization techniques ( Line chart, Bar chart, Bubble chart) • Dynamic filtering from database with structured format, export feature • Will be integrated with SDGs platform and with other existing systems within PCBS
PCBS Platforms and Tools • Framework for metadata and microdatadocumentation that introduced in Palestinian central bureau of statistics (PCBS) for better documenting, preserving, anonymizing and disseminating of existing microdata, • This framework produced based on international standards: Data Documentation Initiative (DDI) and the Dublin Core Metadata Initiative (DCMI). • DDI provides the ability to describe a rich set of metadata in an XMLformat, with an emphasis on micro-data, but also allowing for tabular formats and multidimensional cubes.
PCBS Platforms and Tools • In the 3.0 version, DDI supports all phases of the lifecycle from a description of concepts and the survey instrument used to collect data to the end product held in a data archive and used for analysis. • DDI 3.0 also provides an XML format for micro-data and tabular/multi-dimensional data, but very often the data is held in text or statistical software specific binary files. The user-configurable aspects of DDI ("variables") are mixed with specific metadata fields.
Future Work • Ensure all SDG data across PCBS are mapped to a common structure • Expand to other statistical programmes within PCBS • Integrate indicators.ps with SDGs platform and with other existing systems within PCBS • DDI/SDMX Overlap: These two standards are well aligned means that they can be combined in powerful ways, and that users of the two standards can move data from one standard format to the other fairly easily.