290 likes | 314 Views
The Importance of Databases in the Dissemination Process – The UNECE Approach. UNECE Training Workshop on Dissemination of MDG Indicators and Statistical Information Astana, Kazakhstan 23 – 25 November 2009 Steven Vale, UNECE. Contents. UNECE system overview Introduction to data cubes
E N D
The Importance of Databasesin the Dissemination Process– The UNECE Approach UNECE Training Workshop on Dissemination ofMDG Indicators and Statistical Information Astana, Kazakhstan 23 – 25 November 2009 Steven Vale, UNECE
Contents UNECE system overview Introduction to data cubes Input systems Data processing Dissemination systems 04 January 2020
What is a Data Cube? A multi-dimensional structure containing data points that represent unique combinations of several classifications A flexible way of storing and disseminating data 04 January 2020
Two-dimensional Cube 04 January 2020
Three-dimensional Cube 04 January 2020
More dimensions are possible,but not easy to display! 04 January 2020
Why Data Cubes are Important Many statistical data management models and systems are based on cubes Users can select just those data that are of interest Cubes can easily be expanded, e.g. for extra years, countries, or other categories At least in theory, cubes can have an infinite number of dimensions 04 January 2020
Input Systems • Functionality needed: • Bulk input of large data files • Automatic data collection routines • Data format conversion • Metadata capture and “translation” • Manual entry of data values • Link to electronic questionnaires • Data validation 04 January 2020
UNECE Approach • Automatic data collection each night from some important sources • File transfers in standard formats for other bulk updates • Questionnaires for some types of data • Automatic updates under development • Manual input / editing interface 04 January 2020
Data Processing • Functionality needed: • Data validation • Imputation of missing values • Calculation of derived variables • Calculation of regional aggregates, e.g. for CIS countries • Definition of data outputs 04 January 2020
UNECE Approach Create a “super cube” containing all data Use applications developed ourselves for validation, imputation and calculation High level programming language allows statisticians to develop and manage their own calculation routines Smaller output cubes are defined using metadata, and updated every night 04 January 2020
Dissemination Systems • Functionality needed: • Internet enabled • Easy access to key data • User-friendly interface • Multiple languages • Possibility to manipulate and download data 04 January 2020
Why UNECE adopted PC-Axis Lack of resources for system development PC-Axis advantages: Rich in features User-friendly Flexible structure Strong support network of users – over 40 other statistical organizations 04 January 2020
Europe Licenses (68) Basque (5) Croatia Denmark (9) Estonia Faroe Islands Finland (15) Åland Greece Greenland Iceland Ireland (2) Latvia Lithuania Macedonia F.Y.R. Norway Slovakia Slovenia (2) Spain (3) Ukraine, Lviv UNECE Sweden (18) PC-Axis Around the World Americas Licenses (3) Brasil Bolivia Guatemala Prospects Canada Guyana ArgentinaEl Salvador Costa Rica IMF Bahamas UNSD US Dep.Agric. Ecuador CountrySTAT in Projects (2006-2007) (2008-2009) Bhutan Ethiopia Haiti Iraq Malawi Mali Mozambique Palestine O.T. Philippines Sudan Tanzania Angola Benin Burkina Faso Cameroon Ethiopia Ghana Ivory Coast Kenya Malawi Mali Mozambique Nigeria Rwanda Senegal Tanzania Uganda Zambia Africa Licenses (14) AlgeriaMocambiqueNamibiaSouth AfricaTanzaniaUgandaEast Africa Commission West Africa (ECOWAS) UEMOAS (FAO) KenyaSenegal(FAO) Mali(FAO) Togo(FAO)Cap Verde Asia and Pacific Licenses (5) Philippines (2) Taiwan(R.O.C.) Bhutan(FAO) Iraq(FAO)New Zealand ProspectsHong Kong Tadjikistan Prospects UK ONS Cyprus Moldova Montenegro North Ireland Romania Serbia Kirgizistan (FAO) Ukraine Albania Switzerland UK Dep. Work&pens. FAO Forest Stat.
What We Have Added • Metadata input application • Data cube management application • Time Series Computation Language • PX-Web update server • Russian interface 04 January 2020
Metadata Input Application 04 January 2020
Open-source Components • Visual HTML Designer • Spell checker 04 January 2020
The User Interface • Uses “PX-Web” a component of thePC-Axis software suite produced by Statistics Sweden • Currently being upgraded to latest version • English and Russian interfaces • “Tree structure” to help users find data • Possibility to manipulate data and download in several formats
Plans for the Future Develop End-to-End UNECE applications: Data import Validation Processing Calculation Imputation Dissemination Develop online analytical tool 04 January 2020
New UNECE Database System • Under construction • Calculations and “Supercube” implemented • Expected to be fully operational end 2010
Technical Assistance • UNECE is happy to share software / experience • Russian speaking database coordinator • Technical assistance missions 2008/09 • Kazakhstan • Kyrgyzstan • Tajikistan