280 likes | 438 Views
Conversions from national grid data to harmonized European grid data. EFGS Lisbon 12-14 October 2011 Production and challenges Rina Tammisto, Senior Statistician , Statistics Finland Marja Tammilehto-Luode , Chief Adviser , Statistics Finland. Harmonization.
E N D
Conversionsfrom national grid data to harmonizedEuropeangrid data EFGS Lisbon 12-14 October 2011Production and challenges Rina Tammisto, Senior Statistician, Statistics Finland Marja Tammilehto-Luode, ChiefAdviser, Statistics Finland
Harmonization Data harmonization Spatialharmonization A gridnet covers the whole of Europe • Source data • Georeferencednational data • DisaggregatedEuropean data • Methodsused • Aggregated • Disaggregated • Hybridmethod
ETRS89-LAEA Grid NetDownloadable ZIP • http://www.efgs.info/data/GEOSTAT-1km-Grid.zip/view • Grid_ETRS89_LAEA_1K.shp • Abt. 500 Mt
ETRS89-LAEA Grid Net ETRS89-TM35FIN Grid Net
ETRS89-LAEA ETRS89-TM35FIN
LAEA gridnet in relation to national gridnetin Finland LAEA gridnet in relation to national gridnet in Austria
Differences in locations of grid cells in different projections (or co-ordinate systems) • A grid cell produced by using the national ETRS89-TM35FIN co-ordinate system and projection is divided among several ETRS89-LAEA grid cells • Direct derivation between different co-ordinate systems or projection is not usable • grids are located differently in relation to each others A issue to be solved: How to use national griddatasetswhile the directconversion is notrelevant…?
Tested method 1. Aggregation of grid data by using converted building points • 1) Georeferencedsource data is converted • Buildingsareconvertedfrom ETRS89-TM35FIN to ETRS89-LAEA • 2) Convertedbuildingpointsarejoinedwith the ETRS89-LAEA gridnet • 3) Aggregation of statistical data
Building points in ETRS89-TM35FIN Building points in ETRS89-LAEA Aggregation of statistical data
Method 1 Advantages Disadvantages Doublesets of primary data Doubleproductionprocessesfrom the beginning Risk of data disclosure – due to use of severalco-ordinatesystems- gapsbetweendatasets • Pointseasilyconvertible –originalquality of locationmaintained • Fromgeostatisticalpoint of view data qualitythroughly the same as in national data
Testedmethod 2. Conversion of grid data by using ready-made national grid datasets • 1) Ready-made national griddataset in ETRS89-TM35FIN is converted into ETRS89-LAEA • Polygon to Point – using the middlepoints of national gridcells • Conversion of the middlepoints of grids • 2) Convertedpointsarejoinedwith the ETRS89-LAEA gridnet • 3) Aggregation of statistical data
PRODUCTION OF THE NATIONAL GRID DATA CONVERSION OF THE POINTS, SPATIAL JOIN WITH ETRS89-LAEA GRID NET MIDDLE POINTS OF NATIONAL GRIDS AGGREGATION OF STATISTICAL DATA
Effects of the grid cell size on the quality of the conducted data • Testedgridcellsizes: National grid data: - 125 m x 125 m – highestresolution data - 250 m x 250 m - 1 km x 1 km Reference data: Data producedbyusingmethod 1; (conversion made on buildingpoints) Additionaltest: JRC/GISCO disaggregated data – data produced for the Finnish Grid Database
250 m ETRS89-LAEA from 250 m grids
125 m ETRS89-LAEA from125 m grids
ETRS89-LAEA frombuildingpoints ETRS89-LAEA from 1 km grids ETRS89-LAEA from125 m grids ETRS89-LAEA from 250 m grids POP/KM²
Comparison of the testdatasets • Statistics: • Number of grids, mean (inhabitants/gridpopulatedgridcell), totalnumber of inhabitants in the dataset, min, max
Datasetfromconvertedbuildingpoints Datasetfromconvertedgridpoints
Identityline (the 45 degreeline) Values of converteddatasetin relation to values of national datasets 125 m ETRS89-LAEA from125 m grids 250 m ETRS89-LAEA from250 m grids 1 km ETRS89-LAEA from1 km grids ETRS89-LAEA frombuildingpoints ETRS89-LAEA frombuildingpoints Evaluation of differencesbyusingabsolutevalues of inhabitants/km² gridcell(absolutevalues of differences) ETRS89-LAEA frombuildingpoints Dis.agg. ETRS89-LAEA disaggregate data ETRS89-LAEA frombuildingpoints
DIFFERENCES (abs.values) betweenmethod 1 data (from LAEA buildings) to deriveddatasets DIFFERENCES (abs.values) betweenmethod 1 data (from LAEA buildings) to JRC/GISCOdisaggregated data
Method 2 Advantages Disadvantages Geostatisticalpoint of view data quality is weakerthan the original national data Qualityerrors – qualitydistortioncompared to the correctone (measuringbynumber of inhabitants) • Use of the ready-madegriddatasets! • Lessphases • Smaller data mass • Level of quality is a matter of choice • Adequatelevel of quality (?) • Dependent on use • Min. target: SUM of the wholedataset is correct • No increase of confidentialityproblemswithdoubledatasets
Nextsteps • For GEOSTAT 1A project from October - November 2011 • More tests, any volunteers? • Quality definitions concerning adequate level of quality and grid scale used • Step-by-step guidelines • LAEA dataset – filling the empty grid net with data!
ThankYou! rina.tammisto@stat.fi marja.tammilehto-luode@stat.fi