280 likes | 426 Views
FINESTION-2003-2006 a complete dataset of routine air ion measurements 2003 – 2006 Hannes.Tammet@ut.ee in co-operation with: Urmas Hõrrak, Kaupo Komsaare, Pasi P. Aalto, Petri Keronen, and Sander Mirme Pühajärve 2007. What is included?.
E N D
FINESTION-2003-2006a complete dataset of routine air ion measurements 2003 – 2006 Hannes.Tammet@ut.ee in co-operation with:Urmas Hõrrak, Kaupo Komsaare, Pasi P. Aalto, Petri Keronen, and Sander Mirme Pühajärve 2007
What is included? Neutral aerosols are not presented in the distribution version of the dataset
Amount of data 283 164 diurnal records, mostly 24×6 = 144 measurements per day (some variables 48 or 24 measurements per day) . Total about 40×106 values (inlc. less than 10% of missing codes). Unzipped file 146 958 411 bytes, about 3.8 bytes per value, Zipped file 52 megabytes, about 1.4 bytes per value. Format: DD (DataDiurna) diary.
Preprocessing of data • BSMA (single-channel scanning) data: • The records with BSMA noise index ≥ 50 were deleted as unreliable(about 15% of measurements was lost in Hyytiälä and 16% in Tartu). • The measurements made with a period of 15 min were converted to the10-minute period using interpolation by means of the DD data manager. • The values of RH% recorded by BSMA were corrected so that the nighttime maximums do not exceed 100%. • The fraction concentrations were converted to the values of distribution functions dn / d(log d) and dn / d(log Z) assigned to the centers of the fractions in the logarithmic scale of diameter or mobility. • The indices of technical diagnostics were replaced with the integral parameters: cluster ion concentration (Z > 0.56 cm2V−1s−1), intermediate ion concentration (Z = 0.032 − 0.56 cm2V−1s−1), cluster ion average mobility and air polar conductivity.
Tahkuse AIS(multichannel instrument) data: • The physically unbelievable measurements were marked as missing. • The records with too many missingmeasurements were deleted . • The measurements recorded every 5 minutes were converted to the time step of 10 minutes. • If a missing value was located between known values then the gap was filled using a sophisticated interpolation method. • The data was complemented with the integral parameters: cluster ion (Z > 0.56 cm2V−1s−1), intermediate ion (Z = 0.032 − 0.56 cm2V−1s−1), and large ion (Z = 0.0006 − 0.032 cm2V−1s−1) concentrations; cluster ion average mobility and air polar conductivity. • The data was transformed to the dn / d(log d) and dn / d(log Z) values so that a decade is logarithmically uniformly divided into 8 fractions. Thus the first 16 fractions are just the same as in the BSMA measurements.The mobility was converted to the particle size considering the measurements of air pressure and temperature.
Included quantities Symbol Quantity T:C Air temperature near the station Tg:C Ground temperature near the station Tgr:K/m Gradient of temperature Tc:C Chimney temperature near the station RH% Relative humidity P:mb Air pressure near the station Wind:m/s Wind velocity at standard height Wmax:m/s Wind 5-minute maximum at standard height Wind:deg Wind direction Prec:mm/h Precipitation intensity Illum:lux Solar illuminance Rad:W/m2 Total insolation sun+sky UVB:W/m2 UV radiation 310-320 nm Vis:km Visibility DR:uSv/h Gamma radiation dose rate NOx:ug/m3 Concentration of NOx (1 ppb is converted to 191 ug/cm3) SO2:ug/m3 Concentration of SO2 (1 ppb is converted to 2.66 ug/cm3) O3:ug/m3 Concentration of O3 (1 ppb is converted to 2.0 ug/cm3)
Symbol Unit Quantity B-noise cm-3 BSMA noise index RC+ cm-3 Tahkuse index of spectra roughness for positive clusters RN+ cm-3 Tahkuse index of spectra roughness for positive nanoparticles RP+ cm-3 Tahkuse index of spectra roughness for positive particles RC- cm-3 Tahkuse index of spectra roughness for negative clusters RN- cm-3 Tahkuse index of spectra roughness for negative nanoparticles RP- cm-3 Tahkuse index of spectra roughness for negative particles NC+:cm-3 1/cm3 Concentration of positive cluster ions (Z > 0.5 cm2/Vs) NC-:cm-3 1/cm3 Concentration of negative cluster ions (Z > 0.5 cm2/Vs) NN+:cm-3 1/cm3 Concentration of positive nanometer ions (Z = 0.03-0.5 cm2/Vs) NN-:cm-3 1/cm3 Concentration of negative nanometer ions (Z = 0.03-0.5 cm2/Vs) NP+:cm-3 1/cm3 Concentration of positive particle ions (Z = 0.0003-0.03 cm2/Vs) NP-:cm-3 1/cm3 Concentration of negative particle ions (Z = 0.0003-0.03 cm2/Vs) ZC+cm cm2/Vs Average mobility of positive cluster ions (Z > 0.5 cm2/Vs) ZC-cm cm2/Vs Average mobility of negative cluster ions (Z > 0.5 cm2/Vs) L+:fS/m fS/m Positive conductivity of air L-:fS/m fS/m Negative conductivity of air
z+01 cm-3 dN/d(log Z) of positive ions at Z = 2.74 cm2/Vs z+02 cm-3 dN/d(log Z) of positive ions at Z = 2.05 cm2/Vs z+03 cm-3 dN/d(log Z) of positive ions at Z = 1.54 cm2/Vs z+04 cm-3 dN/d(log Z) of positive ions at Z = 1.15 cm2/Vs z+05 cm-3 dN/d(log Z) of positive ions at Z = 0.87 cm2/Vs z+06 cm-3 dN/d(log Z) of positive ions at Z = 0.65 cm2/Vs z+07 cm-3 dN/d(log Z) of positive ions at Z = 0.487 cm2/Vs z+08 cm-3 dN/d(log Z) of positive ions at Z = 0.365 cm2/Vs z+09 cm-3 dN/d(log Z) of positive ions at Z = 0.274 cm2/Vs z+10 cm-3 dN/d(log Z) of positive ions at Z = 0.205 cm2/Vs z+11 cm-3 dN/d(log Z) of positive ions at Z = 0.164 cm2/Vs z+12 cm-3 dN/d(log Z) of positive ions at Z = 0.115 cm2/Vs z+13 cm-3 dN/d(log Z) of positive ions at Z = 0.0866 cm2/Vs z+14 cm-3 dN/d(log Z) of positive ions at Z = 0.0649 cm2/Vs z+15 cm-3 dN/d(log Z) of positive ions at Z = 0.0487 cm2/Vs z+16 cm-3 dN/d(log Z) of positive ions at Z = 0.0365 cm2/Vs z+17 cm-3 dN/d(log Z) of positive ions at Z = 0.0274 cm2/Vs z+18 cm-3 dN/d(log Z) of positive ions at Z = 0.0205 cm2/Vs z+19 cm-3 dN/d(log Z) of positive ions at Z = 0.0154 cm2/Vs z+20 cm-3 dN/d(log Z) of positive ions at Z = 0.0115 cm2/Vs z+21 cm-3 dN/d(log Z) of positive ions at Z = 0.0087 cm2/Vs z+22 cm-3 dN/d(log Z) of positive ions at Z = 0.0065 cm2/Vs z+23 cm-3 dN/d(log Z) of positive ions at Z = 0.00487 cm2/Vs z+24 cm-3 dN/d(log Z) of positive ions at Z = 0.00365 cm2/Vs z+25 cm-3 dN/d(log Z) of positive ions at Z = 0.00274 cm2/Vs z+26 cm-3 dN/d(log Z) of positive ions at Z = 0.00205 cm2/Vs z+27 cm-3 dN/d(log Z) of positive ions at Z = 0.00154 cm2/Vs z+28 cm-3 dN/d(log Z) of positive ions at Z = 0.00115 cm2/Vs z+29 cm-3 dN/d(log Z) of positive ions at Z = 0.00087 cm2/Vs z+30 cm-3 dN/d(log Z) of positive ions at Z = 0.00065 cm2/Vs
d+01 cm-3 dN/d(log D) of positive ions at D = 0.49 nm d+02 cm-3 dN/d(log D) of positive ions at D = 0.65 nm d+03 cm-3 dN/d(log D) of positive ions at D = 0.87 nm d+04 cm-3 dN/d(log D) of positive ions at D = 1.15 nm d+05 cm-3 dN/d(log D) of positive ions at D = 1.54 nm d+06 cm-3 dN/d(log D) of positive ions at D = 2.05 nm d+07 cm-3 dN/d(log D) of positive ions at D = 2.74 nm d+08 cm-3 dN/d(log D) of positive ions at D = 3.65 nm d+09 cm-3 dN/d(log D) of positive ions at D = 4.87 nm d+10 cm-3 dN/d(log D) of positive ions at D = 6.49 nm d+11 cm-3 dN/d(log D) of positive ions at D = 8.66 nm d+12 cm-3 dN/d(log D) of positive ions at D = 11.6 nm d+13 cm-3 dN/d(log D) of positive ions at D = 15.4 nm d+14 cm-3 dN/d(log D) of positive ions at D = 20.5 nm d+15 cm-3 dN/d(log D) of positive ions at D = 27.4 nm d+16 cm-3 dN/d(log D) of positive ions at D = 36.5 nm d+17 cm-3 dN/d(log D) of positive ions at D = 48.7 nm d+18 cm-3 dN/d(log D) of positive ions at D = 65.0 nm
Station QuantityDays Min Ave Max Sigma Hyyt NC+:cm-3 1298 27 737.9 1788 225.3 Hyyt NC-:cm-3 1298 -481 721.3 2383 239.8 Hyyt NN+:cm-3 1285 -869 16.4 1032 47.1 Hyyt NN-:cm-3 1285 -804 21.5 6046 93.9 Hyyt ZC+cm1298 0.85 1.445 2.04 0.139 Hyyt ZC-cm1298 0.94 1.638 2.24 0.155 Tartu NC+:cm-3 750 -143 397.1 1618 175.6 Tartu NC-:cm-3 750 -164 303.4 4317 160.3 Tartu NN+:cm-3 745 -997 30.1 1933 88.9 Tartu NN-:cm-3 745 -880 36.9 7421 121 Tartu ZC+cm 750 0.02 1.434 2.59 0.086 Tartu ZC-cm 750 0.12 1.581 3.6 0.122 Tahk NC+:cm-3 1103 0 436.4 1200 161.2 Tahk NC-:cm-3 1100 0 370.3 1200 153.7 Tahk NN+:cm-3 1103 -30 114.4 1199 117.1 Tahk NN-:cm-3 1102 -30 98.5 1199 136.1 Tahk NP+:cm-3 1102 0 1371.5 8982 819 Tahk NP-:cm-31103 0 1389.8 8984 830.6 Tahk ZC+cm 1100 0.6 1.419 2.1 0.081 Tahk ZC-cm 1094 0.6 1.563 2.08 0.113
Station Quantity Days Min Ave Max Sigma Hyyt d-01 1298 -1565 1293.9 4952 568.7 Hyyt d-02 1298 -7720 1974.9 6060 603.4 Hyyt d-03 1298 -741 1730.7 10912 813.3 Hyyt d-04 1298 -1085 707.7 4857 628.8 Hyyt d-05 1298 -1186 93.4 6824 182.6 Hyyt d-06 1298 -816 37.0 11016 193.0 Hyyt d-07 1298 -2376 37.4 11344 207.3 Hyyt d-08 1298 -3160 28.0 7784 157.8 Hyyt d-09 1298 -4504 19.8 7560 157.7 Hyyt d-10 1298 -5568 15.6 10200 282.7 Tartu d-01 750 -384 580.9 5936 280.7 Tartu d-02 750 -72 640.1 6296 305.4 Tartu d-03 750 -640 693.5 7192 375.1 Tartu d-04 750 -672 424.9 9240 328.5 Tartu d-05 750 -1248 135.5 12776 209.7 Tartu d-06 750 -1240 46.9 16576 202.5 Tartu d-07 750 -3640 41.4 14312 198.3 Tartu d-08 750 -3904 43.3 9536 191.1 Tartu d-09 750 -6800 50.3 7360 259.2 Tartu d-10 750 -8000 51.0 14272 475.6
Station Quantity Days Min Ave Max Sigma Tahk d-01 1100 -563 507.7 2372 230.0 Tahk d-02 1100 -531 929.2 3592 382.2 Tahk d-03 1100 -448 888.5 3512 404.0 Tahk d-04 1100 -900 395.8 3919 279.4 Tahk d-05 1103 -416 187.2 10461 289.1 Tahk d-06 1102 -679 149.3 2812 188.4 Tahk d-07 1102 -503 76.4 2501 121.0 Tahk d-08 1102 -840 98.3 2489 158.6 Tahk d-09 1102 -792 152.2 3082 246.5 Tahk d-10 1102 -852 223.0 3741 371.1 Tahk d-11 1103 -991 318.5 13703 576.1 Tahk d-12 1103 -894 423.0 10357 431.1 Tahk d-13 1103 -827 650.3 11369 501.1 Tahk d-14 1103 -962 995.1 13466 677.6 Tahk d-15 1103 -955 1464.8 19793 978.2 Tahk d-16 1103 -951 2286.7 18459 1426.4 Tahk d-17 1103 -845 3111.7 28679 2021.9 Tahk d-18 1103 0 3893.4 40320 2866.8
How to use? Empty folders
tolerable percent of errors how many headerlines delimiter (tab) base year time pattern time shift from UT, s(Helsinki winter time) variable name in table code of a missing value station factor table/diary quantity name in diary time regime
Result Double-click
tolerable percent of errors how many headerlines delimiter (tab) base year time pattern time shift (s)(Helsinki winter time) variable name in table missing code station factor table/diary quantity name in diary time regime
Why DataDiurna? A rectangular table can be conveniently processed and analyzed with common software like MS Excel (if the number of rows < 65536). Different quantitiesare often saved with different time steps andmany cells in atectangular table will be empty. If a table holds a large complex dataset then it is composed mostly of the missing value codes like–999. An alternative is a set of different tables joined into a database. Common database systems, e.g. MS Access, are optimized for the business applications. The time structure and the specifity of environmental measurements is not sufficiently exploited in these systems. Another alternative is the diary format well known in hand-written measurement records. This alternative is applied in DataDiurna, which is optimized for managing of environmental measurements. An additional advantage: the rules of DataDiurna force the user to include minimum explanations into the dataset, e.g. the data cannot be saved without indicating how much the time is shifted from UT.
Anoutlettable Anoutlettable A source table Asourcetable Commondiary Asourcetable NB: DataDiurna is not a replacement for Excel1) to manage large data: choose DataDiurna,2) to analyze the tables: choose Statistica, Excel, etc.
Excerpt from a diary: 20041110,Tartu.m,Rad:W/m2,hm,2,2,2,2,2,2,7,20,28,32,39,33,18,8,2,2,2,2,2,2,2,2,2,2 20041110,Tartu.m,Wind:m/s,hm,3.8,3.8,3.9,4.4,4.9,4.9,4.4,2.4,1.8,1.8,2.5,2.1,2,1.9,0.7,1.7,2,1.5,2,1,1.2,1.1,1.3,1.1 20041111,Hyyt,L+:fS/m,m10,9.6,9.1,8.7,8.9,9.1,9.4,10.2,9.8,9.6,10.3,10.9,11.4,11.6,12.2,12.5,11.8,11.9,12.2,12.7,12.5,12.1,11.9,11.3,11,11.6,11.5,11.3,10.9,11.2,11.5,11.5,11.3,11,10.7,11.1,11.5,11.1,11,,,,10.1,10.1,10,9.9,10.1,10.2,10.2,10,10.3,10.8,11.4,11.2,11.1,11.4,11.4,11.3,11.2,11.7,12,11.5,12.2,12.9,12.7,12.9,13.1,13.1,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, 20041111,Hyyt,Rad:W/m2,m30,0,0,0,0,0,0,0,0,0,0,0,0,1,1,2,3,5,4,7,15,24,23,24,28,15,10,3,1,0,0,0,0,0,0,-1,0,0,0,0,0,0,-1,-1,0,0,0,0,0 Time regime Date Quantity Station Here are missing data
DataDiurna is omnivorous: The time stamp and the numbers can be written as you data provider likes, e.g. following data rows, where the time stamp is highlighted, are considered equal Time 13:25 Date 21.05.2005 23.1,-17,?,1.6e-19,2783597+003 2005-05-21 13:25:00,23.1,-17,x,1.60-19,0.2783597e3 DOY=141.559, measurements: +23.10 -17 -999 278.3597 20050521132500) 23.1;-17;-999;2,783,597e-4 The time is automatically converted into UT when importing a source table to the diary and to any time zone when exporting an output table from the diary. The rows of the source table may be non-uniformly distributed in time: DataDiurna can interpolate and average the data during the import and export. The code of missing value may be individual for every variable and may be numerical (e.g. –999) or non-numerical (e.g. ?). etc... etc... etc... ....... Warning: The data manager DD2007T.exe is still a beta-version.
The basic version of the datasetFINESTION-2003-2006 was composed froma large number of source tableswith different time stepsfrom 10 minutes to 1 hour
The working version of the datasetFINESTION-2003-2006 is located in the computerof the user Thus:Every user can create extended personal version of the dataset importing additional private data
How to get? The DataDiurna data manager is free for unlimited distribution. The dataset ATMEL2007A is free for unlimited distribution. Both they can be retrieved from the page: http://ael.physic.ut.ee/tammet/dd/ The dataset FINESTION-2003-2006 is free but available only for our research group. It is not published in Internet today. Thus: Ask the memory stick or the CD with the dataset and copy the folder DD into your own computer. NB: You should have 200 MB free on your hard disk.
Thank youfor choosing DataDiurna.....hard to learn but easy to usewhen managing large complex datasets likeFINESTION-2003-2006