So the data are in the db(s)… Now what? NSF AS
So the data are in the db(s)… Now what? G dbBadger (data pig?)
Issues • For visualization, playing, and analysis, data need to be synchronized • SQL doesn’t deal well with many situations • Crossing databases • Asynchronous data are troublesome • Data with different time formats are buggers • Synchronization, other than aggregation, is a pain • Need a tool that can request data, synchronize, and interface with other programs
dbBadger • Define the universe • Connects simultaneously to multiple databases • Select the variables • Allows any combination of time-based variables • Assumes all variables have a time component • No requirement of relationship among variables • Aggregate, using any of 3 • Union • Interpolate • Aggregate • Export results • Additional signal processing • Other software
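The three synchronization options listed above (union, interpolate, aggregate) can be illustrated with a minimal sketch. This is not dbBadger's code: the function names, the sample series, and the choice of linear interpolation and bin averaging are all assumptions made for illustration.

```python
from bisect import bisect_left
from collections import defaultdict
from datetime import datetime, timedelta

# Two hypothetical series sampled at different, unaligned times.
# Each series is a sorted list of (timestamp, value) pairs.
t0 = datetime(2005, 7, 1, 0, 0)
wtemp = [(t0 + timedelta(minutes=m), v) for m, v in [(0, 20.0), (10, 21.0), (20, 22.0)]]
oxygen = [(t0 + timedelta(minutes=m), v) for m, v in [(5, 8.0), (15, 9.0)]]

def union(a, b):
    """Keep every timestamp from both series; a missing value becomes None."""
    da, db = dict(a), dict(b)
    return [(t, da.get(t), db.get(t)) for t in sorted(set(da) | set(db))]

def interpolate(series, t):
    """Linearly interpolate a series at time t; return None outside its range."""
    times = [ts for ts, _ in series]
    if not times or t < times[0] or t > times[-1]:
        return None
    i = bisect_left(times, t)
    if times[i] == t:
        return series[i][1]
    (t1, v1), (t2, v2) = series[i - 1], series[i]
    frac = (t - t1) / (t2 - t1)  # timedelta / timedelta gives a float
    return v1 + frac * (v2 - v1)

def aggregate(series, origin, width):
    """Average the values that fall into fixed-width bins starting at origin."""
    bins = defaultdict(list)
    for t, v in series:
        k = (t - origin) // width  # timedelta // timedelta gives an int
        bins[origin + k * width].append(v)
    return [(start, sum(vs) / len(vs)) for start, vs in sorted(bins.items())]

rows = union(wtemp, oxygen)                                # all timestamps, gaps as None
mid = interpolate(wtemp, t0 + timedelta(minutes=5))        # value between two samples
binned = aggregate(wtemp, t0, timedelta(minutes=20))       # 20-minute means
```

Union keeps everything and leaves gaps, interpolation fills a value at a requested time, and aggregation reduces each series to a common bin width, so the three trade completeness against alignment in different ways.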
dbBadger – Other Features • Graphics interface or command line • Runs w/in Matlab or stand-alone • Freely distributable • Savable state for easy reruns
1. Locations 2. Tables 3. Fields 4. Selections 5. Date range 6. Get data 7. Additional functions
4. Selections 7a. Aggregate 7b. Save
Interaction with other software [Diagram: dbBadger instances connected to databases (Oracle in Madison, Oracle in Taiwan, Access on desktop) and to a model] • Pass: • Config file • New date • Output location
Issues • Speed • Easily beats SQL • High-resolution data slow • Standardization • Measurement naming standards • Routing w/in database • Interoperability • Other software • Server mode • Emphasis • Currently, location • Future, measurement variable
Data Retrieval Speed • 1 month HIRES WTEMP, Trout Lake • Retrieval time: 15 sec for ~200,000 records • 1 month HIRES WTEMP, YYL • Retrieval time: 15 minutes
Measurement Defs. Server • Time • Integrated • MM:DD:YY:HH:SS • Separated • NTL • Year • DayOfYear • Military • Temperature • Dissolved oxygen • Winkler • Sensor • YSI Model 58 • Greenspan • Optical • etc. [Diagram: Database generator, Database, dbBadger]
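To make the integrated versus separated time formats concrete, here is a minimal sketch of normalizing both to a single timestamp. It assumes the separated form stores the military time as an integer HHMM and that the integrated form is a text string in the layout given by the hypothetical `fmt` argument. Neither detail is specified on the slide, and the function names are illustrative only.

```python
from datetime import datetime, timedelta

def from_separated(year, day_of_year, military):
    """Build a datetime from year, 1-based day of year, and HHMM military time.

    Assumes military time is an integer such as 1430; a value of 2400
    rolls over to midnight of the following day.
    """
    hours, minutes = divmod(military, 100)
    return datetime(year, 1, 1) + timedelta(days=day_of_year - 1,
                                            hours=hours, minutes=minutes)

def from_integrated(text, fmt="%m/%d/%Y %H:%M:%S"):
    """Parse a single-field timestamp; fmt is an assumed layout."""
    return datetime.strptime(text, fmt)

a = from_separated(2005, 182, 1430)            # day 182 of 2005 is July 1
b = from_integrated("07/01/2005 14:30:00")
same_instant = (a == b)
```

Once every source is converted to one representation, variables from different databases can be compared or synchronized on the same time axis.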
Database • What variables are measured? • What are their locations (3D coordinates)?