200 likes | 302 Views
Managing processing and storage for MODIS and OMI. Edward Masuoka Terrestrial Information Systems Branch. What our data systems support. Integration of science team software Development of Level 1 and Level 3 products Processing science products Product Quality Assessment
E N D
Managing processing and storage for MODIS and OMI Edward Masuoka Terrestrial Information Systems Branch
What our data systems support • Integration of science team software • Development of Level 1 and Level 3 products • Processing science products • Product Quality Assessment • Archiving/distributing data products and imagery
System Administration Tools • Depot http://www.cs.cmu.edu/~help/unix_linux/software_collections/local_depot.html Manages software under /usr/local • SATE (System Administrator Tool Environment) Handles all user accounts, Integrated with NAMS Property database (location, value description) • Problem Queue Email-based problem tracking Automated notifications from h/w are sent here • System Administrator Wiki (MediaWiki) • Procedures for managing systems and storage
Development Team Tools/Process • Subversion – Configuration management • Bugzilla – Software Bug Tracking • PCR (PGE Change Request) Process Science S/W PGE (Product Generation Executive) delivered unit test (specified by developer) science test (defined by science discipline lead) science test results reviewed by Quality Assessment Team Science disciplines and Science Team leader review If approved put into production
MODAPS H/W Architecture Distribution Database Server Production Database Server Black Diamond Ethernet Switch Web, ftp and proxy servers Head Nodes (Ingest and Staging Server) Outside Network Archive Servers Fast Storage High Volume Storage Compute Servers
Moving processing resources between Clusters • Operations monitors level of use on clusters with Ganglia • Extreme Networks Black Diamond switches enable compute servers to be shifted between clusters supporting activities as needed to meet demand in different areas • Software on compute servers can be quickly reconfigured via Depot and sync from Subversion repository
Databases are the foundations for processing and distribution • Processing databases track files on production systems and processing jobs • LAADS database supports search/order and custom product generation • Processing and distribution are decoupled • Searches don’t impact production rates or delivery to end-users via ftp push • Processing at 100x doesn’t impact product searches or post-processing on LAADS
Current Evaluations • Running iRODS + FUSE as replacement for NFS for presenting the all_data tree (/Collection /Mission/Product/…/granule) to ftp users • Prevents ftp from hanging if storage node is down • Running PGEs in the Nebula Cloud • Need to understand cost with respect to computing, storage and network bandwidth • May be used to handle peak demand
Finish in 2011 • Migrate all servers to CentOS Linux • Migrate LAADS database to PostgreSQL • Databases migrated for most production systems • Single set of security plans for MODIS and OMI
Quality Assessment Teamhttp://landweb.nascom.nasa.gov/cgi-bin/QA_WWW/newPage.cgi
Quality Assessment • Global browse • Images of Daily and Multi-day products • Golden Tiles (9 tiles over key land cover types) • Browse images • Time Series Plots allow comparisons between different years and reprocessing campaigns • Tools written in C interfaced to ENVI facilitate manipulation and assessment of MODIS standard products
Acronyms ENVI GIS and image processing COTS s/w FUSE File System in User Space iRODS integrated Rule-Oriented Data System NFS Network File System LAADS Level 1 and Atmosphere Archive and Distribution System MODIS Moderate-resolution Imaging Spectroradiometer OMI Ozone Monitoring Instrument