460 likes | 471 Views
NPP Atmosphere PEATE Preliminary Design Review. Part 2: Science Processing System Scott Mindock. Presented by the Atmosphere PEATE Team Space Science and Engineering Center University of Wisconsin-Madison 10 April 2007. Definitions. UML - Unified Modeling Language
E N D
NPP Atmosphere PEATEPreliminary Design Review Part 2: Science Processing SystemScott Mindock Presented by the Atmosphere PEATE TeamSpace Science and Engineering CenterUniversity of Wisconsin-Madison10 April 2007
Definitions UML - Unified Modeling Language Use Case - Captures requirements at functional level Activity - Describes the steps of a use case Package - Show software structure and dependencies Class - Show software structure and dependencies
Sample Activity Diagram Like Flowchart Ovals = Activity Rectangle = Data Action A and B Decoupled Dot = start Circle = end
Presentation Format Key Features and Flow Requirements
Design Goals Maintainability - System lifetime spans years Reusability - Subsystems and design patterns Testability - Subsystem dependencies are managed Scalability - Design must scale to larger and smaller systems
Design Methodology Learn from others - IDPS, Clusters, Ocean PEATE, Disney Parkwide Leverage existing proven technologies - Use successful open source Prototype key features - Demonstration Projects Model System - UML Activity Model, Java System Decouple Major Systems - Manage dependencies, Define interfaces
Software Functional Grouping • APSPS Software Subsystems • Ingest • Data Management System • Algorithm Rule Management • Computation Resource Grid • Software Development Support • Configuration Management • Science Investigation Support • Product Creation
APSPS, Atmosphere PEATE Science Processing System DMS: Data Management System Stores Data CRG: Computational Resource Grid Processes Data ARM: Algorithm Rule Manage Applies Product Rules to Data ING : Ingest System Brings Data into System
ING Supports pull ingests Supports push ingests Verification functions localized Catalog entry function localized Notices missing or late data
APSPS, Atmosphere PEATE Science Processing System DMS - Data Management System CRG - Computational Resource Grid ARM - Algorithm Rule Manager ING - Ingest System
DMS Provides well-defined interface DMS is autonomous Provides storage Provides catalog
DMS Executive Purges old data Watchdogs Catalog Server Watchdogs Delivery Servers Advertises catalog
DMS Catalog Catalog of files in DMS Health of DMS Information stored in database Apache Server
DMS Delivery Supports raw access protocols (e.g. FTP,RSYNC,HTTP) Support intelligent access (e.g. OpenDAP, ADDE)
APSPS, Atmosphere PEATE Science Processing System DMS - Data Management System CRG - Computational Resource Grid ARM - Algorithm Rule Manager ING - Ingest System
CRG Provides well-defined interface Accepts job request Provides Job Status Monitor Job State Scalable Testable
CRG Host One or more per node Retrieves data Retrieves algorithm Runs Algorithm Saves results
APSPS, Atmosphere PEATE Science Processing System DMS - Data Management System CRG - Computational Resource Grid ARM - Algorithm Rule Manager ING - Ingest System
ARM Assigns jobs to CRG Monitors data in DMS Monitors the status of jobs in CRG Applies rules to data Generates notifications Volatile logic lives here
ARM Manager Match Rules and Algorithms to available data. Allow users to start processing. Monitor job progress. Implements volatile logic.
ARM Monitor Monitors the DMS Finds new data. Finds deleted data. Uses metadata encoded in filenames. Provides scalability.
APSPS, Atmosphere PEATE Science Processing System DMS - Data Management System CRG - Computational Resource Grid ARM - Algorithm Rule Manager ING - Ingest System
Configuration Management Subversion Bugzilla Apache Tomcat SSEC - Document Management Process
Configuration Management Provides versioned access to software Provides defect-tracking facilities Used for source code Used for test scripts Used for binaries All open source
Lunch Content
Product Generation Support Algorithm Ingest - Importing, Developing, Wrapping Algorithm Qualification - Preparing for APSPS LEOCAT - Low Earth Orbit Cloud Algorithm Test bed
Algorithm Lifecycle Algorithm can come from anywhere. Once qualified, the algorithm can be applied from ARM.
Algorithm Ingest Algorithm entered into subversion Product created in bugzilla Algorithm is ported and wrapped Tests are created
Algorithm Qualification Write a script to execute algorithm Script manages execution environment Algorithm name, inputs and outputs entered into ARM
LEOCAT Reads in all data. Executes algorithm. Writes data.