30 likes | 139 Views
IU OREChem Summary Slides. Marlon Pierce, Geoffrey Fox, Sashikiran Challa. IU’s ORE-CHEM Pipeline. Harvest NIH PubChem for 3D Structures. Convert Gaussian Output to CML. Convert CML to RDF->ORE- Chem. Convert PubChem XML to CML. Submit Jobs to TeraGrid with Swarm.
E N D
IU OREChem Summary Slides Marlon Pierce, Geoffrey Fox, SashikiranChalla
IU’s ORE-CHEM Pipeline Harvest NIH PubChem for 3D Structures Convert Gaussian Output to CML Convert CML to RDF->ORE-Chem Convert PubChem XML to CML Submit Jobs to TeraGrid with Swarm Insert RDF into RDF Triple Store Goal is to create a public, searchable triple store populated with ORE-CHEM data on drug-like molecules. Convert PubChem XML to CML Convert CML to Gaussian Input Conversions are done with Jumbo/CML tools from Peter Murray Rust’s group at Cambridge. Swarm is a Web service capable of managing 10,000’s of jobs on the TeraGrid. We are developing a Dryad version of the pipeline.
Swarm-Grid • Swarm considers traditional Grid HPC cluster are suitable for the high-throughput jobs. • Parallel jobs (e.g. MPI jobs) • Long running jobs • Resource Ranking Manager • Prioritizes the resources with QBETS, INCA • Fault Manager • Fatal faults • Recoverable faults Swarm-Grid Standard Web Service Interface Request Manager QBETS Web Service Resource Ranking Manager Data Model Manager Fault Manager Hosted by UCSB User A’s Job Board Local RDMBS Job Queue Job Distributor MyProxy Server Grid HPC/Condor pool Resource Connector Hosted by TeraGrid Project Condor(Grid/Vanilla) with Birdbath Grid HPC Clusters Grid HPC Clusters Condor Cluster Grid HPC Clusters Grid HPC Clusters