170 likes | 288 Views
New Capabilities in QosCosGrid Middleware for Advanced Job Management, Advance Reservation and Co- allocation of Computing Resources. B. Bosak, P. Kopta , K. Kurowski , M. Mamonski , T. Piontek Poznan Supercomputing and Networking Center. Cracow Grid Workshop 8-th November 2011.
E N D
New Capabilities in QosCosGrid Middleware for Advanced Job Management, Advance Reservation and Co-allocationof Computing Resources B. Bosak, P. Kopta, K. Kurowski, M. Mamonski, T. Piontek Poznan Supercomputing and Networking Center Cracow Grid Workshop 8-th November 2011
Introduction • Advance Reservation • Co-allocation • Large-scale parallel applications • Cross-cluster MPI and ProActive • Workflows • Multiscale simulations
QCG for Parallel Applications • MulticlusterOpenMPI and ProActive • Hybrid applications, e.g. OpenMPI/OpenMP • Multiscale, cross-cluster applications based on the MUSCLE framework. • Application consisting of groups of processes with different resource requirements • Topology aware scheduling: • by QCG basing on application requirements • by application basing on topology discovery
QCG for Workflow Applications • Workflows based on direct acyclic graphs (DAG) • Task may be triggered by statuses of proceeding tasks (e.g. some task may be started when the proceeding task is in a state “Running”) • Multi dimensional parameter sweep experiments (as a part of workflow)
Advance Reservation and Co-allocation • AR is a mechanism offering execution of applications in a specified timeslots • Main use case in cross-cluster application execution, where co-allocation of resources is required • QCG creates co-allocation based on parameters specified by users: it may use not only resource requirements, but also requested start time, end time or duration
QCG-Broker Grid domain meta-scheduling framework Deals with load-balancing and scheduling of cross-cluster jobs Provides consisted WebService interface to the Grid; JobProfile XML-based language is used as a job description format Interacts directly with cluster level services (QCG-Computing, QCG-Notification, gridFTP, …)
QCG-Computing The key component of the cluster domain; Provides WebService interface to various DRMs – integration based on DRMAA (e.g. PBS Pro, LoadLeveler, GE, Torque/Maui); Compliant with OGF HPC Basic Profile Specification (JSDL as a job description language, BES interface); Offers methods for creation and management of advance reservations; Many plugins for authentication, authorization and accounting.
QCG-Notification • Its main function in QCG system is brokering asynchronous notifications between the QCG-Computing and QCG-Broker services • Implementation of brokered version of WS-Notification standard; features: • Advanced two-level filtering based on topics and content of the notification messages • Pull and push styles of distributing notification messages • HTTP/HTTPS and XMPP transport protocols
QCG Science Gateways and Tools Nano portal – advanced web-based portal dedicated for nanotechnologists (Abinit, NAMD, Quantum Espresso) QCG-Icon – lightweight desktop interface to QCG (MATLAB) QCG-Mobile – mobile access to QosCosGrid services (Android, JME)
Deployments • QCG is deployed at 4 production sites in PL-Grid (PSNC, Cyfronet AGH, TASK, WSNC): • proxy certificates, • LDAP grid-mapfile generation, • BAT accounting, • Nagiosprobes, • RPM packages. • NEL, the application from Quantum Chemistry written by prof. JacekKomasa was adapted to cross-cluster execution on top of QosCosGrid. Tests were performed on the PL-Grid infrastructure. • Ongoing production deployments in Europe on EGI and PRACE resources (e.g. LRZ, UCL, SARA) – MAPPER project.
Summary • QosCosGrid is an alternative grid middleware • Great support for cross-cluster application execution (MPI, ProActive, MUSCLE) • Ready production deployments in PL-Grid infrastructure. • Further reading: http://www.qoscosgrid.org