490 likes | 1.58k Views
Vishal Bakshi Aditya Polumetla. PARAM Padma SuperComputer. Topics. Introduction Specifications Machine Architecture Microkernel Architecture PARAMNet II System Software Applications on PARAM. PARAM. PARAM – PARAllel Machine
E N D
Vishal Bakshi Aditya Polumetla PARAM Padma SuperComputer
Topics • Introduction • Specifications • Machine Architecture • Microkernel Architecture • PARAMNet II • System Software • Applications on PARAM
PARAM • PARAM – PARAllel Machine • Developed by Center for Development of Advanced Computing (C-DAC) India • PARAM Padma is the latest in the line of PARAM series after PARAM 8000, PARAM 9000 and PARAM 10000 • Introduced in 2003 • Its a next generation high performance scalable computing cluster • PARAM Padma's theoretical peak performance of complete configuration is 1.13 TFlops
Compute Nodes: 248 Power 4 1 Ghz processors 54 4-way SMP and 1 32-way SMP (Symmetric MultiProcessing) configuration OS – AIX / LINUX AIX (Advanced Interactive eXecutive) is a IBM's UNIX operating system Aggregate Memory of 0.5 TeraBytes Specifications
Network: PARAMNet II (primary) GigaByte Ethernet (backup) File Servers: 24 UltraSparc-III 900MHz processors 6 4-way SMP configuration OS – Solaris File System – QFS (Quantitative File System) Specifications continued
External Storage: Storage array 5 TeraBytes Tape Library 12 TeraBytes System Software: C-DAC's HPCC Suite of System Software It provides a high flexible software environment for the cluster Specifications continued
Cluster Personality - Data Link Provider Interface (DLPI) driver allows mapping of common network protocols to the underlying network MPP (Massively Parallel Processor) Personality - Architecture allows microkernel to be loaded on compute nodes - implements custom designed Concurrent Runtime Environment (CORE) - Standard message passing interface 1. Parallel Virtual Machine (PVM) 2. Message Passing Interface (MPI) PARAM Machine Architecture continued
Message for high MPP Supports multiple tasks with a paged virtual memory space Multiple threads of execution within each task Message based interprocess communication Communication between tasks by message passing Location independent interprocess communication supported by port abstraction PARAS Microkernel Architecture
Services: - An executive which schedules priority based threads - Location transparent interprocess communication mechanism - Simple virtual memory model - Low kernel hardware supervisor PARAS Microkernel Architecture continued
PARAS service request dispatcher routes all requests to appropriate service provides Resource Managers - Process Manager - Virtual Memory Manager - Inter-Process Communication (IPC) Manager Microkernel Architecture
Five basic abstractions supported - Tasks - Threads - Ports - Messages - Regions Microkernel Abstractions
Components of Operating Environment - Partition Manager: global resource manager and management of that partition's resources - Process Server: Spawning tasks on a node and provides remote system call interface - Microkernel File System Server: to serve UNIX file system calls - Name Server: provides port naming services to user application Operating Environment continued
Developed by C-DAC System Area Network (SAN) switch to interconnect the nodes of the system Provides a low latency and high bandwidth interconnect data rate @ 2.5 Gigabits/sec message latency of 10 µ sec switch latency of 0.5 µ sec PARAMNet II
Major components are - 1. 8 / 16-port SAN switch 2. Network Interface Card (NIC) with CCP-III (C-DAC Co-Processor) communication coprocessor 3. Application Programming Interface (C-VIPL) C-DAC's Virtual Interface Provider Library PARAMNet II continued
Non-blocking crossbar based architecture 2.5 Gbps full duplex bandwidth per port Distributed schedulers allow individual routing tables per port, allowing for any network topology More than 8 / 16 ports supported using multi-level switching Ethernet based management interface allows for remote access and control of multiple switches PARAMNet II – SAN Switch
Virtual channel based routing with 1 KB buffering per port each at input and output Interval routing scheme based on 32-bit header (16-bit routing information) Group adaptive routing based on LRU algorithm to ensure uniform bandwidth distribution in a group Point-to-point flow control with pause and resume token PARAMNet II – SAN Switch continued
Provides interface to SANSW8 (8 port ) and SANSW16 (16 port) PARAMNet II switch Supports for connection oriented and connectionless protocols Can be configured for other protocols Capable of performing I/O from paged virtual memory PARAMNet II - NIC
Support for up to 1024 connections and up to 1024 completion queues Support for different page sizes Packetization and reassembly done in hardware Error detection and recovery done in hardware Latency of 10 µ sec PARAMNet II – NIC continued
C-VIPL is an application programming interface for PARAMNet II Major feature – Thread safe implementation Supports AIX, Linux, Solaris and Windows OS PARAMNet II – C-VIPL continued
PARAMNet II network comprises of N hosts connected in non-blocking fat tree topology For more than 8 / 16 hosts multiple SANSW8 / SANSW16 are required As switch latency is low multiple levels of switches have less latency System Level Block diagram for PARAMNet II continued
PARAM Padma has 12 PARAMNet II switches connected to two-level configuration to form 64 node CLOS network CLOS Network: multi-stage network topology that provides alternate paths between input and output making it possible to eliminate the blocking that occurs in other networks Latency associated with packet routing are small for a non -blocking topology PARAM Padma and PARAMNet II
HPCC software on the PARAM Padma provides a high performance flexible software environment HPCC Software
HPCC Software consists of -program development tools - system management tools - software engineering tools Provides low overhead for communication Optimized Message Passing Interface (MPI) Parallel file system with MPI-IO interface to enable applications to scale on large cluster HPCC Software continued
C-FPS is a client-server and user level parallel file system that addresses high I/O throughput requirements The Fortran programming environment consists of - Fortran 90 compiler, - parallelizing compiler for Fortran 77/90 for SMP - integrated development environment and Fortran 77 to Fortran 90 converter HPCC Software continued
The DIViA debugging environment comprises of a coherent set of tools that help programmers in debugging parallel programs PARMON the system management tool, enables the administrator to monitor activities and resource utilization of various cluster components RMS resource management tool enables users to optimally schedule jobs across the cluster nodes HPCC Software continued
Storage is provided by Storage array and Tape library Storage Area Network PARAM Padma has a network centric storage architecture Storage capacity of 5 TB scalable to 22 TB Use of Fibre Channel Arbitration Loop (FC-AL) technology to interconnect storage subsystems Storage Architecture
Bioinformatics - stimulations of large bio molecules - analysis of large databases for studying functions of genes Computational Structural Mechanics - fracture analysis - stress analysis - visualization for structural and fluid mechanics - study of thin walled structures Applications on PARAM Padma
Seismic Data Processing - for oil and natural gas exploration Computational Chemistry - calculation of electronic structure and properties of molecules - molecular dynamics simulation Computational Fluid Dynamics - simulation of external and internal flows - simulation of hypersonic flows Applications on PARAM Padma contt
1. http://cdac.in/html/parampma.asp 2.A Microkernel Based Operating System for PARAM 9000, Mohan Ram, N et al, CDAC 1995 3. PARAM Padma – A Teraflops Computing System And High Performance Computing in India, Purohit, S.K., CDAC, 2003 4.Current State and Future Trends in High Performance Computing & Communications (HPCC) Research in India, Sinha, P.K. et al, 10th IEEE Internaltional Workshop on FTDCS 2004. 5.Overview of recent supercomputers, Aad J. van der Steen, NCF, The Hague, 1997. References