190 likes | 427 Views
Introducing the CRAY SV1. CRAY SV1-128 SuperCluster. Agenda. Silicon Graphics Supercomputer Roadmap Introducing the Cray SV1 Cray SV1 hardware overview Multi-Streaming Processor Cray SV1 software overview. SV2 Vector. T90. SV1. SN2 Origin 3. J90. T3E. Origin.
E N D
Introducing theCRAY SV1 CRAY SV1-128 SuperCluster
Agenda • Silicon Graphics Supercomputer Roadmap • Introducing the Cray SV1 • Cray SV1 hardware overview • Multi-Streaming Processor • Cray SV1 software overview
SV2 Vector T90 SV1 SN2 Origin 3 J90 T3E Origin Five Year Product Roadmap 1997 1998 1999 2000 2001 2002 Common DSM Architecture SV1e SN1 Origin 2 UNICOS UNICOS/mk IRIX
Vector Supercomputer Roadmap • CRAY T90 • Flagship vector supercomputer • Performance leader • Engineering, production and quality initiatives continue • CRAY SV1 • Scaling CMOS & UNICOS • 1 TFLOPS & 1 TByte in 1998 • Supercomputer performance & environment • Outstanding reliability • CRAY SV2 • 10+ GFLOPS per CPU, scaling to 50+ TFLOPS • In 2001 • CMOS commodity leverage • DSM & IRIX
Introducing the CRAY SV1 • 4th generation CMOS & DRAM vector supercomputer from Cray • 4 Gflops processors • cluster of tightly coupled Shared Memory Processing Systems • configurations scale up to 1 Tflop / 1 Tbyte • highly competitive price / performance • compatible with Cray UNICOS vector applications • first shipment August 1998
CRAY SV1 Highlights • Three technology breakthroughs: • Caches combined with vectors • Pipe selectable processors • 1 GFLOPS peak with 2 pipes • 4 GFLOPS peak with 8 pipes • Scaling to 1,024 processors using vector CPUs • And 1 TByte of DRAM memory • Using 32-way flat memory SMP nodes
CRAY SV1A-1 (4x4 Backplane) Mem Mem Mem Mem CPU CPU CPU CPU CRAY SV1-1 (8x8 Backplane) Mem Mem CPU CPU CPU CPU Mem Mem Mem Mem Mem Mem CPU CPU CPU CPU Single Node CRAY SV1 Configurations • CRAY SV1A-1: • Up to 3 four GFLOPS processors • Up to 16 one GFLOPS processors • 25.6 GByte/sec memory bandwidth • 2 to 16 GByte of central memory • 1 to 4 GigaRing channel adaptors • CRAY SV1-1: • Up to 6 four GFLOPS processors • Up to 32 one GFLOPS processors • 51.2 GByte/sec memory bandwidth • 4 to 32 GByte of central memory • 1 to 8 GigaRing channel adaptors
Minimum Configuration = Four Nodes (A Row of 4 -- 32 Processor Cabinets) 32 PEs 32 PEs 32 PEs 32 PEs I/O I/O MPN GigaRings SWS SuperClustering The CRAY SV1:Basic Building Block = 32 Processor SMP Node • Vector Clustering Experience: • Over 5 years of providing production clusters to customers • Several customer sites • Each with multiple years of usage
VA CA PV VB VA CA PV VB VA CA PV VB VA CA PV VB SV1 Processor Overview • CPU Module • High performance vector processors • 250MHz • 2 vector pipes per processor • 4 GFLOPS per module • BMM Functional Unit • 1 GigaRing IO channel per module • Cache • Cache for both vector and scalar data • 256KBytes • 4-way associative • 1 word (8 byte) per cache line • Air cooled, surface mount design • Field upgradable
Multi-Streaming Processor 4 SV1 CPUs tightly coupled Based on optimal memory performance Coupled using shared registers
CPU CPU CPU CPU CPU CPU CPU CPU CPU CPU CPU CPU CPU CPU CPU CPU Pipe Selectable Processor Concept • In one CRAY SV1-32 cabinet: • 32 -- 1 GFLOPS, 2 vector pipe processors or; • 4 -- 4 GFLOPS, 8 vector pipe, processors, plus 16 1 GFLOPS processors or; • 6 -- 4 GFLOPS, 8 vector pipe processors, plus 8 1GFLOPS processors MultiStream-CPU MultiStream-CPU Memory Cache=1MB
CRAY SV1 Software Overview • CRAY PVP compatibility • YMP Architecture compatibility • CRAY UNICOS operating system • Source and binary compatible with the YMP, C90 and J90 products • Immediate data center integration • large (~500) optimized applications-catalog • Outstanding throughput • Supports popular programming models • PVM, MPI, SHMEM, Autotasking, OpenMP, F-- • Many tools for clustering • Several new enhancements planned
UNICOS • Initially delivered in 1986 • A 64 bit OS for 13 years • A parallel OS for the highest performance computers in the world • Reliable and robust with an MTTI of 100,000+ hours on the CRAY J90 series systems • Full Data Center functionality • Security (B1) – Full tape support • Accounting – ESCON, BlockMux, SCSI • 64 bit filesystem – Data Migration • State-of-the-art programming environment
SV1 SuperCluster Capabilities • Production quality OS, compilers, tools & batch systems • Single system image • Within a 32 processor SMP node • High speed, transparent file access (NFS, DFS, BDS) • Single system sign-on (via DCE) • Mass Storage • DMF Data Migration Facility • Data migration from cluster nodes to central tape resource • Job Scheduling • NQE Network Queuing Environment --- access to cluster from entire network • Parallel Programing • Cray Message Passing Toolkit (MPI) and PVM • Autotasking, OpenMP • Array Services • Cluster wide status information and command execution
SuperCluster Enhancements planned • Improved enviroment for parallel programming • Increase IO bandwidth • GigaRing OS Bypass to reduce interconnect latency • Scheduled Transport protocal • SHMEM and MPI • Improved administration and user tools • Single point of administration • Accounting, UDB, installation, startup/shutdown, backup/recovery, etc • Enhanced system recovery and redundancy • Distributed HSM • Client/Server DMF
CRAY SV1 Applications • Supports the large robust UNICOS ISV set of applications, e.g.: • Structural Dynamics • Chemistry • CFD • Environmental • Electronics • Petroleum • Provides compatibility with customer in-house application codes • Source and binary compatible with J90, C90 and YMP products
SV1 Product Summary • 4 GFLOPS processors • Customers can select either 2 or 8 pipe operation • Mixed processor sizes allow customers to optimize their code performance • Highly competitive price/performance • Low cost design and low power CMOS and DRAM • Low life-cycle costs in power, cooling and maintenance • Up to 1 TFLOP peak and 1TByte memory capacity • Excellent for throughput needs • Peak capability is 20 times the powerful CRAY T932 • Memory size is up to 125 times the largest CRAY T90 • Proven UNICOS Operating System • Runs UNICOS applications • Proves Silicon Graphics commitment from to the Very High End Supercomputing