130 likes | 213 Views
OPIOM(-G) : I/O for clusters. Patrick Geoffray UTK - University of Lyon (Aerospatiale MATRA, LHPC, INRIA, French Wrestling Federation, group for the suppression of the French military service for students). The world is changing. Parallel machines are dying :
E N D
OPIOM(-G) : I/O for clusters Patrick Geoffray UTK - University of Lyon (Aerospatiale MATRA, LHPC, INRIA, French Wrestling Federation, group for the suppression of the French military service for students)
The world is changing... • Parallel machines are dying : • replaced by parallel machines without names, much cheaper, with very bad fans. • The application data sets are getting bigger and bigger : • physicists and biologists problems entropy. • several TBs for a DNA transcription.
Cluster computing • Well-known recipe : • Commodity nodes. • Myrinet. • Linux. • “Cluster for dummies”. • Patience.
Cluster IO (ScaLAPACK) • ScaLAPACK LU, I/O time(4 GB matrix, 8 nodes / 512 MB) :
How to improve cluster I/O ? • Fiber Channel (FC): • Remote reads as local reads (100 MB/s). • Increase significantly the price of the cluster. • Parallel file systems : • PVFS, GPFS. • Put the nodes “on their knees”. • Read 1 GB file : 40 % of a Pentium III 800 MHz
A proposition : OPIOM • OPIOM : Off-Processor IO with Myrinet. • Transfer data from SCSI disks to Myrinet board and send it to a remote node, bypassing the host. • Works with Linux, support any local file system, any SCSI controller. • Implemented with BIP and GM.
Linux SCSI sub-system Application Hardware
Normal I/O : 2 copies over the PCI bus. 4 memory copies. Buffer used in RAM. Processor used for control and data. Synchronous. OPIOM : 1 copy over the PCI bus. No memory copy. No buffer used. Processor used for control. Asynchronous. Normal I/O vs Opiom
What will it be good for ? • Finish my Phd. • Save money. • High performance video server. • Out-of-core computation. • MPI-IO. • Parallel file-systems. • High performance NFS for cluster.