
OPIOM(-G) : I/O for clusters

Patrick Geoffray, UTK - University of Lyon (Aerospatiale MATRA, LHPC, INRIA, French Wrestling Federation, group for the suppression of the French military service for students)


Presentation Transcript


  1. OPIOM(-G) : I/O for clusters
     Patrick Geoffray
     UTK - University of Lyon (Aerospatiale MATRA, LHPC, INRIA, French Wrestling Federation, group for the suppression of the French military service for students)

  2. The world is changing...
     • Parallel machines are dying :
        • replaced by parallel machines without names, much cheaper, with very bad fans.
     • The application data sets are getting bigger and bigger :
        • physicists and biologists problems entropy.
        • several TBs for a DNA transcription.

  3. Cluster computing
     • Well-known recipe :
        • Commodity nodes.
        • Myrinet.
        • Linux.
        • “Cluster for dummies”.
        • Patience.

  4. Cluster IO (ScaLAPACK)
     • ScaLAPACK LU, I/O time (4 GB matrix, 8 nodes / 512 MB) :
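To see why I/O time matters for this configuration, the sizes on the slide can be worked through as a rough sketch. It assumes that "8 nodes / 512 MB" means 512 MB of RAM per node and that the matrix holds 8-byte double-precision elements; neither is stated in the transcript, and the function name `out_of_core_budget` is purely illustrative.

```python
import math

MB = 1024 ** 2
GB = 1024 ** 3


def out_of_core_budget(matrix_bytes=4 * GB, nodes=8,
                       ram_per_node=512 * MB, elem_bytes=8):
    """Compare the matrix size with the cluster's RAM.

    Assumptions (not from the slide): ram_per_node is per-node RAM,
    elem_bytes=8 corresponds to double precision.
    """
    aggregate_ram = nodes * ram_per_node          # total RAM in the cluster
    share_per_node = matrix_bytes // nodes        # matrix bytes each node owns
    # Order n of a square n x n matrix that fits in matrix_bytes.
    matrix_order = math.isqrt(matrix_bytes // elem_bytes)
    return {
        "aggregate_ram": aggregate_ram,
        "share_per_node": share_per_node,
        "matrix_order": matrix_order,
        # Strict comparison: the OS and workspace also need memory.
        "fits_in_ram": matrix_bytes < aggregate_ram,
    }


if __name__ == "__main__":
    for key, value in out_of_core_budget().items():
        print(key, value)
```

Under these assumptions each node's share of the matrix equals its entire RAM, so the factorization has to stage data through the disks, which is where the I/O time comes from.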

  5. How to improve cluster I/O ?
     • Fiber Channel (FC) :
        • Remote reads as local reads (100 MB/s).
        • Significantly increases the price of the cluster.
     • Parallel file systems :
        • PVFS, GPFS.
        • Put the nodes “on their knees”.
        • Reading a 1 GB file takes 40 % of a Pentium III 800 MHz.

  6. A proposition : OPIOM
     • OPIOM : Off-Processor IO with Myrinet.
     • Transfers data from SCSI disks to the Myrinet board and sends it to a remote node, bypassing the host.
     • Works with Linux, supports any local file system and any SCSI controller.
     • Implemented with BIP and GM.

  7. Linux SCSI sub-system (diagram; surviving labels: Application, Hardware)

  8. Normal I/O vs OPIOM
     • Normal I/O :
        • 2 copies over the PCI bus.
        • 4 memory copies.
        • Buffer used in RAM.
        • Processor used for control and data.
        • Synchronous.
     • OPIOM :
        • 1 copy over the PCI bus.
        • No memory copy.
        • No buffer used.
        • Processor used for control.
        • Asynchronous.
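The contrast on this slide can be illustrated from user space with a loose analogy. This is not OPIOM itself, which moves data between the SCSI disk and the Myrinet board: the sketch below only compares a staged read-then-send loop, where every byte passes through a process buffer, with Python's `socket.sendfile()`, which hands the copy to the kernel via `os.sendfile()` where the platform supports it. The helper names (`send_with_user_buffer`, `send_with_offload`, `recv_exact`, `demo`) are hypothetical, and a local socket pair stands in for the network.

```python
import os
import socket
import tempfile


def send_with_user_buffer(path, sock, chunk_size=64 * 1024):
    """Staged path: every chunk is copied into a user-space buffer first."""
    sent = 0
    with open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)    # kernel -> user-space buffer
            if not chunk:
                break
            sock.sendall(chunk)           # user-space buffer -> kernel
            sent += len(chunk)
    return sent


def send_with_offload(path, sock):
    """Offloaded path: the kernel moves the data when os.sendfile() is
    available; socket.sendfile() falls back to plain send() otherwise."""
    with open(path, "rb") as f:
        return sock.sendfile(f)


def recv_exact(sock, n):
    """Read exactly n bytes from a stream socket (or fewer on EOF)."""
    data = bytearray()
    while len(data) < n:
        part = sock.recv(n - len(data))
        if not part:
            break
        data.extend(part)
    return bytes(data)


def demo(size=1024):
    """Send a small temporary file both ways over a local socket pair."""
    payload = os.urandom(size)
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(payload)
        path = tmp.name
    results = {}
    try:
        for name, sender in (("staged", send_with_user_buffer),
                             ("offload", send_with_offload)):
            a, b = socket.socketpair()
            try:
                sent = sender(path, a)
                results[name] = (sent, recv_exact(b, size) == payload)
            finally:
                a.close()
                b.close()
    finally:
        os.unlink(path)
    return results


if __name__ == "__main__":
    print(demo())
```

The payload is kept small so that it fits in the socket buffer without a concurrent reader; a real transfer would need the receiver to drain the socket while the sender runs.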

  9. Some quick results

  10. Some other quick results

  11. What will it be good for ?
     • Finish my PhD.
     • Save money.
     • High performance video server.
     • Out-of-core computation.
     • MPI-IO.
     • Parallel file systems.
     • High performance NFS for clusters.
