1 / 15

Biowulf: 10 Years of Large-scale Computing at the NIH

Learn about the NIH Biowulf Cluster, a cutting-edge supercomputing resource available to all intramural scientists since 1999. Managed by CIT, it's one of the world's largest biomedical clusters, offering exceptional price performance and scale. Explore its architecture, applications across domains like genomics and proteomics, and recent storage enhancements. Discover how it supports over 70 publications annually and drives groundbreaking research in computational biology.

Download Presentation

Biowulf: 10 Years of Large-scale Computing at the NIH

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Biowulf: 10 Years of Large-scale Computing at the NIH Steven Fellini Scientific Computing Branch, Division of Computer System Services CIT, NIH

  2. The NIH Biowulf Clusterhttp://biowulf.nih.gov • Central scientific supercomputing resource managed by CIT • Operational since 1999 • Funded through NIH Management Fund • Available to all NIH intramural scientists • Used by 19 ICs in 2008 • Among the largest biomedical clusters in the world • Value to the NIH: price/performance, economy of scale, unique resource

  3. NIH Biowulf Cluster Architecture Fileservers Core network switch Login node Network switches Compute nodes 3

  4. Cluster Supercomputing

  5. Biowulf 1999

  6. Biowulf 2000-2002

  7. Biowulf 2003-2005

  8. Biowulf 2006-2008

  9. Biowulf 2009

  10. NIH Biowulf Monthly Usage1999-2008

  11. Application Domains on Biowulf Sequence Analysis Blast, EMBOSS, Iprscan, MFOLD… Genome Assembly Phred/Phrap/Consed, MIRA, Velvet… Linkage Analysis PLINK, Mach, Fastlink, Genehunter… Phylogenetic Analysis PAUP, Phylip, PAML… Molecular Dynamics NAMD, Charmm, GROMACS… Proteomics OMSSA, X!Tandem, Inspect… Mathematics/Statistics R, Matlab, Mathematica, SAS… Image Analysis FSL, AFNI, Huygens, Imaris… Structural Biology Rosetta++, Xplor-NIH… Computational Chemistry Gaussian, GAMESS… 11

  12. Over 70 publications in 2008

  13. FY2009: Focus on Storage • Add 200-400 TB. • Re-architect storage from single to 3-tier. • High performance parallel file servers. • Goal: provide supercomputing-scale storage.

  14. The Helix/Biowulf Systems Staff

  15. NIH Biowulf FY2008 CPU Utilization by IC (total: 21,070,667 hours) Number of Jobs by IC (total: 671,739 jobs)

More Related