150 likes | 166 Views
Learn about the NIH Biowulf Cluster, a cutting-edge supercomputing resource available to all intramural scientists since 1999. Managed by CIT, it's one of the world's largest biomedical clusters, offering exceptional price performance and scale. Explore its architecture, applications across domains like genomics and proteomics, and recent storage enhancements. Discover how it supports over 70 publications annually and drives groundbreaking research in computational biology.
E N D
Biowulf: 10 Years of Large-scale Computing at the NIH Steven Fellini Scientific Computing Branch, Division of Computer System Services CIT, NIH
The NIH Biowulf Clusterhttp://biowulf.nih.gov • Central scientific supercomputing resource managed by CIT • Operational since 1999 • Funded through NIH Management Fund • Available to all NIH intramural scientists • Used by 19 ICs in 2008 • Among the largest biomedical clusters in the world • Value to the NIH: price/performance, economy of scale, unique resource
NIH Biowulf Cluster Architecture Fileservers Core network switch Login node Network switches Compute nodes 3
Application Domains on Biowulf Sequence Analysis Blast, EMBOSS, Iprscan, MFOLD… Genome Assembly Phred/Phrap/Consed, MIRA, Velvet… Linkage Analysis PLINK, Mach, Fastlink, Genehunter… Phylogenetic Analysis PAUP, Phylip, PAML… Molecular Dynamics NAMD, Charmm, GROMACS… Proteomics OMSSA, X!Tandem, Inspect… Mathematics/Statistics R, Matlab, Mathematica, SAS… Image Analysis FSL, AFNI, Huygens, Imaris… Structural Biology Rosetta++, Xplor-NIH… Computational Chemistry Gaussian, GAMESS… 11
FY2009: Focus on Storage • Add 200-400 TB. • Re-architect storage from single to 3-tier. • High performance parallel file servers. • Goal: provide supercomputing-scale storage.
NIH Biowulf FY2008 CPU Utilization by IC (total: 21,070,667 hours) Number of Jobs by IC (total: 671,739 jobs)