
The Next Big Thing Collaborations in the Life Sciences

The Next Big Thing: Collaborations in the Life Sciences. Gordon K Springer, University of Missouri. Internet2 Spring Member Meeting, April 24, 2012. Developing a 100G TestBed for Life Science Collaborations.


Presentation Transcript


  1. The Next Big Thing: Collaborations in the Life Sciences. Gordon K Springer, University of Missouri. Internet2 Spring Member Meeting, April 24, 2012

  2. Developing a 100G TestBed for Life Science Collaborations
     • Taking advantage of existing UM/SURA dark fiber to create a research 100G pathway from St Louis to Kansas City via Columbia along Interstate 70
     • Using InCommon Federated Identities for authentication via Shibboleth; authorizations occur via an autonomous Entitlement Server to provide fine-grained authorizations to/from service providers
     • Developing distributed resource sharing by mapping needs to resources eligible for assignment depending on geography and resource availability
     • At 100G some distributed computing latency issues can likely be overcome
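The third bullet of slide 2 (mapping needs to eligible resources by geography and availability) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Resource` fields, the cluster names, the capacities, and the rough positions of the three cities along I-70 are hypothetical, and the slides do not describe the actual assignment policy.

```python
from dataclasses import dataclass
from typing import List, Optional

# Rough, illustrative positions in km along I-70 measured from St Louis.
# These numbers are assumptions, not figures from the presentation.
SITE_KM = {"St Louis": 0.0, "Columbia": 200.0, "Kansas City": 400.0}


@dataclass
class Resource:
    name: str               # hypothetical cluster name
    site: str               # one of the keys in SITE_KM
    free_cores: int         # currently available CPU cores
    free_storage_tb: float  # currently available storage in TB


def assign(request_site: str, cores: int, storage_tb: float,
           pool: List[Resource]) -> Optional[Resource]:
    """Return the nearest resource that can satisfy the request, or None."""
    # Availability filter: keep only resources with enough free capacity.
    eligible = [r for r in pool
                if r.free_cores >= cores and r.free_storage_tb >= storage_tb]
    if not eligible:
        return None
    # Geography: prefer the eligible resource closest to the requester.
    return min(eligible,
               key=lambda r: abs(SITE_KM[r.site] - SITE_KM[request_site]))


pool = [
    Resource("hypothetical-stl-cluster", "St Louis", 16, 5.0),
    Resource("hypothetical-columbia-cluster", "Columbia", 128, 50.0),
    Resource("hypothetical-kc-cluster", "Kansas City", 256, 200.0),
]
choice = assign("St Louis", cores=64, storage_tb=10.0, pool=pool)
print(choice.name if choice else "no eligible resource")
```

A real scheduler would likely weigh network load and queue depth as well; distance alone is used here only to keep the sketch short.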

  3. Problem Set
     • High-throughput sequencing is producing enormous quantities of data, which needs a storage infrastructure organized as a private cloud
     • Resources for collecting, analyzing and using the data need to be provisioned according to demand, which includes making the processing applications network-aware
     • Security, measurement and analysis tools need to be developed to run efficiently at 100G across a regional multi-cluster environment using OpenFlow and other specialized protocols
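To give a sense of why link speed matters for sequencing-scale data, the following back-of-the-envelope sketch computes idealized transfer times. The 10 TB dataset size and the `efficiency` parameter are assumptions chosen for illustration; the slides give no dataset sizes, and real throughput depends on protocol overhead, storage speed and host tuning.

```python
def transfer_seconds(size_tb: float, link_gbps: float,
                     efficiency: float = 1.0) -> float:
    """Idealized time to move size_tb terabytes over a link_gbps link.

    efficiency is the assumed fraction of line rate actually achieved
    (1.0 means the full nominal rate, which is rarely reached in practice).
    """
    bits = size_tb * 1e12 * 8  # decimal terabytes to bits
    return bits / (link_gbps * 1e9 * efficiency)


# Hypothetical 10 TB sequencing data set over three link speeds.
for gbps in (1, 10, 100):
    seconds = transfer_seconds(10, gbps)
    print(f"{gbps:>3} Gbps: {seconds:>8.0f} s ({seconds / 3600:.2f} h)")
```

Under these assumptions, the same data set that needs roughly 22 hours at 1 Gbps moves in 800 seconds at 100 Gbps, which is the kind of gap that makes multi-site sharing of sequencing data more practical.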

  4. Where Does the Data Come From? A High-Volume EST Pipeline For Discovery
     • High-throughput DNA sequencing at MU DNA Core Facility.
     • Swine female reproductive tissues and embryos are removed at various times of gestation.
     • Sequence data analyzed at MU on high-speed systems.
     • Iterate microarray & other experiments to focus on gene discovery.
     • Gene annotations quickly obtained by access to other databases through the MU Internet2 high-speed network.
     • The goal is improved efficiency, quality and profitability.
     • Patterns of gene expression analyzed with microarrays to reveal mechanisms that contribute to reproduction efficiency.

  5. Big Data/Big Science Collaboratory (map labels: I70, SL, Col, KC)

  6. The GPN Network

  7. University of Missouri System Grant Writers Professional Development Sessions

  8. UM InterCampus Network with 100G Pathway along I70 (diagram labels: HtSeq, LSC, Internet2, 100G, UM Portion of MOREnet)

  9. OpenFlow And Other Protocols (diagram: Site A and Site B, joined through the cloud, each showing the same stack)
     • Login
     • Multi-Site Sharing Protocols: CIFS, NFS, HTTP, FTP, SCP
     • HPC Infiniband Network
     • Management: Central Administration, Monitoring, File Mgmt
     • HPC Storage, Linux Cluster, GPGPUs (plus SMP Servers at one site and Next-Gen at the other)
     • Machine Room GbE Network
     • Availability: Data Migration, Replication, Backup
     • Visualization & Display
     • Research Data Store
     • Campus GbE Network
     • Lab (Research & Clinical)
     • Instruments (Core Service), Instrument (Research), Instruments (Medical)

  10. OpenFlow at the packet level (diagram labels: OpenFlow-enabled Commercial Switch with Normal Software, Normal Datapath, Secure Channel and Flow Table; Controller PC; User; Storage Cloud; Analysis Engines; NetStorage; NetProcessors). Adapted from: The Stanford Clean Slate Program http://cleanslate.stanford.edu
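The flow table named on slide 10 can be pictured with a toy model. In OpenFlow, a switch matches packet header fields against flow entries and applies the action of the best match; a packet that matches nothing is handed to the controller over the secure channel. The sketch below is a simplified illustration, not the OpenFlow wire protocol: the class names, field names such as `dst_ip`, and action strings such as `output:2` are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class FlowEntry:
    match: Dict[str, object]  # header fields the packet must equal
    action: str               # illustrative action label, e.g. "output:2"
    priority: int = 0         # higher priority wins when several entries match
    packets: int = 0          # per-entry packet counter


class FlowTable:
    def __init__(self) -> None:
        self.entries: List[FlowEntry] = []

    def add(self, entry: FlowEntry) -> None:
        self.entries.append(entry)

    def lookup(self, packet: Dict[str, object]) -> str:
        """Return the action for a packet, or defer to the controller."""
        candidates = [
            e for e in self.entries
            if all(packet.get(k) == v for k, v in e.match.items())
        ]
        if not candidates:
            # Table miss: the switch asks the controller what to do.
            return "send_to_controller"
        best = max(candidates, key=lambda e: e.priority)
        best.packets += 1
        return best.action


table = FlowTable()
table.add(FlowEntry({"dst_ip": "10.0.0.5"}, "output:2", priority=10))
table.add(FlowEntry({"dst_ip": "10.0.0.5", "tcp_dst": 22}, "output:3", priority=20))

print(table.lookup({"dst_ip": "10.0.0.5", "tcp_dst": 22}))
print(table.lookup({"dst_ip": "10.0.0.5", "tcp_dst": 80}))
print(table.lookup({"dst_ip": "10.0.0.9", "tcp_dst": 80}))
```

In a real deployment the controller would respond to a miss by installing a new entry, so later packets of the same flow are handled in the switch datapath.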

  11. Collaborative Framework

  12. Bridging the Gaps (Some are very Large)

  13. Using Middleware Tools for VO Collaboration (diagram: numbered steps 1 to 14 connecting Administrative User, User, WAYF, Identity Provider, Handle Service, Attribute Authority, Identity Directory, Service Provider, SHIRE, SHAR, Resource Manager, Resource, VO Entitlement Client App, Entitlement Server and ES DB; messages exchanged: User Command, Credentials, Handle, Attributes, Entitlement, Command, YES/NO)

  14. Simplified Design: uses public key encryption for authentication and privacy (diagram: User, Identity Provider, Service Provider and Entitlement Server joined by numbered steps 1 to 6, starting with "1: request by URL or command" and ending with "Page or computational results")
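The simplified design on slide 14 can be walked through with a toy model: the user authenticates at the Identity Provider, the Service Provider resolves the resulting handle to an identity, and the Entitlement Server answers YES or NO for the requested command. This is a sketch under stated assumptions, not Shibboleth's API: all class and method names are hypothetical, the public key encryption mentioned on the slide is omitted, and the plaintext password check stands in for whatever login the real Identity Provider performs.

```python
import secrets
from dataclasses import dataclass
from typing import Dict, Optional, Set, Tuple


@dataclass(frozen=True)
class Identity:
    user: str
    institution: str


class IdentityProvider:
    """Toy identity provider: checks credentials and issues opaque handles."""

    def __init__(self, accounts: Dict[str, Tuple[str, str]]) -> None:
        self._accounts = accounts               # user -> (password, institution)
        self._handles: Dict[str, Identity] = {}

    def login(self, user: str, password: str) -> Optional[str]:
        record = self._accounts.get(user)
        if record is None or record[0] != password:
            return None
        handle = secrets.token_hex(8)           # opaque, random handle
        self._handles[handle] = Identity(user, record[1])
        return handle

    def resolve(self, handle: str) -> Optional[Identity]:
        return self._handles.get(handle)


class EntitlementServer:
    """Toy entitlement server: answers YES/NO for (user, command) pairs."""

    def __init__(self, entitlements: Set[Tuple[str, str]]) -> None:
        self._entitlements = entitlements

    def is_entitled(self, identity: Identity, command: str) -> bool:
        return (identity.user, command) in self._entitlements


class ServiceProvider:
    def __init__(self, idp: IdentityProvider, es: EntitlementServer) -> None:
        self.idp = idp
        self.es = es

    def handle(self, handle: str, command: str) -> str:
        identity = self.idp.resolve(handle)     # who is asking?
        if identity is None:
            return "authentication failed"
        if not self.es.is_entitled(identity, command):
            return "NO: not entitled"
        return f"YES: ran {command} for {identity.user}@{identity.institution}"


idp = IdentityProvider({"alice": ("example-password", "missouri.edu")})
es = EntitlementServer({("alice", "blast")})
sp = ServiceProvider(idp, es)

token = idp.login("alice", "example-password")
print(sp.handle(token, "blast"))
print(sp.handle(token, "delete_all"))
```

Keeping the entitlement decision in a separate server, as the slides describe, lets the service provider stay unaware of who is allowed to do what; it only forwards the question and acts on the YES/NO answer.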

  15. Getting Authenticated If you belong to a GPN member organization, but do not see your institution in the list, please contact your local GPN representative to request help in authenticating in this environment.

  16. Entering the VO Environment

  17. And the story continues … More Data, Resources, People & Knowledge

  18. UMBC http://umbc.rnet.missouri.edu Grant Writers Professional Development Sessions
