1 / 21

Project DASH: Securing Direct MySQL Database Access for the Grid

Project DASH: Securing Direct MySQL Database Access for the Grid. D. Malon, E. May, D. Ratnikov, A. Vaniachine Argonne National Laboratory M. Vranicar, J. Weicher PIOCON Technologies XV International Conference on Computing in High Energy and Nuclear Physics T.I.F.R., Mumbai, India

carlyn
Download Presentation

Project DASH: Securing Direct MySQL Database Access for the Grid

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Project DASH: Securing Direct MySQL Database Access for the Grid D. Malon, E. May, D. Ratnikov, A. Vaniachine Argonne National Laboratory M. Vranicar, J. Weicher PIOCON Technologies XV International Conference on Computing in High Energy and Nuclear Physics T.I.F.R., Mumbai, India February 13-17, 2006

  2. Databases and Grids • In addition to petabytes of file-based event data, high energy physics applications require access to non-event data (detector conditions, calibrations, etc.) stored in relational databases • Databases also play a critical role in grid middleware: file catalogues, monitoring, etc. • Crosscutting the computational grid infrastructure, a database hyper-infrastructure emerges

  3. Project DASH • As grid computing technologies mature, development must focus on database and grid integration • New technologies are required to bridge the gap between data accessibility and the increasing power of grid computing used for distributed event production and processing • The Database Access for Secure Hyperinfrastructure (DASH) project is funded by the DOE Small Business Innovative Research Program to build and test secure high-performance database access technology for distributed computing www.piocon.com/DASH.php A project of PIOCON Technologies, Inc and Argonne National Laboratory

  4. Database Access on the Grid Two different architectures: • A separate middleware server does the grid authorization: • OGSA-DAI: SOAP/XML+ XML binary extensions • Spitfire (EDG WP2): SOAP/XML text-only data transport • Perl DBI database proxy (ALICE): SQL data transport • Oracle 10g (separate authorization layer) • Grid middleware is integrated in database server process: • Instead of surrounding database with external secure middleware layers the safety features are embedded inside of the code • By pushing secure authorization into the database engine the inefficient data transfer bottlenecks are eliminated

  5. Embedded Security Approach • The embedded security approach is listed among the top ten innovations in security by the panel of experts convened by Battelle: • “The Global Cyber Net: Communications and information are the lifeblood of security. Today we enjoy a worldwide web, which is open but unsecured. In the future, we will have a global cyber net that is faster and better protected than today… Software will contain embedded safety features inside of the code rather than just surrounding it.” http://www.battelle.org/forecasts/defense.stm

  6. End-to-End Secure Transport • DASH technology bridges the gap between data accessibility and the increasing power of grid computing • To overcome database access inefficiencies inherent in a traditional middleware approach the DASH project implements secure authorization on the transport level • Pushing the grid authorization into the database engine eliminates the middleware message-level security layer and delivers transport-level efficiency of SSL/TLS protocols for grid applications • The DASH proof-of-concept prototype provides Globus grid proxy certificate authorization technologies for MySQL database access control • DASH technology brings database access efficiencies similar to the https advantages introduced in the Globus Toolkit 4.0 • The database architecture with embedded grid authorization provides a foundation for secure end-to-end data processing solutions for the grids

  7. Aspect-Oriented Programming • To avoid a brittle, monolithic system DASH uses an aspect-oriented programming approach • By localizing Globus security concerns in a software aspect, DASH achieves a clean separation of Globus Grid Security Infrastructure dependencies from the MySQL server code • During the database server build, the AspectC++ tool automatically generates the transport-level code to support a grid security infrastructure • www.aspectc.org

  8. Automatic Code Generation

  9. AOP is the Next ‘Big Thing’ A 2001 paper on Aspect Oriented Programming is on Top 10 Downloads from ACM’s Digital Library • Paper by our collaborators from Illinois Institute of Technology ATLAS experience with AOP was first reported at the previous CHEP04

  10. Testing New Functionalities • Prototype servers built with DASH technology are being tested in ANL, BNL, CERN and U Geneva • We thank to • Jason Smith (BNL) • Yuri Smirnov (BNL) • Frederik Orellana (U Geneva) Among the new functionalities are • Check for the proxy expiration time • Host name checking (to reject impersonation)

  11. Packaging Challenge • Initial response from our beta-testers suggested that because of the globus gsi libraries dependencies the preferred distribution would be the static build • However test showed that static builds works best on the platforms (Linux distributions) very close to those that of the build machine • We experienced unexpected sensitivities to the minor variations in the glibc library version • We are now addressing that issue by developing the dynamic build that will have the static globus gsi and openssl libraries built in

  12. Scalability Challenge • Large-scale world-wide distributed simulations performed by the ATLAS Collaboration show steady progress in grid computing • The chaotic nature of opportunistic grid computations results in variations in daily production rates • Database services capacities should be adequate for peak demand

  13. Why Dynamic Deployment? • The high level of sharing of computational resources achieved on grids result in increased fluctuations in demand for database services, because of the chaotic nature of shared resource availability • Static services deployment require over-capacity • Opportunistic production on non-LCG sites requires database services deployment on-demand • To provide on-demand database services capability for Open Science Grid, the Edge Services Framework activity builds the DASH mysql-gsi database server into the virtual machine image, which is dynamically deployed via Globus Virtual Workspaces

  14. Edge Services • Services executing on the edge of the public and private network ATLAS CMS CE SE CDF Guest VO Site Compute nodes and Storage nodes • See CHEP06 contribution id # 214 http://indico.cern.ch/contributionDisplay.py?contribId=214&sessionId=7&confId=048

  15. Synergistic Collaboration CMS & ATLAS collaborate in OSG ESF Activity http://www.opensciencegrid.org/esf To achieve the ESF proof-of-concept milestone: • The first ESF VM was deployed by CMS • The first ESF service on that VM was by ATLAS: • Grid-enabled MySQL database built by the DASH project • To access the server the grid job used proxy certificate (instead of the clear-text passwords hardwired in the scripts that are distributed world-wide)

  16. Collaboration Benefits Celebrating ESF proof-of-concept milestone at Supercomputing 2005

  17. Globus Folder at SC05 http://www.globus.org/alliance/events/sc05

  18. Complementary Project • A new collaborative project with the Globus team has just started at Argonne • to grid-enable the PostgreSQL database • Both DASH and the new project target technology integration with OSGA-DAI • Please contact us if you are interested to contribute to these projects

  19. OGSA-DAI Complementarity • Neil P Chue Hong, OGSA-DAI Status SummaryThird OGSA-DAI Users Group Meeting, 6/1/2005 • Through our continued interactions with OGSA-DAI team we have established working relationships to achieve technological compatibility

  20. Additional Benefits • Direct access to database servers unleashes a broad range of vendor-specific server capabilities for data processing applications: distributed XA transactions, binary data transport, etc. • Grid proxy certificate technology opens technical opportunities to enable fine-grained delegation of rights for access control (attribute certificates) • Grid-enabled relational database server technology has the potential for application beyond the domain of high energy physics, and is of interest to bioinformatics and other data-intensive sciences

  21. DASH Outreach

More Related