1 / 20

An Introduction to DAS

An Introduction to DAS. Andy Jenkinson , EBI. Summary of Topics. What is Data Integration? Problems in Data Integration An architectural overview of DAS Brief History of DAS. What is Data Integration. All These are Data Integration. Reading some papers so you can write a report

gore
Download Presentation

An Introduction to DAS

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. An Introduction to DAS Andy Jenkinson, EBI

  2. Summary of Topics What is Data Integration? Problems in Data Integration An architectural overview of DAS Brief History of DAS

  3. What is Data Integration

  4. All These are Data Integration Reading some papers so you can write a report Exploring some database websites so you can learn about a topic Downloading some data from different databases so you can analyse it Downloading some data from different databases so you can combine it with your own

  5. All These are Data Integration Reading some papers so you can write a report Exploring some database websites so you can learn about a topic Downloading some data from different databases so you can analyse it Downloading some data from different databases so you can combine it with your own

  6. Data Integration • “Automatic” data integration • pulling in data from different locations • processing it • creating a resource derived from the data • done via computers, not humans • e.g. creating/updating a data warehouse

  7. Warehouse model

  8. Data Integration: like herding cats

  9. Databases are all different

  10. Databases evolve

  11. Data ages

  12. Databases are big

  13. Distributed Annotation System • Distributed • Client-Server architecture • Federation • RESTfulweb services

  14. Warehouse model

  15. DAS model

  16. Architectural Overview

  17. DAS • Databases are all different • DAS is a uniform facet of a database – always the same • Databases change their structure • when the database changes, DAS stays the same • Databases are updated • DAS data comes directly from the provider so is always fresh • Databases are big • DAS uses real-time targeted queries

  18. History Developed circa 1999 for sharing genome annotations Expanded 2004 onwards more data types better metadata addition of Registry DAS/2 project split from DAS, not backwards compatible inspired some DAS developments

  19. To Summarise… The Distributed Annotation System is… A network of biological data sources An example of federation A collection of REST web services The DAS Protocol is… An integration platform A client-server protocol An agreed standard

  20. Image Credits Flickr/muir.ceardach Flickr/HoriaVarlan Flickr/Alessandro Pinna Fotopedia/Jean-Marie Hullot listicles.com/?p=3485 Google Earth/Cnes/Spot Image Olivier H. Beauchesne

More Related