1 / 30

DATA

Explore. Discover. DATA. Search. Browse. Data Centers. CDIAC. Discover. SEARCH. Learn. P L O T. EXPLORE. DOE’s. A. T. A. User Facilities. Dig Deep. It's hot. What ?. Discovering Data Across the Department. The DOE Data Explorer. Jannean Elliott. The DOE Data Explorer is:

iola-reeves
Download Presentation

DATA

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Explore Discover DATA

  2. Search Browse

  3. Data Centers CDIAC

  4. Discover

  5. SEARCH

  6. Learn

  7. P L O T

  8. EXPLORE

  9. DOE’s A T A

  10. User Facilities

  11. Dig Deep

  12. It's hot

  13. What ?

  14. Discovering Data Across the Department The DOE Data Explorer Jannean Elliott

  15. The DOE Data Explorer is: • A new product • A discovery tool for non-text data • The first of three phases • A small but unique database Why does DOE need another search product? OSTI's bibliographic and full-text databases provide access to millions of technical reports, peer-reviewed journal articles, and scientific conference documents. This literature documents the results of scientific experiments, both small and large and from unique, one-time events in a particle accelerator to monitored observation stretching over decades. The data from these experiments and projects are described, interpreted, theorized about, and shown in many different ways. Most of the documents do not, however, indicate where the raw and supporting data actually reside. Researchers working in their individual fields of expertise know where to find specialized data. They also interface with DOE's Data Centers, organizations that focus on collecting, processing, organizing, archiving, and providing data related to a specific scientific field such as nuclear physics or fusion energy. The DDE is a data discovery tool for those who may not be as aware of these resources.

  16. In the beginning……… Questions What data collections does DOE have? Where are they? Can I Google for them? Is there a linked list somewhere?

  17. Decisions • Terminology • Scope • Types • Indexing Level • Metadata

  18. What to call the “data”? • Background data? • Almost seemed to relegate data to a secondary position. In a paper, some of the “less important” information is the “background information.” • Basis data? • Data may be the “basis” of published, scientific literature, but the term reminds us of the old Basis databases. • Factual data? • Would we be implying that the published literature may be something other than factual? Terminology

  19. Numeric data? • Information consisting of numbers rather than text.The world of data that we are talking about has many formats/media. Numeric is only one of them. • Primary data? • Information that comes before published literatureIt’s only “primary” in a certain, limited sense. Once data has been reformatted, analyzed, re-used, massaged into a table, etc….it’s not primary (i.e. in its first form)anymore. • Raw data? • Some of the data available on the web is raw, but most of it is not. It’s been processed, organized, analyzed. “Raw” carries a specific meaning for a scientist. Terminology

  20. What to call the “data”? • Research data? • Could be data gained from research about anything: economics, sociology, novellas of the 19th century, etc. Good term but not quite specific enough • Scientific data? • Information coming from experiments – not consumer data that is based on pricing, usage, etc. Team liked this one, but decided “scientific research data” was even better • Source data? • Information directly from the “source”, the source being a scientist or a science instrument…but on the Web will it imply something else? • What to call the tool? Terminology

  21. Scope DOE involvement Online vs offline Free vs fee Active vs archived Accessible vs password-protected

  22. Indexing Level • Collections • Datasets • Data files • Records • What about a database?

  23. Data Types • “Results” data • Reference data • Posted for educational/promotional purposes • Operational data • beamline parameters, logbooks, diagrams of instruments, archives of technical “notes”; operating schedules for instruments • Project descriptions and “research thrust” items • Data-rich highlights, summaries, overviews, press releases • Published items • Poster presentations • Computer codes and toolkits

  24. Content Type Definitions • Computer Models/Simulations • Simulations that can be viewed or interacted with online or downloaded to run on a user's computer. • Figures/Plots • Collections showcasing data diagrams, charts, drawings, and data plots. May link to lists of publications in which some of this material occurs, but the figures and plots are stand-alone. • items and recognized for their own importance • Interactive Data Maps • More than images. User can change views, manipulate data • and receive entirely new sets of information. Often uses GIS tools. • Multimedia • Animations or video, live or stored, from scientific experiments. Does not include video tours of facilities, live cams of weather monitoring • sites, educational or promotional videos, etc.

  25. Numeric Files/Datasets • Data in ASCII flat files, Postscript, Excel files, or formatted into • tables presented on HTML pages. Data may be raw or processed • or both. Criteria is that the collection should have numeric data • as primary content. All other content is secondary, supporting. • Scientific Images • Static images. Not engineering drawings, not collections that • mainly feature images of machinery or buildings. Scientific • images are images taken with high-powered microscopes, • captured during an accelerator run, images from astronomy, etc. • Specialized Mix (Example = “genome” databases) • Typically included in the genome database gene sequence • data, taxonomies, links to papers, images or figures, status of • projects around the world, etc. Key factor is that the content • was uniquely designed to be a "specialized mix,” not just mixture of • “stuff” on a web page. It has structure, organization, search and • retrieval capability. How the information is put together is what • gives it meaning. The information does not exist elsewhere • except in pieces. Non-text information must be included in the mix.

  26. Metadata Collection Title Collection Creator/PI Sponsor Other Sponsors DOE Data Center DOE Scientific User Facility Other Related Organizations Content Type Subject Category Keywords Description DDE Number

  27. Checking out the content • See it now at • http://www.osti.gov/dde

  28. Discovering Data Across the Department The DOE Data Explorer Jannean Elliott

More Related