1 / 12

External/Unstructured Data and the Data Warehouse

External/Unstructured Data and the Data Warehouse. External/Unstructured Data in the Data Warehouse. Several issues relate to the use and storage of external and unstructured data in the data warehouse. The frequency of availability It is totally undiscipline Its unpredictability

saad
Download Presentation

External/Unstructured Data and the Data Warehouse

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. External/Unstructured Data and the Data Warehouse

  2. External/Unstructured Data in the Data Warehouse • Several issues relate to the use and storage of external and unstructured data in the data warehouse. • The frequency of availability • It is totally undiscipline • Its unpredictability • Many methods to capture and store unstructure information such as: • Near-line Storage • Create two stores of unstructured data

  3. Meta Data and External Data • Meta data is vital because through it external data is registered, accessed, and controlled in the data warehouse environment. The importance of meta data is best understood by noting what it typically encompasses: • Document ID • Date of entry into the warehouse • Description of the document • Source of the document • Date of source of the document • Classification of the document • Index words • Purge date • Physical location reference • Length of the document • Related references

  4. Storing External/Unstructured Data • External data and unstructured data can actually be stored in the data warehouse if it is convenient and cost-effective to do so. • To store external data and unstructured data requires considerable resources • By associating external data and the unstructured data with a data warehouse, the external data and the unstructured data become available for all parts of the organization, such as finance, marketing, accounting, sales, engineering and so forth

  5. Modeling and External/Unstructured data • What is the role of the data model and external data. See below (figure 8.6)

  6. Archiving External data • Every piece of information – external or otherwise – has a useful lifetime. • Once past that lifetime, it is not economical to keep the information. An essential part of managing external data is deciding what the useful lifetime of the data is.

  7. Comparing Internal data to external data • One of the most useful things to do with external data is to compare it to internal data over a period of time. The comparison allows management a unique perspective. For instance, being able to contrast immediate and personal activities against global activities and trends allow an executive to have insights that simply not possible elsewhere.

More Related