1 / 18

Distributed DBMSs

Explore the fundamentals, characteristics, advantages, and disadvantages of distributed databases, client/server systems, data warehouses, and organizational databases. Learn about data warehouse architecture, data errors, data mining, and the difference between operational databases and data warehouses.

olsson
Download Presentation

Distributed DBMSs

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Distributed DBMSs • A distributed database is a single logical database that is physically distributed to computers on a network. • Homogeneous DDBMS has the same local DBMS at each site. • Heterogeneous DDBMS has at least two sites where the local DBMSs are different.

  2. Characteristics ofDistributed DBMSs • Location transparency feels to a user as though the entire database is at their location. • Replication transparency is where the user is unaware of the behind the scenes replication of the data. • Fragmentation transparency is where a local object can be divided among the various locations on the network.

  3. Advantages of Distributed Databases • Local control of data • Increasing database capacity • System availability • Added efficiency

  4. Disadvantages of Distributed Databases • Update of replicated data • More complex query processing • More complex treatment of shared update • More complex recovery measure • More difficult management of data dictionary • More complex data design

  5. File Servers • File server contained files required by the individual workstations on the network.

  6. Client/Server Systems • Client/Server has the DBMS run on the file server, but the user sends requests for specific data, not files.

  7. Advantages ofClient/Server Systems • More efficient than file server systems. • Possibility of distributing work among several processors. • Workstations need not be as powerful. • The user doesn’t need to learn any special commands or techniques.

  8. Advantages ofClient/Server Systems • Easier for users to access data from a variety of sources. • Provides greater level of security than file server systems. • Powerful enough to replace expensive mainframe applications.

  9. Data Warehouses • A subject-oriented, integrated, time-variant, nonvolatile collection of data in support of management’s decision-making process.

  10. Data Warehouse Architecture

  11. Data Warehouse Structure

  12. Why build a Data Warehouse? • To speed up the writing and maintaining of queries and reports by technical personnel • To more easily query and report data, on a regular basis, from multiple transaction processing systems and/or from external data sources • To provide a repository of transaction processing system data that contains data over a span of time

  13. Why build a Data Warehouse • To address security concerns • To provide a repository of "cleaned up" transaction processing systems data that can be reported against and that does not necessarily require fixing the transaction processing systems

  14. Data errors • Incomplete • Missing records/fields • Incorrect • Wrong codes (or incorrect pairing of codes) • Incomprehensible • Multiple fields in one field • Many to many relationships • Spreadsheet and word-processing files

  15. Data Errors • Inconsistent • Use / meaning of codes • Business rules • Timing • Use of attributes • Use of nulls/spaces

  16. Data Mining • Identify the goal • Assemble the relevant data • Choose your analysis methods • Decide which software tool is best for implementing the method • Run the analysis • Decide how to implement the results

  17. Operational Database organized about a transaction supports OLTP (record keeping) thousands of users accesses few records at a time response time in seconds Data Warehouses organized about a subject supports OLAP (decision support) few hundred users accesses many records at a time response times in minutes Organizational Databases

  18. Operational Database primitive & detailed smaller (current) highly normalized (many tables with few columns) dynamic (continuous updates online) Data Warehouses derived & summarized larger (historical) de-normalized (few tables with many columns) periodic (batch update) Organizational Databases

More Related