410 likes | 430 Views
Data Management Options. Dr. Merle P. Martin MIS Department CSU Sacramento. Acknowledgments. Dr. Russell Ching ( MIS Dept ) Source Materiel / Graphics Edie Schmidt ( UMS ) - Graphic Design Prentice Hall Publishing (Permissions)
E N D
Data Management Options Dr. Merle P. Martin MIS Department CSU Sacramento
Acknowledgments • Dr. Russell Ching (MIS Dept) Source Materiel / Graphics • Edie Schmidt (UMS) - Graphic Design • Prentice Hall Publishing (Permissions) • Martin, Analysis and Design ofBusiness Information Systems, 1995
Agenda • Why manage data? • Definitions • Typical problems • Data Administrator • The DBMS • Distributing data
Why Manage Data? • Delayed output (paycheck) • Locate a resource • Where is the stock item stored? • Where does the employee work?
Why Manage Data? • Make resource decisions • Should we turn account over to collection agency? • Should we send customer letter asking why he / she hasn’t shopped here in 6 months? • Should we give employee overtime?
Why Manage Data? • Determine resource status • Is there enough stock in warehouse to satisfy this customer’s order? • How much should I order? • What is the value of this resource? • balance sheet
Definitions • File: resource inventory: • Material • People • Employees, customers • Funds • Customer balances • Accounts Payable
Definitions • Data Organization • Bit / byte • Character • Field • Record • File • DBMS
Database Data Hierarchy for Stereos to Go { 12345 Smith John A 123 Main Street Sacramento CA 95819 File 12345 Smith John A 123 Main Street Sacramento CA 95819 12345 Smith John A 123 Main Street Sacramento CA 95819 Record Field Smith Character 10110011 (Byte) Bit 1
Definitions • Views: • Physical - how stored • Logical - how viewed and used • Volatility: - % records that change • Immediacy: rapidity of change
Storage Problems • Redundancy • Accuracy • Security • Lack of data sharing • Report inflexibility • Inconsistent data definitions • Too much data • information overload
Data Administrator • Clean up data definitions • Control shared data • Manage distributed data • Maintain data quality
Clean Up Definitions • Synonyms / aliases • Standard data definitions • names and formats • Date of Birth (AJIS) • mm/dd/yy (courts) • dd/mm/yy (corrections) • Data Dictionary • COBOL
Control Shared Data • Local - used by one unit • Shared - used by two or more activities • Impact of proposed program changes on shared data • Program-to-data element matrix • Control or clearinghouse?
Manage Distributed Data • Geographically dispersed • whether shared data or not • Different levels of detail • different management levels
Aggregate Infrequent Quite old External Future Wide Low StrategicPlanning Management Currency Frequency of Use Time Horizon Required Accuracy Source Scope Level of Aggregation Control Operational Control High Internal Detailed Historical Well defined Very frequent Highly current
Maintain Data Quality • Put owners in charge of data • verify data accuracy and quality • Fairbanks Court example • Who owns the data?
Issue Should the Data Administrator control ALL data, or just that data that crosses organizational boundaries? WHAT DO YOU THINK?
The DBMS Data Base Management System: software that permits a firm to: • centralize data • manage them efficiently • provide accessto applications • such as payroll, inventory
DBMS Components • Data Design Language (DDL) • Data Manipulation Language (DML) • Inquiry Language (IQL) • Teleprocessing Interface (TP) • Martin, Figure 16-5
Designers Teleprocess DDL Database DML IQL Interface Update Retrieve Applic. Software Programmers End-Users
IQL LANGUAGE IQL SELECT EMP-ID, EMP-FIRST-NAME, EMP-LAST-NAME, EMP-YTD-PAY FROM EMPLOYEE WHERE EMP-ID=1234 . Data Base
3-level Database Model • James Martin • Sprague / McNurlin, Fig. 7-2, pg. 207
External Level (1) • User views (logical) • By application program • Each has unique view • Schema / subschema
Schema and Subschemas Physical Database DBMS Software DBMS Overall View of the Data Schema Individual Views Subschema Subschema Subschema User User User User User User
Enterprise Level (2) • Under control of Data Administrator • DBMS • Implementation data removed • passwords • report views
Physical Level (3) • Schema • Pointers (e.g., next record) • Flags (e.g., record frozen)
Traditional Data Models • Hierarchical - one parent • Network • more than one parent • student to course, major • Relational (tables)
Hierarchical Model Project 1 Dept. A Dept. B Dept C 1 2 3 4 5 6 Employees
Network Model John Smith Jane Smith Savings Mortgage Checking
Account Number First Name Middle Initial Last Name . . . Credit Limit Relational Customer Order Number Order Date Account Number Date Shipped Orders Order Number Line Item Number Product Code Quantity Line Items Product Code Product Name Price Unit Manufacturer Code Products Manufacturer Code Manufacturer Name Manufac(turer)
Object-oriented DBMS An object is: • a piece of data PLUS • procedures performed on data PLUS • attributes describing data PLUS • relationship between object and other objects
Distributed Data • Goals: • move processing as close to users as possible • allow several applications to run simultaneously on same data
Distributed Types • Fragmented • distribute data without duplication • users unaware of where data located • Segmented • data duplicated • one site has master file • problem with data synchronization
Why Distribute? • Save money • offload DB processes to less expensive machines (PCs) • Lower telecommunications costs • DB closer to users • Decrease dependence on a single computer manufacturer
Why Distribute • Move control closer to owner • Increased DBMS scope • more varied types of data • link at workstations • Permit storage of multimedia data
True Distributed DB • Local autonomy (ownership) • No reliance on central site • Continuous operations • not affected by another site • Data transparency • Independence
Independence • Fragmentation • Replication • Hardware • Software • Networks • Database
Problems With Distributed Databases • Security • Shared data • simultaneous update • Complexity • Need telecommunications infrastructure
Issue Is data in your organization totally distributed? • How? • Should it be? • Why or why not?
Points to remember • Definition • Typical problems • Role of Data Administrator • The DBMS • Distributing data