Database Technology
Prof. Hyoung-Joo Kim
Internet Database Lab
School of Computer Science & Engineering
Seoul National University
Contents
• A general survey of DBMS
• History of DBMS
• Database market share
• The current DBMS trend
• Research in IDB Lab.
What is a Database? (1/10)
• DBMS
• A software system that provides an environment for storing and retrieving massive amounts of data effectively
What is a Database? (2/10)
• A large collection of data
• Data + Programs
[Figure: many "Data" items are stored (STORE) into a Database]
What is a Database? (3/10)
• Registration and course information for the 40,000 students of Seoul National University
• 45 courses, 10 KB record per student
• 10 KB × 40,000 = 400 MB
• Others: library, health center, S-card, …
What is a Database? (4/10)
• SAT management information
• 8 KB record per student
• Year 2006: 550,000 students
• Year 2005: 570,000 students
• 8 KB × 550,000 = 4.4 GB (1 GB = 10^9 bytes)
What is a Database? (5/10)
• Mobile phone call information
• 60-byte record per call
• 39 M subscribers × 60 bytes × 5 calls/day × 365 days ≈ 4 TB
• Subscribers: 39 M in Korea (July 2006); 370 M in China (2005)
What is a Database? (6/10)
• Resident registration information
• 10 KB record per resident
• 10 KB × 47 M (47 million residents) = 470 GB, roughly 0.5 TB
What is a Database? (7/10)
• Google database
• 8 billion web pages; index management for 2 billion terms
• Usenet archive = 700 million messages × 20 KB/message = 14 TB
What is a Database? (8/10)
• Hubble Space Telescope data from Mars
• Data accumulated by 2005: over 12 TB
• Produces and transmits 3 to 5 GB of data daily
What is a Database? (9/10)
• NCBI (National Center for Biotechnology Information)
• GenBank
• Manages information on 165,000 species
• Adds 3 million new DNA sequences monthly
What is a Database? (10/10)
• Genome map of Koreans
• Venture company "Macrogen"
• SNU Medical School
• Early version: 900 GB
• Final product: 15 TB
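The size estimates on the preceding slides are simple multiplications, and they can be rechecked with a short script. This is only a sketch: it assumes decimal units (1 KB = 1,000 bytes), uses 60 bytes per call as in the mobile-phone formula, and the helper name `human` is made up for illustration.

```python
# Back-of-the-envelope sizes from the slides, in bytes (decimal units assumed).
KB, MB, GB, TB = 10**3, 10**6, 10**9, 10**12

estimates = {
    "SNU course registration": 10 * KB * 40_000,
    "SAT management (2006)": 8 * KB * 550_000,
    "Mobile phone calls (Korea, 1 year)": 39_000_000 * 60 * 5 * 365,
    "Usenet archive": 700_000_000 * 20 * KB,
}

def human(n_bytes):
    """Format a byte count using the largest fitting decimal unit (hypothetical helper)."""
    for unit, name in ((TB, "TB"), (GB, "GB"), (MB, "MB"), (KB, "KB")):
        if n_bytes >= unit:
            return f"{n_bytes / unit:.2f} {name}"
    return f"{n_bytes} B"

for label, size in estimates.items():
    print(f"{label}: {human(size)}")
```

The mobile-phone figure comes out slightly above the rounded 4 TB quoted on the slide.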
What do we do with a Database? (1/2)
• Record search
• Retrieve the math grade of the student whose SSN is "840101-12121"
• 740,000 × 5 records = 3.7 M records
• 12 ms to fetch a record and check its content
• Checking every record one by one: 3.7 M × 12 ms = 44,400 seconds = over 12 hours
• If we use a DBMS, it will take less than 0.1 sec!
• Statistical processing for the population census
• Search for purchase patterns across customer groups
• Search for correlations between genes and diseases
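The 12-hour figure follows directly from the numbers on the slide. The slide does not say how a DBMS gets under 0.1 sec; the sketch below assumes an index shaped like a B-tree with 100 keys per node, which is a hypothetical choice made only to show why a handful of fetches can replace millions.

```python
FETCH_MS = 12                    # from the slide: time to fetch and check one record
total_records = 740_000 * 5      # 3.7 M records

# Sequential scan: in the worst case every record is fetched and checked.
scan_seconds = total_records * FETCH_MS / 1000
scan_hours = scan_seconds / 3600

# Assumed index: a B-tree-like structure with 100 keys per node (hypothetical).
FANOUT = 100
height, capacity = 0, 1
while capacity < total_records:
    capacity *= FANOUT
    height += 1

# One fetch per index level plus one fetch for the record itself.
index_seconds = (height + 1) * FETCH_MS / 1000

print(f"scan:  {scan_seconds:.0f} s ({scan_hours:.1f} h)")
print(f"index: {height} levels, {index_seconds:.2f} s")
```

Under these assumptions the indexed lookup needs five fetches, which stays below the 0.1 sec quoted on the slide.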
What do we do with a Database? (2/2)
• Most (all?) computing applications use some type of database
[Figure: applications such as CRM, ERP, Data Warehouse, MIS, OLTP, and EDPS, each drawn on top of a Database]
Database Management System (DBMS) (1/3)
[Figure: a warehouse]
Database Management System (DBMS) (2/3)
[Figure: a warehouse and its warehouse keeper]
Database Management System (DBMS) (3/3)
[Figure labels: Database (on-line profile, product, customer, sale, stock); DBMS; user; Application (management of orders, management of wages, management of manager info.)]
DBMS Architecture
[Figure labels, top to bottom]
• Users: naive users, application programmers, casual users, database administrator
• Inputs: application programs, system calls, query, database scheme
• Processing: data manipulation language pre-compiler, query processor, data definition language compiler, application programs object
• DBMS: database manager, file manager
• Disk storage
SQL
• SQL: widely used commercial query language
• E.g. find the name of the customer with customer-id 192-83-7465
    select customer.customer-name
    from customer
    where customer.customer-id = '192-83-7465'
• E.g. find the balances of all accounts held by the customer with customer-id 192-83-7465
    select account.balance
    from depositor, account
    where depositor.customer-id = '192-83-7465' and
          depositor.account-number = account.account-number
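The two queries can be tried with Python's built-in `sqlite3` module. This is a minimal sketch, not the lecture's own setup: the table layouts and all sample rows are assumptions, and the hyphenated column names from the slide are written with underscores because a bare hyphen is not valid in an SQL identifier.

```python
import sqlite3

# In-memory database with an assumed schema and made-up sample rows.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customer  (customer_id TEXT PRIMARY KEY, customer_name TEXT);
CREATE TABLE account   (account_number TEXT PRIMARY KEY, balance INTEGER);
CREATE TABLE depositor (customer_id TEXT, account_number TEXT);
INSERT INTO customer  VALUES ('192-83-7465', 'Johnson'), ('019-28-3746', 'Smith');
INSERT INTO account   VALUES ('A-101', 500), ('A-201', 900), ('A-215', 700);
INSERT INTO depositor VALUES ('192-83-7465', 'A-101'),
                             ('192-83-7465', 'A-201'),
                             ('019-28-3746', 'A-215');
""")

cid = "192-83-7465"

# Query 1: name of the customer with the given id.
names = [row[0] for row in conn.execute(
    "SELECT customer.customer_name FROM customer WHERE customer.customer_id = ?",
    (cid,),
)]

# Query 2: balances of all accounts held by that customer (join via depositor).
balances = sorted(row[0] for row in conn.execute(
    """SELECT account.balance
       FROM depositor, account
       WHERE depositor.customer_id = ?
         AND depositor.account_number = account.account_number""",
    (cid,),
))

print(names, balances)
```

The customer id is passed as a parameter (`?`) rather than pasted into the query string, which is the usual way to avoid quoting mistakes.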
Major Commercial DBMS in 2006 (1/3): Oracle 10g
• Market leader
• Stability
• Mass storage capability
• Famous CEO
Major Commercial DBMS in 2006 (2/3): Microsoft
• Integration with Windows NT/XP
• PC based (Windows NT)
• Microsoft!!!
Major Commercial DBMS in 2006 (3/3): IBM
• Stability
• Mainframe
• Informix purchase
Hierarchical, Network DBMS
• Early 1970s
• IMS (IBM), System 2000 (MRI), DMS 1100 (Sperry), Total (Cincom)
• Advantage: quick data access by following links
• Drawback: applications cannot be written independently of how the data is stored and linked
Network Database example
[Figure: a root record linked to customer records, which are linked to amount records]
• Customer records: (Lowery, Maple, Queens), (Hodges, SideHill, Brooklyn), (Shiver, North, Bronx)
• Amount records: 900, 647, 801, 556, 647
• Query: What's the total balance of Mr. Shiver in Bronx?
Network DB query example
    sum := 0;
    get first customer
        where customer.name = "Shiver" and customer.city = "Bronx";
    while DB_status = 0 do
    begin
        sum := sum + customer.amount;
        get next customer
            where customer.name = "Shiver" and customer.city = "Bronx";
    end
    print(sum);
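The record-at-a-time style of this query can be imitated in Python. The sketch below is an illustration only: the `Cursor` class and its `get_first`/`get_next` methods are hypothetical stand-ins for the network DML, and the pairing of amounts with customers is an assumption, since the original figure's links are not recoverable.

```python
# Assumed data: which amounts belong to which customer is a guess.
records = [
    {"name": "Lowery", "city": "Queens",   "amount": 900},
    {"name": "Hodges", "city": "Brooklyn", "amount": 647},
    {"name": "Hodges", "city": "Brooklyn", "amount": 801},
    {"name": "Shiver", "city": "Bronx",    "amount": 556},
    {"name": "Shiver", "city": "Bronx",    "amount": 647},
]

class Cursor:
    """Hypothetical stand-in for 'get first' / 'get next'; db_status 0 means found."""

    def __init__(self, rows):
        self.rows = rows
        self.pos = -1
        self.current = None
        self.db_status = 1

    def get_next(self, name, city):
        # Move forward to the next record matching the condition, if any.
        for i in range(self.pos + 1, len(self.rows)):
            row = self.rows[i]
            if row["name"] == name and row["city"] == city:
                self.pos, self.current, self.db_status = i, row, 0
                return
        self.pos, self.current, self.db_status = len(self.rows), None, 1

    def get_first(self, name, city):
        self.pos = -1
        self.get_next(name, city)

def total_balance(rows, name, city):
    # Same control flow as the slide: the program, not the DBMS, drives the loop.
    cur = Cursor(rows)
    total = 0
    cur.get_first(name, city)
    while cur.db_status == 0:
        total += cur.current["amount"]
        cur.get_next(name, city)
    return total

shiver_total = total_balance(records, "Shiver", "Bronx")
print(shiver_total)
```

The application has to know how to walk the records and when to stop, which is the dependence on data layout that the hierarchical and network systems were criticized for.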
Relational DBMS
• Late 1970s and early 1980s
• E. F. Codd, 1970 CACM paper, "A Relational Model of Data for Large Shared Data Banks"
• Relational Algebra & Calculus
• The Spartan Simplicity!
• SQL: Structured Query Language
• System R - 1976, IBM's RDBMS research prototype
• Ingres - 1976, first academic RDBMS
Relational DBMS example
    select sum(amount)
    from customer
    where customer.name = 'Shiver' and customer.city = 'Bronx';
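For comparison with the record-at-a-time style, the relational query can be run as a single declarative statement using Python's built-in `sqlite3` module. This is a sketch under assumptions: the single `customer` table and its rows are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (name TEXT, city TEXT, amount INTEGER)")
# Made-up rows: the slide does not give the table contents.
conn.executemany(
    "INSERT INTO customer VALUES (?, ?, ?)",
    [
        ("Lowery", "Queens", 900),
        ("Hodges", "Brooklyn", 647),
        ("Hodges", "Brooklyn", 801),
        ("Shiver", "Bronx", 556),
        ("Shiver", "Bronx", 647),
    ],
)

# One statement says what is wanted; the DBMS decides how to find the rows.
total = conn.execute(
    "SELECT SUM(amount) FROM customer "
    "WHERE customer.name = ? AND customer.city = ?",
    ("Shiver", "Bronx"),
).fetchone()[0]

print(total)
```

No loop, cursor position, or status flag appears in the application code, which is the simplicity the relational model is credited with.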
The advent of new DB applications in the 1980s (1/4)
• CAD/CASE/CAM: massive design data
• Artificial Intelligence: expert systems
• Telecommunication
• Multimedia: image, text, audio, video, etc.
• These call for a richer data model and richer DBMS functions
The advent of new DB applications in the 1980s (2/4)
• Massive design data in CAD/CASE/CAM
[Figure: previous data vs. CAD data]
The advent of new DB applications in the 1980s (3/4)
• Artificial Intelligence: expert systems
[Figure: previous data vs. expertise data; a vehicle-disorder example with symptoms in control, drive, brake, handle, gearbox, and engine leading to the conclusion "engine ECU disorder"]
The advent of new DB applications in the 1980s (4/4)
• Multimedia: image, audio, video
[Figure: previous data vs. multimedia data]
Advent of Object Oriented DBMS
• Mid 1980s to mid 1990s
• Commercial products: O2, ObjectStore, Objectivity, Versant, etc.
• Research prototypes: ORION, POSTGRES, ENCORE/ObServer
• ODMG-93: OODB standard
Features of Object Oriented DBMS
• Object-oriented paradigm support
• Object, object identity
• A return to network-DB-style traversal?
• Class hierarchy, inheritance
• Long-duration transactions
• Persistent programming languages
• Large objects
• Semantic data model extensions
• Versions & composite objects
Object Oriented Database example
[Figure: example schema showing is-part-of and ISA relationships]
OQL query of Object Oriented DBMS
    select sum(customer.deposit.balance)
    from Customer customer
    where customer.name = "Shiver" and
          customer.deposit.branch.city = "Bronx";
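The path expressions in this query (`customer.deposit.branch.city`) follow object references rather than joining tables. A rough Python analogue is sketched below; the class definitions and sample objects are assumptions inferred from the attribute names in the query, not a schema given in the lecture.

```python
from dataclasses import dataclass

# Assumed classes: only the attribute names come from the OQL query.
@dataclass
class Branch:
    city: str

@dataclass
class Deposit:
    balance: int
    branch: Branch

@dataclass
class Customer:
    name: str
    deposit: Deposit

bronx = Branch(city="Bronx")
queens = Branch(city="Queens")

# Made-up objects for illustration.
customers = [
    Customer("Shiver", Deposit(556, bronx)),
    Customer("Shiver", Deposit(647, bronx)),
    Customer("Shiver", Deposit(300, queens)),
    Customer("Lowery", Deposit(900, queens)),
]

# Same shape as the OQL query: filter on a path expression, sum another path.
total = sum(
    c.deposit.balance
    for c in customers
    if c.name == "Shiver" and c.deposit.branch.city == "Bronx"
)

print(total)
```

Both branch objects are shared by reference, so two deposits at the same branch point to one `Branch` object, which loosely mirrors object identity in an OODBMS.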
Object Relational DBMS
• 1980 – 1985: ORDBMS research prototypes
• POSTGRES by UC Berkeley
• System R Engineering Extension
• Relational DBMS with object-oriented functions
• Extension within SQL & tables!
• Early 1990s: downfall of OODBMS (Illustra, UniSQL, Matisse)
• 1997: advent of ORDBMS from the Big 3 vendors
Principal functions of Object Relational DBMS
• LOB (large object) support
• User-defined type & stored procedure support
• Abstract data type support
• SQL procedure extension
• Application-domain-specific extension support
• Rule/trigger system support
• Type inheritance support
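Two items in this list, user-defined extensions and rule/trigger support, can be illustrated loosely with Python's built-in `sqlite3` module. SQLite is not an object-relational DBMS, so this is only an analogy: `create_function` registers an application-defined SQL function and `CREATE TRIGGER` defines a rule that fires on insert. The table names, the `area` function, and the rows are all made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

def area(width, height):
    """Application-defined function made callable from SQL (hypothetical example)."""
    return width * height

# Register the Python function under the SQL name 'area' with 2 arguments.
conn.create_function("area", 2, area)

conn.executescript("""
CREATE TABLE shape (id INTEGER PRIMARY KEY, width REAL, height REAL);
CREATE TABLE audit_log (shape_id INTEGER, note TEXT);

-- Rule/trigger: record every insert into shape.
CREATE TRIGGER log_shape_insert AFTER INSERT ON shape
BEGIN
    INSERT INTO audit_log VALUES (NEW.id, 'inserted');
END;

INSERT INTO shape VALUES (1, 2.0, 3.0), (2, 4.0, 5.0);
""")

areas = [row[0] for row in conn.execute(
    "SELECT area(width, height) FROM shape ORDER BY id"
)]
log_count = conn.execute("SELECT COUNT(*) FROM audit_log").fetchone()[0]

print(areas, log_count)
```

A full ORDBMS goes further than this, for example by letting new types and their operators be defined inside the database itself.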
Object Relational DBMS products
• Oracle 8 Universal Server
• Informix Universal Server
• IBM DB2 Universal Database
• Sybase Adaptive Server
• Microsoft Access
DBMS market share (1/2)
• Worldwide market share for biggest sellers of corporate databases, 2005
[Chart: shares of 48.6%, 22%, and 15%; vendor labels were in the image]
Source: Gartner Dataquest
DBMS market share (2/2)
• Worldwide sales for biggest sellers of corporate databases, 2005
[Chart: sales of $6.7, $3.0, and $2.1 billion; vendor labels were in the image]
Source: Gartner Dataquest
Domestic DBMS market share
Source: Report for database industry and perspective in Korea, 2004
Domestic DBMS market sales
• Domestic sales for biggest sellers of corporate databases, 2004
[Chart: sales of ₩57.2, ₩45.3, and ₩25.1 billion; vendor labels were in the image]
Source: Gartner Dataquest, South Korea (2005)
Preference in domestic market
[Chart: only the label "Others 3%" survives]
Source: Report for database industry and perspective in Korea, 2004
XML Technology (1/2)
• Late 1990s to the present
• What is XML (eXtensible Markup Language)?
• Developed by the W3C
• Semi-structured text for dissemination and publication
• Self-describing
• HTML: tagging for display
    <tr> <td> <font color="red">Name</font> </td> <td>Hong Gildong</td> </tr>
    <tr> <td> <b>Address</b> </td>
• XML: tagging for structure and semantics
    <person> <name>Hong Gildong</name> <city>Seoul</city> <age>20</age> … </person>
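"Self-describing" means a program can recover field names from the document itself, without a separate layout description. A minimal sketch using Python's standard `xml.etree.ElementTree` module follows; the sample values are the slide's, rendered in English, and the elided part of the `<person>` element ("…") is simply left out.

```python
import xml.etree.ElementTree as ET

# The <person> fragment from the slide, without the elided part.
xml_text = (
    "<person>"
    "<name>Hong Gildong</name>"
    "<city>Seoul</city>"
    "<age>20</age>"
    "</person>"
)

person = ET.fromstring(xml_text)

# Tag names carry the meaning of each value, so a generic loop can build a record.
record = {child.tag: child.text for child in person}

print(person.tag, record)
```

The HTML fragment offers no such handle: its tags say how to display "Hong Gildong" (a table cell, a red font) but not that the value is a name.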