130 likes | 147 Views
534A – Topics in Database Design (Metadata Management). Rachel Pottinger Course Introduction 2005/1/11. Metadata – data about data. Relational Schema XML Schema or DTD UML document ER Diagrams Example: in a database about grades, the grades are the data. How it’s stored is the metadata.
E N D
534A – Topics in Database Design (Metadata Management) Rachel Pottinger Course Introduction 2005/1/11
Metadata – data about data • Relational Schema • XML Schema or DTD • UML document • ER Diagrams Example: in a database about grades, the grades are the data. How it’s stored is the metadata
What is this class about? Things this class is not about: • Using databases • Database internals • Building database applications Things this class is about: • Metadata applications • Current research strategies for metadata • Future metadata research directions
What aren’t the requirements for this class? • There are no set pre-requisites • You do not need to have taken a database class • Assuming you have a good base in CS, I should provide you with all background material you need
This class is a seminar A seminar is “a small group of advanced students … under the guidance of a professor who meets regularly with them to discuss...” [dictionary.com] • 1 or 2 papers to read for most classes • Most classes students will present the papers and lead discussions. Plan to present once and lead discussion once. I’ll provide: • Necessary background beforehand • Suggestions on how to read the paper where necessary • The high level goals of reading the paper • A set of suggested discussion questions • Feedback on your plans and answers to questions • Sometimes I’ll present the papers and lead discussion
What are the requirements for this class? • Ability to read and respond to 1 – 2 papers a class • Ability to do a project (not necessarily implementation based) either in a group or on your own • Ability and willingness to present papers and lead discussions • Willingness to discuss your own ideas and questions in class Other handy things: databases, AI, logic
Introduce your partner Discuss the answers to the following questions with your partner for next 5 minutes: • What is your name? • What is your affiliation with UBC? • What is your favourite colour? • Where are you from? • What is your database/data management background? • What do you want to get out of this class? Grab a card write your name on the front and pronounciation on the back and prepare to have your picture taken
Some reasons to use a database: • Large amounts of data • Structured data • Persistent data • Valuable data • Performance requirements • Concurrent access to the data • Restricted access to data
What data is stored in databases? This space intensionally left blank
What data do you have? Here is some data I have: • Papers I’ve read • Addresses • Job search data • Experiments I’ve run • Grades • CDs, DVDs, and books I own • Recipes
Introduction and background Overview of data management What kinds of data are managed by today’s data management systems Metadata management and Model Management What is metadata? How is metadata handled today? Model management: a metadata system Generic Merge Merging applicationsCreating schemas for data integration Creating schemas for view integration Merging ontologies Theoretical aspects of schema merging Merging in CSCW Generic schema merging Match Current schema matching algorithms Schema matching applied to medical ontologies Creating data translation from correspondences Information capacity – a metric for success in matching and merging Translating schemas from one meta-model to another Logic in metadata management Description Logics Concept Base Project presentations Wrap-up Class outline
To do: Course website: http://www.cs.ubc.ca/~rap/teaching/534a/ • Mailing list: mail majordomo@cs.ubc.cawith “subscribe cpsc534a” in the body • Think more about data you have • Think about which papers you’d like to present/lead discussion on • Read the project description, and think about projects
What data do people have? • Pictures – media in general difficult to categorize, hard to deal with things you didn’t know what you were interested in advance • Movies that you’ve seen, and ratings – easy (generic) • Popular links on websites • Exercise training routines • Health – food log, etc. • Collections • E-mail addresses • What experiments are what – how to organize • Paper reviews • Motion capture data (how do you relate them) • Vision data sets • Cars, models, makes, etc • Contact information • E-books • Bills • Paper lists • Photos, etc. • Assignments we have done • Budgets • TA – assignments that people have made • Music that you like –hard to find if you only like some part • E-mails • Job search – cover letters, etc. • Shopping • E-mail threads • Software configuration files • Revision control with better search • Track your friends and family • Calendars • Deadlines • Research notes • Hard to find specific photos sometimes • Restaurant lists