170 likes | 287 Views
Advanced Database System Design. Instructor: Ruoming Jin Fall 2010. Welcome!. Instructor: Ruoming Jin Homepage: www.cs.kent.edu/~jin/ Office: 264 MCS Building Email: jin@cs.kent.edu Office hour: Tuesday (5:00PM-5:30PM) Thursdays (4:30PM to 5:30PM) or by appointment. Overview.
E N D
Advanced Database System Design Instructor: Ruoming Jin Fall 2010
Welcome! • Instructor: Ruoming Jin • Homepage: www.cs.kent.edu/~jin/ • Office: 264 MCS Building • Email: jin@cs.kent.edu • Office hour: Tuesday (5:00PM-5:30PM) Thursdays (4:30PM to 5:30PM) or by appointment
Overview • Homepage: www.cs.kent.edu/~jin/AdvDB10Fall/datamining.html • Time: 5:30-6:45PM Tuesdays and Thursdays • Place: MSB 228 • Prerequisite: Into to Database, Data Structures, and Algorithms.
Overview • No Official Textbook • References Hector Garcia-Molia, Jeffrey Ullman, and Jennifer Widom, Database Systems, The Complete Book, Prentice Hall, 2002 Silberschatz, Korth, Sudarshan, Database System Concepts, 5th Edition, McGraw Hill 2006 Bin Liu, Web Data Mining, Springer, 2007 Ricardo Baeza-Yates and Berthier Riberiro-Neto, Modern Information Retrieval, Addison Wesley, 1999 Ian H. Witten, Alistair Moffat, and Timothy Bell, Managing Gigabytes, Second Edition, Morgan Kaufmann, 1999
Overview • Grading scheme • No exam
Overview (Presentation) • Paper presentation • One per student • Research paper(s) • List of recommendations (will be available by the end of Sept.) • Your own pick (upon approval) • Your overall presentation will be less than 40 minutes (30 minutes talk) • Three parts • Review of research ideas in the paper • Debate (Pros/Cons) • Questions and comments from audience
Overview (Presentation) • Order of presentation: assigned by instructor • The presentation will start from late October or early November • You need submit two drafts before the final presentation • First draft due on Oct. 14th • Second draft due on Oct. 21th • Final slides due one day before your presentation. • I will provide feedback and suggestion for each draft • Note that I do expect the complete presentation slides in the first draft
Overview (Project) • Project (due Dec 7rd) • One project: One to Three students (The more students, the more task is expected to be done). • Some basic suggestion later. • A variety of subjects (some of them has individual mentors). • Checkpoints • Proposal: title and goal (due Sep. 30th) • Outline of approach (due Oct 7th) • An intermediate report and research discussion (Nov. 3rd) • Project/Demo Presentation (Dec 9th) • Documentation (due Dec 14rd) • Each group will have a short presentation and demo (20 minutes) • Each group will provide a five-page document on the project
Topics 1 • Scope: Data Warehouse/Data Mining • Topics: • OLAP • Association Rule • Classification and Prediction • Clustering
Topics 2 • Scope: Web Mining/Information Extraction • Topics: • Web Crawler • Page Rank • Text Mining (basics) • Information Extraction (basics)
Topics 3 • Scope: Relational DB Implementation • Topics: • Query Evaluation • Query Optimization • Indexing
Topics 4 • Scope: Information Retrieval • Topics: • Inverted List • Query Evaluation • Index Construction/Compression
Presentation Topics • Social Network Analysis • New Database Architecture • Cloud Computing • Information Integration
Project Ideas • Web Mining + Social Network Analysis • Twitter, Wikipedia • Fatwallet • Eopinion/Amazon • Yelp • Computer Science Literature • Information/Data Integration • Noaa data • Mobile Data Mining • Mobile user profiling