1 / 5

Midterm Review

Prepare for your midterm exam on data-intensive computing with a focus on topics like web services, MapReduce, and Hadoop architecture. Study materials, solve problems, and practice writing pseudo code to ensure success.

ascruggs
Download Presentation

Midterm Review

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Midterm Review CSE4/587 B.Ramamurthy 1/1/2020 B.Ramamurthy B.Ramamurthy 1

  2. Exam Date • October 25, 2011 • Location 107 Talbert • Please bring • Pencils, pens and erasers. • This is a closed book exam. • NO Other material is allowed. • No calculators/phones. • Arrive on time, no extra time will be given if you arrive late 1/1/2020 B.Ramamurthy B.Ramamurthy 2

  3. Topics • Defining data intensive computing ( as in Fourth Paradigm: up to p.19) • Enabling Technologies (ET): • ET1: Web service • ET2: Special data structures and algorithms • NO GAE • MapReduce model: components: Mapper, Reducer, Partitioner, Combiner; Execution framework , shuffle and sort • Hadoop (HDFS) : as in yahoo site: Ch1, 2, 4; 5 only partitioner. • Problem solving with MR: • Chapter 1-4 in Lin and Dryer’s text • Tom White analysis of web log (Don’t ask me for the handout, go find it) 1/1/2020 B.Ramamurthy B.Ramamurthy 3

  4. Questions • Defining data-intensive computing: J. Gray • Given a problem solve it using MR • Given a MR provide, provide a numerical example trace • Best practices and design patterns described in the Lin&Dryer text • Web services and project 1 • Hadoop (HDFS) architecture • Functions of various MR modules B.Ramamurthy

  5. How to study? • Make a list of all material to study. • Study the material • Practice writing pseudo code for the MRs • Use block diagrams and numerical examples when necessary B.Ramamurthy

More Related