380 likes | 394 Views
Dive into the world of algorithms and data structures in CSCE 221H with topics like stacks, heaps, graphs, and more. Learn through labs, quizzes, programming assignments, and culture tasks. Understand the analysis of algorithms through theoretical and experimental studies. Get ready to enhance your programming skills and problem-solving techniques!
E N D
Data Structures and Algorithms CSCE 221H Dr. Scott Schaefer
Staff • Instructor • Dr. Scott Schaefer • HRBB 304A • Office Hours: TR 8am-9am (or by appointment) • TA • Harish Kumar • EABB • Office Hours: TR noon-1pm • Peer Teachers • Nathan Brockway, Dominick Fabian • Peer Teacher Central (HRBB 129)
What will you learn? Analysis of Algorithms Stacks, Queues, Deques Vectors, Lists, and Sequences Trees Priority Queues & Heaps Maps, Dictionaries, Hashing Skip Lists Binary Search Trees Sorting and Selection Graphs
Prerequisites • CSCE 121 “Introduction to Program Design and Concepts” • CSCE 222 “Discrete Structures” or MATH 302 “Discrete Mathematics” (either may be taken concurrently with CSCE 221)
Grading • 3% Labs • 12% Homework • 5% Culture Assignments • 30% Programming Assignments • 10% Quizzes • 20% Midterm • 20% Final
Assignments • Turn in code/homeworks via Google Classroom (invitation code gpj6ff) • Due by 11:59pm on day specified • All programming in C++ • Code, proj file, sln file, and Win32 executable • Make your code readable (comment) • You may discuss concepts, but coding is individual (no “team coding” or web)
Late Policy • Penalty = m: number of minutes late percentage penalty days late
Late Policy • Penalty = m: number of minutes late percentage penalty days late
Late Policy • Penalty = m: number of minutes late percentage penalty days late
Late Policy • Penalty = m: number of minutes late percentage penalty days late
Labs • Several structured labs at the beginning of the semester with (simple) exercises • Graded on completion • Time to work on homework/projects
Homework Approximately 5 Written/Typed responses Simple coding if any
Programming Assignments About 5 throughout the semester Implementation of data structures or algorithms we discusses in class Written portion of the assignment
Quizzes • Approximately 10 throughout the semester • Short answer, small number of questions • Will only be given in class or lab • Must be present to take the quiz
Culture Assignments • Two research seminar reports • One before Spring break, one after • Biography of a famous Computer Scientist • 5 minute presentation • Signup this week by sending me an email • http://faculty.cs.tamu.edu/schaefer/teaching/221_Fall2018/Assignments/culture.html
Academic Honesty • Assignments are to be done on your own • May discuss concepts, get help with a persistent bug • Should not copy work, download code, or work together with others unless specifically stated otherwise • We use a software similarity checker • http://moss.stanford.edu
Class Discussion Board piazza.com/tamu/fall2018/csce221200
Asymptotic Analysis 19/38
Running Time • The running time of an algorithm typically grows with the input size. • Average case time is often difficult to determine. • We focus on the worst case running time. • Crucial to applications such as games, finance, and robotics • Easier to analyze worst case 5ms 4ms Running Time 3ms best case 2ms 1ms A B C D E F G Input Instance 20/38
Experimental Studies • Write a program implementing the algorithm • Run the program with inputs of varying size and composition • Use a method like clock() to get an accurate measure of the actual running time • Plot the results 21/38
Stop Watch Example 22/38
Limitations of Experiments • It is necessary to implement the algorithm, which may be difficult • Results may not be indicative of the running time on other inputs not included in the experiment. • In order to compare two algorithms, the same hardware and software environments must be used 23/38
Theoretical Analysis • Uses a high-level description of the algorithm instead of an implementation • Characterizes running time as a function of the input size, n. • Takes into account all possible inputs • Allows us to evaluate the speed of an algorithm independent of the hardware/software environment 24/38
Important Functions • Seven functions that often appear in algorithm analysis: • Constant 1 • Logarithmic log n • Linear n • N-Log-N n log n • Quadratic n2 • Cubic n3 • Exponential 2n 25/38
Important Functions • Seven functions that often appear in algorithm analysis: • Constant 1 • Logarithmic log n • Linear n • N-Log-N n log n • Quadratic n2 • Cubic n3 • Exponential 2n 26/38
Why Growth Rate Matters runtime quadruples when problem size doubles 27/38
Comparison of Two Algorithms insertion sort is n2 / 4 merge sort is 2 n lg n sort a million items? insertion sort takes roughly 70 hours while merge sort takes roughly 40 seconds This is a slow machine, but if 100 x as fast then it’s 40 minutes versus less than 0.5 seconds 28/38
Constant Factors • The growth rate is not affected by • constant factors or • lower-order terms • Examples • 102n+105is a linear function • 105n2+ 108nis a quadratic function 29/38
Big-Oh Notation • Given functions f(n) and g(n), we say that f(n) is O(g(n)) if there are positive constantsc and n0 such that f(n)cg(n) for n n0 • Example: 2n+10 is O(n) • 2n+10cn • (c 2) n 10 • n 10/(c 2) • Pick c = 3 and n0 = 10 30/38
Big-Oh Example • Example: the function n2is not O(n) • n2cn • n c • The above inequality cannot be satisfied since c must be a constant 31/38
More Big-Oh Examples • 7n-2 7n-2 is O(n) need c > 0 and n0 1 such that 7n-2 c•n for n n0 this is true for c = 7 and n0 = 1 • 3n3 + 20n2 + 5 3n3 + 20n2 + 5 is O(n3) need c > 0 and n0 1 such that 3n3 + 20n2 + 5 c•n3 for n n0 this is true for c = 4 and n0 = 21 • 3 log n + 5 3 log n + 5 is O(log n) need c > 0 and n0 1 such that 3 log n + 5 c•log n for n n0 this is true for c = 8 and n0 = 2 32/38
Big-Oh and Growth Rate • The big-Oh notation gives an upper bound on the growth rate of a function • The statement “f(n) is O(g(n))” means that the growth rate of f(n) is no more than the growth rate of g(n) • We can use the big-Oh notation to rank functions according to their growth rate 33/38
Big-Oh Rules • If is f(n) a polynomial of degree d, then f(n) is O(nd), i.e., • Drop lower-order terms • Drop constant factors • Use the smallest possible class of functions • Say “2n is O(n)” instead of “2n is O(n2)” • Use the simplest expression of the class • Say “3n+5 is O(n)” instead of “3n+5 is O(3n)” 34/38
Computing Prefix Averages • We further illustrate asymptotic analysis with two algorithms for prefix averages • The i-th prefix average of an array X is average of the first (i+ 1) elements of X: A[i]= (X[0] +X[1] +… +X[i])/(i+1) 35/38
Prefix Averages (Quadratic) • The following algorithm computes prefix averages in quadratic time by applying the definition AlgorithmprefixAverages1(X, n) Input array X of n integers Output array A of prefix averages of X #operations A new array of n integers n fori 0 ton 1 do n s X[0] n forj 1 toido 1 + 2 + …+ (n 1) s s+X[j] 1 + 2 + …+ (n 1) A[i] s/(i+ 1)n returnA 1 36/38
The running time of prefixAverages1 isO(1 + 2 + …+ n) The sum of the first n integers is n(n+ 1) / 2 Thus, algorithm prefixAverages1 runs in O(n2) time Arithmetic Progression 37/38
Prefix Averages (Linear) • The following algorithm computes prefix averages in linear time by keeping a running sum AlgorithmprefixAverages2(X, n) Input array X of n integers Output array A of prefix averages of X #operations A new array of n integers n s 0 1 fori 0 ton 1 do n s s+X[i] n A[i] s/(i+ 1)n returnA 1 • Algorithm prefixAverages2 runs in O(n) time 38/38