640 likes | 969 Views
P, NP, and Complexity. Six fundamental facts One rule of thumb Three fundamental notions One fundamental question. School of IT. Peter Eades. Fundamental Fact #1. Fundamental Fact # 1 Exponential functions are eventually bigger than polynomial functions. Fundamental Fact #1.
E N D
P, NP, and Complexity Six fundamental facts One rule of thumb Three fundamental notions One fundamental question School of IT Peter Eades
Fundamental Fact #1 Fundamental Fact #1 Exponential functions are eventually bigger than polynomial functions
Fundamental Fact #1 Exponential functions are eventually bigger than polynomial functions Examples of exponential functions: Examples of polynomial functions: exponential function polynomial function
Fundamental Fact #2 Fundamental Fact #2 Some algorithms are efficient, some are not
Fundamental Fact #2 Some algorithms are efficient, some are not Sorting problem: Alice Bob Chen Doug Evelyn Franz Georgie Evelyn Georgie Franz Chen Bob Alice Doug ? Algorithm ?
sorting There are many different algorithms for sorting Basic step: • If two things are out of order, we can swap them to improve the sortedness of the list Evelyn Georgie Franz Chen Bob Alice Doug Evelyn Georgie Chen Franz Bob Alice Doug Evelyn Georgie Chen Franz Alice Bob Doug …
sorting exhaustingSort • Exhaustively swap to get every possible ordering of the list • At each step, check to see if the list of ordered Run-time for exhaustingSort: • You need to consider different orderings of the list • This takes time at least proportional to • This is infeasible
sorting bubbleSort • Scan the list from top to bottom • If the th and th elements are out of order, then swap them • Keep scanning and swapping until the list is sorted Run-time for bubbleSort: • you need to scan the list times • make swaps. This takes time proportional to .
sorting geeky_bubbleSort • Same old bubbleSort algorithm, but use really clever coding in assembler and C languages to make it go faster. • Run on a fast computer. Run-time for geeky_bubbleSort: • you still need to scan the list times • you still make swaps. It still takes time proportional to . But it seems to be much faster … … …
sorting AISASort • Measure the “sortedness” of the sequence: • Choose swaps that increase as much as possible • Use an AI technique called “simulated annealing” to search for a maximally sorted sequence Run-time for AISASort: exponential
sorting quickSort • Choose an element, called a pivot, from the list. • Swap things around so that smaller things come before the pivot, and larger things come after it • Recursively sort the sub-list of lesser elements and the sub-list of greater elements. 1 2 3 Alice Bob Chen Doug Evelyn Franz Georgie Evelyn Georgie Franz Bob Chen Alice Doug Bob Alice Chen Evelyn Doug Franz Georgie Run-time for quickSort: proportional to
sorting How long does it take to sort things? Remarks: • quickSort is excellent • bubbleSort and geeky_bubbleSort are feasible • AISASort is infeasible (as well as exhaustingSort)
Rule of Thumb #1 Rule of Thumb #1 An algorithm that runs in exponential time is not feasible; an algorithm that runs in polynomial time may be feasible.
Rule of Thumb #1 An algorithm that runs in exponential time is not feasible; an algorithm that runs in polynomial time may be feasible. • Infeasible • exhaustingSort • AISASort • Feasible • bubbleSort • quickSort
Fundamental Notion #1: P Fundamental Notion #1: P P is the set of all problems that can be solved in polynomial time
Fundamental Notion #1: P P Sorting problem Find Max Max Flow
Fundamental Fact #3 Fundamental Fact #3 Some problems can be solved with efficient algorithms, and some others … maybe not
traveling salesman problem Some problems can be solved with efficient algorithms, and some others … maybe not Travelling salesman problem: • How can we choose a route around cities to minimise the total distance travelled? • We need to order the cities appropriately 2 3 5 6 1 4 7
traveling salesman problem Some problems can be solved with efficient algorithms, and some others … maybe not Travelling salesman problem: • How can we choose a route around cities to minimise the total distance travelled? • We need to order the cities appropriately 2 1 4 5 0 3 6
traveling salesman problem There are many different algorithms for the traveling salesman problem Basic problem: • We need to swap around the order of cities to decrease the length travelled • Sydney • Darwin • Brisbane • Melbourne • Perth • Adelaide • Hobart • Sydney • Brisbane • Darwin • Melbourne • Perth • Adelaide • Hobart Like the sorting problem
traveling salesman problem exhaustingTSP • Exhaustively swap to get every possible ordering of the cities • At each step, compute the total distance travelled • Keep the tour with the minimum total distance Run-time for exhaustingTSP: • You need to consider different orderings of the cities • This takes time at least proportional to • This is infeasible
traveling salesman problem bubbleTSP • Scan the current list of cities • If swapping the th and th cities would decrease distance, then swap them • Keep scanning and swapping until no swap decreases distance Run-time for bubbleTSP: • Hard theorem: it can take exponentially time, but not as bad as exhaustingTSP Effectiveness of bubbleTSP: • Does not give optimal results • Gives mediocre results
traveling salesman problem AISA_TSP • Choose swaps that decrease distance as much as possible • Use an AI technique called “simulated annealing” to search for an ordering with smallest total distance Run-time for AISA_TSP: • Hard theorem: it can take exponentially long • Faster than exhaustingTSP • Slower than bubbleTSP Effectiveness of AISA_TSP: • Does not give optimal results • Gives mediocre results, but better than bubbleTSP
traveling salesman problem How long does it take to find a solution to TSP? Remarks: • We don’t know any algorithm for TSP that runs fast and gives an optimal result
Fundamental Fact #3 Fundamental Fact #3 Some problems can be solved with efficient algorithms, and some others … maybe not (continued)
cliques Clique problem: • Given a network of size and an integer • Does have nodes that are all connected to each other?
cliques Some people: • Alice, Andrea, Annie, Amelia, Bob, Brian, Bernard, Boyle Some connections • Bob is connected to Alice • Bob is connected to Andrea • Bob is connected to Amelia • Brian is connected to Alice • Brian is connected to Andrea • Brian is connected to Amelia • Boyle is connected to Alice • Boyle is connected to Andrea • Boyle is connected to Annie • Bernard is connected to Alice • Bernard is connected to Andrea • Bernard is connected to Annie Is there a clique of size 3 among these people? (Clique: a group of people who are all connected to each other)
cliques • Another network • Is there a clique of size 5 among these nodes?
cliques • Clique of size 4
cliques exhaustingClique • Test every subset of nodes, check to see if these nodes form a clique Run-time for exhaustingClique: • You need to consider different subsets of nodes • If then is exponential • This is infeasible
Fundamental Fact #3 Fundamental Fact #3 Some problems can be solved with efficient algorithms, and some others … maybe not (still continued)
sat The Satisfiabilty (SAT) problem If Peter is Australian, then Patrick is Australian, and Either Patrick is Australian, or Patrick is Irish, or Paul is English, and If Catherine is English then either Paul is not English or Jane is French, and If Jane is French and Patrick is not Australian, then Peter is English, and If Peter is English then Peter is not Irish, and If Paul is not English then either Catherine is Australian or Jane is French, and …. A statement • Question: • Is there any way that this statement could be true?
sat SAT If then , and Either , or , or , and If then either or , and If and , then not , and If then not , and If then either or , and …. Given an abstract proposition Are there truth values for , , , ... such that this statement is true?
sat exhaustingSAT • Test every combination of true and false values for , , , ... , check to see whether the whole statement is true Run-time for exhaustingSAT: • You need to consider different combinations of true and false values for , , , ... • This is infeasible
Cliques, travelling salesmen, and satisfaction Some problems can be solved with efficient algorithms, and some others … maybe not The Travelling Salesman Problem, the Clique Problem, and SAT all share some characteristics: • They can be solved by exhausting algorithms that use exponential time • No-one knows any algorithms that run fast and always solve these problems Remarks • These problems are commercially important, and a huge amount of research has been done on them. • There are many algorithms for specific versions, and many algorithms that almost work • There are many other problems with the same characteristics
Fundamental Fact #4 Fundamental Fact #4 Sometimes we can efficiently check whether an answer is correct, even if we can’t efficiently find a correct answer
Cliques, travelling salesmen, and satisfaction Sometimes we can efficiently check whether an answer is correct, even if we can’t efficiently find a correct answer • Clique Problem: • Given a set of nodes, one can quickly check whether these nodes are all connected to each other. • But it seems difficult to find the right set of nodes. • SAT: • Given a truth value for each variable it is easy to check whether these truth values make the statement true. • But it seems hard to find the right truth values • Traveling Salesman Problem: • Hmmmm … it may be hard even to check whether a given order of cities is optimal?
Fundamental Notion #2: NP Fundamental Notion #2: NP NP is the set of problems for which we can efficiently check to see whether a given answer is correct.
Fundamental Notion #2: NP NP Clique SAT ? Traveling Salesman ?
Fundamental Notion #2: NP NP P Clique Sorting problem SAT ? Traveling Salesman ?
Fundamental Fact #5 Fundamental Fact #5 Some problems are harder than others
Clique is harder than SAT Some problems are harder than others • The Clique problem is at least as difficult as SAT. Theorem If there were a good algorithm to solve the clique problem, then there would be a good algorithm to solve SAT. Proof
Clique is harder than SAT Theorem If there were a good algorithm to solve the clique problem, then there would be a good algorithm to solve SAT. Proof: Suppose that we have a good fast algorithm that solves the clique problem fast. Largest clique in network Network Using ,let’s make an algorithm to solve SAT. Truth values that satisfy Proposition
Clique is harder than SAT reduction Proposition Network Truth values that satisfy Largest clique in network
Clique is harder than SAT reduction Either , or , or , and Either , or , or , and Either , or , or , and Either , or , or , and Either , or , or , and Either , or , or , and Either , or , or , and …. Nodes: • For each , there is a node for variable and a node for the negation of . Connections: • Connect two nodes if they do not occur in the same clause, and one is not the negation of the other. Proposition Network reduction
Clique is harder than SAT reduction Proposition Network Truth values that satisfy Largest clique in network
Clique is harder than SAT Theorem If there were a good algorithm to solve the clique problem, then there would be a good algorithm to solve SAT. Proof: If we knew a good fast algorithm for the clique problem, then we could use it to make an algorithm to solve SAT. • That is, if the clique problem were easy, then SAT would be easy • That is, the clique problem is at least as difficult as SAT
Fundamental Fact #5 Some problems are harder than others • Also, the travelling salesman problem is at least as hard as SAT. Clique Traveling Salesman Is_at_least_as_hard_as Is_at_least_as_hard_as SAT
Fundamental Fact #5 Scheduling In fact, the relationship “at_least_as_hard_as” between problems has been very well investigated. For many problems, there is a proof that this problem is at least as hard as SAT. Bin Packing Hamilton Path Feedback arc set Graph colouring Traveling Salesman Independent set Clique SAT
Fundamental Fact #5 In fact, the relationship “at_least_as_hard_as” between problems has been very well investigated. For many problems, there is a proof that this problem is harder than SAT. Many problems are at_least_as_hard_asSAT. • Feedback Arc Set Problem: solutions needed to detect deadlocks in communications systems • Scheduling problems: solutions need in logistics industry, and in computer systems • Bin Packing problems: solutions needed in manufacturing: steel industry, clothing industry • Many layout problems: solutions needed for network visualization, newspaper layout, integrated circuit layout • Number theory problems: solutions needed for cryptography