CS B553: Algorithms for Optimization and Learning
Univariate optimization
Key Ideas
• Critical points
• Direct methods
  • Exhaustive search
  • Golden section search
• Root finding algorithms
  • Bisection
  • [More next time]
• Local vs. global optimization
• Analyzing errors, convergence rates
Figure 1: A function f(x) with local maxima, local minima, and an inflection point.
Figure 2a: An objective f(x) on an interval [a, b].
Figure 2b: Find the critical points of f on [a, b] and apply the second-derivative test.
Figure 2c: The global minimum on [a, b] must be one of these candidate points: a critical point or an endpoint.
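As a concrete illustration of Figures 2a–2c, here is a minimal sketch (assuming SymPy is available; the polynomial f and the interval are invented examples, not from the course) that solves f′(x) = 0, classifies the roots with the second-derivative test, and compares against the endpoints:

```python
# Sketch of the critical-point approach on a closed interval [a, b]:
# solve f'(x) = 0, classify roots with the second-derivative test,
# and compare against the endpoints. The polynomial is an arbitrary
# example, not one from the slides.
import sympy as sp

x = sp.symbols('x', real=True)
f = x**4 - 3*x**3 + 2                 # example objective
a, b = -1, 3                          # interval of interest

fp, fpp = sp.diff(f, x), sp.diff(f, x, 2)

# Critical points: real roots of f'(x) = 0 inside [a, b]
critical = list(sp.solveset(fp, x, domain=sp.Interval(a, b)))
for c in critical:
    s = fpp.subs(x, c)
    kind = ('local min' if s > 0 else
            'local max' if s < 0 else 'test inconclusive')
    print(f"critical point x = {c}: {kind}")

# The global minimum on [a, b] is at a critical point or an endpoint
candidates = critical + [sp.Integer(a), sp.Integer(b)]
x_star = min(candidates, key=lambda c: f.subs(x, c))
print(f"global min on [{a},{b}] at x = {x_star}, f = {f.subs(x, x_star)}")
```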
Figure 3: Exhaustive grid search over [a, b].
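A minimal sketch of the method (the names f, a, b, eps are placeholders, not course code): evaluate f at evenly spaced points and keep the best.

```python
# Minimal sketch of exhaustive grid search over [a, b] with grid
# spacing at most eps.
import math

def grid_search(f, a, b, eps):
    """Return (x, f(x)) for the best point on a grid of spacing <= eps."""
    n = max(1, math.ceil((b - a) / eps))       # number of grid cells
    best_x, best_f = a, f(a)
    for i in range(1, n + 1):
        xi = a + (b - a) * i / n               # i-th grid point
        fi = f(xi)
        if fi < best_f:
            best_x, best_f = xi, fi
    return best_x, best_f

# Example: some grid point lies within eps/2 of the true minimizer, so
# if f is K-Lipschitz the returned value is within K*eps/2 of f(x*)
# (see Figure 6), even though the returned x itself may be far from x*.
print(grid_search(lambda t: (t - 0.7) ** 2, 0.0, 2.0, 1e-3))
```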
Figure 4: Two types of error for an estimate xt of the minimizer x*: the geometric error |xt − x*| and the analytical error |f(xt) − f(x*)|.
Does exhaustive grid search achieve ε/2 geometric error? Not necessarily for multi-modal objective functions: the lowest grid point may lie near a different local minimum, so the geometric error can be much larger than ε/2.
Figure 5: Lipschitz continuity: |f(x) − f(y)| ≤ K|x − y|, i.e., the graph of f is bounded between lines of slope +K and −K through any of its points.
Figure 6: With grid spacing ε, exhaustive grid search achieves Kε/2 analytical error in the worst case.
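The argument behind this worst-case bound is one line; it is reconstructed here from the Lipschitz definition above (not text from the original slides):

```latex
% Some grid point x_g lies within \epsilon/2 of the minimizer x^*,
% so K-Lipschitz continuity bounds the analytical error:
f(x_g) - f(x^*) \;\le\; K\,\lvert x_g - x^*\rvert \;\le\; \frac{K\epsilon}{2}
% and the best grid point found can only have a smaller f-value.
```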
Figure 7a: Golden section search. Bracket [a, b] with an intermediate point m such that f(m) < f(a), f(b).
Figure 7b: Golden section search. A new point c splits the bracket into candidate bracket 1, [a, m], and candidate bracket 2, [c, b]; comparing f(c) with f(m) determines which candidate is kept.
Optimal choice: based on the golden ratio. Choose c so that (c − a)/(m − c) = φ, where φ = (1 + √5)/2 ≈ 1.618 is the golden ratio ⇒ the bracket is reduced by a factor of φ − 1 ≈ 0.618 at each step.
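A runnable sketch of the procedure in Figures 7a–7b (function and parameter names are placeholders, not course code):

```python
# Sketch of golden section search for a unimodal f on [a, b]. Placing
# the interior points at the golden ratio lets one of them be reused,
# so each iteration costs a single new f-evaluation.
import math

PHI = (1 + math.sqrt(5)) / 2           # golden ratio
INV = 1 / PHI                          # = PHI - 1 ~ 0.618

def golden_section_search(f, a, b, tol=1e-8):
    """Return an approximate local minimizer of f on the bracket [a, b]."""
    # Two interior points placed symmetrically at the golden ratio
    c = b - INV * (b - a)
    d = a + INV * (b - a)
    fc, fd = f(c), f(d)
    while b - a > tol:
        if fc < fd:                    # minimum lies in [a, d]
            b, d, fd = d, c, fc
            c = b - INV * (b - a)
            fc = f(c)
        else:                          # minimum lies in [c, b]
            a, c, fc = c, d, fd
            d = a + INV * (b - a)
            fd = f(d)
    return (a + b) / 2

# Example: minimize a unimodal test function
print(golden_section_search(lambda t: (t - 1.3) ** 2 + 0.5, 0.0, 3.0))
```

Because the previous interior point is reused, each iteration shrinks the bracket by the constant factor φ − 1, matching the linear convergence rate stated in the notes below.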
Notes
• Exhaustive search is a global optimization method: its error bound holds for finding the true optimum
• GSS is a local optimization method: its error bound holds only for finding a local minimum
• Convergence rate is linear: |xn − x*| ≤ (φ − 1)^n (b − a)/2, where xn is the sequence of bracket midpoints
Figure 8: Root finding: to optimize f(x), find the x-value where f′(x) crosses 0.
Figure 9: Bisection on g(x). Maintain a bracket [a, b] with the invariant sign(g(a)) != sign(g(b)); at each step, evaluate the midpoint m and keep whichever half-bracket, [a, m] or [m, b], preserves the invariant. Linear convergence: the bracket size is reduced by a factor of 0.5 at each iteration.
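A minimal sketch of this loop (placeholder names, not course code; to minimize f, pass g = f′ as in Figure 8):

```python
# Sketch of bisection for root finding: maintain the sign invariant
# sign(g(a)) != sign(g(b)) and halve the bracket each iteration.

def bisection(g, a, b, tol=1e-10):
    """Return x in [a, b] with g(x) ~ 0, given sign(g(a)) != sign(g(b))."""
    ga, gb = g(a), g(b)
    assert ga * gb <= 0, "bracket must satisfy the sign invariant"
    while b - a > tol:
        m = (a + b) / 2                # midpoint
        gm = g(m)
        if ga * gm <= 0:               # root lies in [a, m]
            b, gb = m, gm
        else:                          # root lies in [m, b]
            a, ga = m, gm
    return (a + b) / 2

# Example: minimize f(x) = (x - 2)^2 by finding the root of f'(x) = 2(x - 2)
print(bisection(lambda t: 2 * (t - 2), 0.0, 5.0))
```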
Next time
• Root finding methods with superlinear convergence
• Practical issues