Chapter 2: Optimization
G. Anuradha
Contents • Derivative-based Optimization • Descent Methods • The Method of Steepest Descent • Classical Newton’s Method • Step Size Determination • Derivative-free Optimization • Genetic Algorithms • Simulated Annealing • Random Search • Downhill Simplex Search
What is Optimization? • Choosing the best element from some set of available alternatives • Solving problems in which one seeks to minimize or maximize a real function
Notation of Optimization • Optimize y = f(x1, x2, …, xn) …(1) • subject to gj(x1, x2, …, xn) ≤ / ≥ / = bj, j = 1, 2, …, m …(2) • Eqn. (1) is the objective function • Eqn. (2) is a set of constraints imposed on the solution • x1, x2, …, xn are the decision variables • Note: the problem is either to maximize or minimize the value of the objective function (a worked sketch follows)
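To make this notation concrete, here is a minimal sketch that minimizes a two-variable objective subject to one inequality constraint using SciPy. The objective f, the constraint g, the bound b = 2, and the starting point are all invented for illustration; they are not from the slides.

```python
# Minimal sketch of the notation above: minimize y = f(x1, x2)
# subject to g(x1, x2) <= b. Objective and constraint are illustrative only.
from scipy.optimize import minimize

def f(x):
    # Objective: f(x1, x2) = (x1 - 1)^2 + (x2 - 2)^2
    return (x[0] - 1.0) ** 2 + (x[1] - 2.0) ** 2

# Constraint g(x1, x2) = x1 + x2 <= 2, written as 2 - (x1 + x2) >= 0
# because SciPy expects inequality constraints in the form fun(x) >= 0.
cons = [{"type": "ineq", "fun": lambda x: 2.0 - (x[0] + x[1])}]

result = minimize(f, x0=[0.0, 0.0], method="SLSQP", constraints=cons)
print(result.x, result.fun)  # decision variables and objective at the optimum
```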
Complicating factors in optimization • Existence of multiple decision variables • Complex nature of the relationships between the decision variables and the associated income • Existence of one or more complex constraints on the decision variables
Types of optimization • Constrained: the objective function is maximized or minimized subject to constraints on the decision variables • Unconstrained: no constraints are imposed on the decision variables, and differential calculus can be used to analyze them
Least-Squares Methods for System Identification • System identification: determining a mathematical model for an unknown system by observing its input-output data pairs • System identification is required • To predict a system's behavior • To explain the interactions and relationships between inputs and outputs • To design a controller • System identification involves • Structure identification • Parameter identification
Structure identification • Apply a priori knowledge about the target system to determine a class of models within which the search for the most suitable model is conducted • y = f(u; θ), where y is the model's output, u is the input vector, and θ is the parameter vector
Parameter Identification • The structure of the model is known, and optimization techniques are applied to determine the parameter vector θ = θ̂ that best fits the observed data
Parameter identification • An input ui is applied to both the system and the model • The difference between the target system's output yi and the model's output ŷi is used to update the parameter vector θ to minimize the difference (a sketch of this loop follows) • System identification is not a one-pass process; structure and parameter identification need to be done repeatedly
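A minimal sketch of this update loop, assuming gradient descent on the squared output error for a one-parameter model. The "unknown" system, the model form, and the learning rate are illustrative assumptions, not part of the original material.

```python
import numpy as np

# Hypothetical unknown system: y = 3*u (the "target system").
def system(u):
    return 3.0 * u

# Model with unknown parameter theta: y_hat = theta * u.
def model(u, theta):
    return theta * u

theta = 0.0   # initial parameter guess
lr = 0.01     # learning rate (assumed for this sketch)
for _ in range(1000):
    u = np.random.uniform(-1.0, 1.0)  # input applied to system and model
    y = system(u)                     # target system's output y_i
    y_hat = model(u, theta)           # model's output y_hat_i
    error = y - y_hat                 # difference to be minimized
    theta += lr * error * u           # gradient step on the squared error
print(theta)  # should approach 3.0
```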
Classification of optimization algorithms • Derivative-based algorithms • Derivative-free algorithms
Characteristics of derivative-free algorithms • Derivative freeness: rely on repeated evaluations of the objective function rather than on its derivatives • Intuitive guidelines: concepts are based on nature's wisdom, such as evolution and thermodynamics • Slowness: generally slower than derivative-based methods • Flexibility • Randomness: stochastic, which makes them candidates for global optimization • Analytic opacity: knowledge about them is based mostly on empirical studies • Iterative nature: solutions are improved over successive iterations
Characteristics of derivative-free algorithms • Stopping condition of iteration: let k denote an iteration count and fk denote the best objective function value obtained at count k; the stopping condition may depend on • Computation time • The optimization goal • Minimal improvement (fk-1 - fk falls below a threshold) • Minimal relative improvement (a sketch of these tests follows)
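A sketch of the improvement-based stopping tests named above; the tolerance values are assumptions chosen for illustration.

```python
# Stop when the improvement f_{k-1} - f_k, or the improvement relative to
# f_{k-1}, falls below a threshold. Tolerances here are illustrative.
abs_tol = 1e-6   # minimal improvement threshold
rel_tol = 1e-6   # minimal relative improvement threshold

def should_stop(f_prev, f_k):
    improvement = f_prev - f_k
    if improvement < abs_tol:                            # minimal improvement
        return True
    if improvement / max(abs(f_prev), 1e-12) < rel_tol:  # minimal relative improvement
        return True
    return False

print(should_stop(1.0, 0.5), should_stop(1.0, 0.9999999))  # False, True
```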
Least-Squares Estimator • The method of least squares is a standard approach to the approximate solution of overdetermined systems • Least squares: the overall solution minimizes the sum of the squares of the errors made in solving every single equation • Application: data fitting
Types of Least Squares • Least Squares • Linear: the model is a linear combination of the parameters • The model may represent a straight line, a parabola, or any other linear combination of basis functions • Non-linear: the parameters appear inside nonlinear functions, such as β² or e^(βx) • If the derivatives with respect to the parameters are either constant or depend only on the values of the independent variable, the model is linear; otherwise it is non-linear
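As a sketch of linear least squares applied to data fitting, the snippet below fits a straight line y = β0 + β1·x to noisy synthetic data with NumPy; the data-generating coefficients and noise level are invented for the example.

```python
import numpy as np

# Synthetic overdetermined system: more equations (points) than unknowns.
x = np.linspace(0.0, 10.0, 50)
y = 2.0 + 0.5 * x + np.random.normal(scale=0.3, size=x.size)

# Design matrix for the linear model y = b0 + b1*x.
A = np.column_stack([np.ones_like(x), x])

# The least-squares solution minimizes ||A @ beta - y||^2 over beta.
beta, residuals, rank, sv = np.linalg.lstsq(A, y, rcond=None)
print(beta)  # approximately [2.0, 0.5]
```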
Linear regression with one variable: Model representation
Housing Prices (Portland, OR) • [Scatter plot: price (in $1000s) vs. size (ft²)] • Supervised Learning: given the "right answer" for each example in the data • Regression problem: predict real-valued output
Training set of housing prices (Portland, OR) • Notation: • m = number of training examples • x's = "input" variable / features • y's = "output" variable / "target" variable
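A tiny sketch that pins down this notation in code; the sizes and prices are made-up example values.

```python
# Tiny illustrative training set: x = size (ft²), y = price (in $1000s).
x = [2104, 1416, 1534, 852]   # "input" variable / feature
y = [460, 232, 315, 178]      # "output" / "target" variable

m = len(x)            # m = number of training examples (4 here)
print(m, x[0], y[0])  # the first training example
```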
Training Set → Learning Algorithm → h • The hypothesis h maps the size of a house to an estimated price • How do we represent h? • hθ(x) = θ0 + θ1x • Linear regression with one variable (univariate linear regression)
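A minimal sketch of this hypothesis as a function; the parameter values passed in are placeholders, not fitted values.

```python
def h(x, theta0, theta1):
    # Univariate linear regression hypothesis: h_theta(x) = theta0 + theta1 * x
    return theta0 + theta1 * x

# Toy parameter values, chosen only to show the call.
print(h(1500, 50.0, 0.2))  # estimated price (in $1000s) for a 1500 ft² house
```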
Linear regression with one variable: Cost function
Training Set • Hypothesis: hθ(x) = θ0 + θ1x • θ0, θ1: parameters • How do we choose the θ's?
Idea: choose θ0, θ1 so that hθ(x) is close to y for our training examples (x, y)
Linear regression with one variable: Cost function intuition I
Simplified • Hypothesis: hθ(x) = θ1x (θ0 = 0) • Parameter: θ1 • Cost function: J(θ1) = (1/2m) Σ (hθ(x(i)) - y(i))² • Goal: minimize J(θ1)
[Plots: left, hθ(x) as a function of x for fixed values of θ1; right, the cost J(θ1) as a function of the parameter θ1]
Linear regression with one variable: Cost function intuition II
Hypothesis: hθ(x) = θ0 + θ1x • Parameters: θ0, θ1 • Cost function: J(θ0, θ1) = (1/2m) Σ (hθ(x(i)) - y(i))² • Goal: minimize J(θ0, θ1) (a code sketch of J follows)
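A minimal sketch of this cost function in code, using the squared-error form given above; the toy data in the call are invented.

```python
import numpy as np

def cost(theta0, theta1, x, y):
    # J(theta0, theta1) = (1/2m) * sum((h(x_i) - y_i)^2)
    x, y = np.asarray(x, dtype=float), np.asarray(y, dtype=float)
    m = x.size
    predictions = theta0 + theta1 * x
    return np.sum((predictions - y) ** 2) / (2 * m)

print(cost(0.0, 0.5, [1, 2, 3], [1, 2, 3]))  # toy data; J > 0 since theta1 != 1
```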
[Plots: left, hθ(x) (price in $1000s vs. size in ft²) for fixed θ0, θ1; right, J(θ0, θ1) as a function of the parameters, shown as a surface/contour plot]
Linear regression with one variable: Gradient descent
Have some function J(θ0, θ1) • Want: minimize J(θ0, θ1) over θ0, θ1 • Outline: • Start with some θ0, θ1 • Keep changing θ0, θ1 to reduce J(θ0, θ1) until we hopefully end up at a minimum (a sketch follows)
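A minimal sketch of this outline for the two-parameter cost J(θ0, θ1) above, assuming a fixed learning rate and simultaneous updates of the two parameters; the data and the learning rate are illustrative assumptions.

```python
import numpy as np

# Toy data: y is roughly 1 + 2x.
x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([1.1, 2.9, 5.2, 6.8])

theta0, theta1 = 0.0, 0.0  # start with some theta0, theta1
alpha, m = 0.05, x.size    # learning rate (assumed) and example count

for _ in range(2000):
    err = (theta0 + theta1 * x) - y  # h(x_i) - y_i for every example
    # Simultaneous update: compute both gradients of J, then step.
    grad0 = err.sum() / m
    grad1 = (err * x).sum() / m
    theta0 -= alpha * grad0
    theta1 -= alpha * grad1

print(theta0, theta1)  # hopefully near a minimum: about 1.0 and 2.0
```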