Inverse Function Theorem and Implicit Function Theorem Dr. M. Asaduzzaman Professor Dept. of Mathematics University of Rajshahi Email: md_asaduzzaman@hotmail.com
Contents • Euclidean Spaces • Normed Linear Spaces • Metric Induced From Norms • Open Sets in ℝn • Continuous Functions • Directional and Partial Derivatives • Derivatives • Jacobian and Regular Maps • Inverse Function Theorem • Implicit Function Theorem
Euclidean Spaces
Definition: A nonempty set F with addition and multiplication defined in F is called a field if the following conditions hold:
i) x+y, xy ∈ F for all x, y ∈ F
ii) x+y=y+x and xy=yx for all x, y ∈ F
iii) (x+y)+z=x+(y+z) and (xy)z=x(yz) for all x, y, z ∈ F
iv) there exist two elements 1 and 0 in F such that x+0=x and x·1=x for all x ∈ F
v) for every x ∈ F there exists an element −x ∈ F such that x+(−x)=0, and for every nonzero x ∈ F there exists an element x⁻¹ ∈ F such that xx⁻¹=1
vi) x(y+z)=xy+xz for all x, y, z ∈ F.
Example 1. ℝ, the set of all real numbers, is a field under the usual addition and multiplication. This field is called the real field.
Example 2. ℂ, the set of all complex numbers, is a field under addition and multiplication of complex numbers. This field is called the complex field.
Euclidean Spaces
Definition: A nonempty set V with addition and scalar multiplication defined in V is called a vector space over the field F if for all x, y, z ∈ V and α, β ∈ F the following conditions hold:
i) x+y, αx ∈ V
ii) x+y=y+x
iii) (x+y)+z=x+(y+z)
iv) there exists an element 0 in V such that x+0=x for all x ∈ V
v) for every x ∈ V there exists an element −x ∈ V such that x+(−x)=0
vi) α(x+y)=αx+αy and (α+β)x=αx+βx
vii) (αβ)x=α(βx)
viii) 1x=x
The elements of V are called vectors and those of F are called scalars.
Euclidean Spaces
Example: Every field is a vector space over itself.
Example: Let n be a positive integer. Then the set ℝn={(x1,x2,…,xn): xi ∈ ℝ, 1≤i≤n} is a vector space over the real field if we define addition and scalar multiplication componentwise.
Definition: We define the so-called "inner product" (or scalar product) of two vectors x=(x1,x2,…,xn), y=(y1,y2,…,yn) in ℝn by x·y = x1y1+x2y2+…+xnyn and the norm of x by |x| = (x1²+x2²+…+xn²)^(1/2). The vector space ℝn with the above inner product and norm is called n-dimensional Euclidean space. We then have the following theorem:
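As a quick numerical illustration of the inner product and norm just defined, here is a minimal NumPy sketch (the vectors x and y are arbitrary illustrative values, not taken from the slides):

```python
import numpy as np

# Two arbitrary vectors in R^3 (illustrative values only)
x = np.array([1.0, 2.0, 2.0])
y = np.array([3.0, 0.0, 4.0])

dot = np.dot(x, y)               # x.y = x1*y1 + x2*y2 + x3*y3 = 11
norm_x = np.sqrt(np.sum(x**2))   # |x| = (x1^2 + x2^2 + x3^2)^(1/2) = 3

print(dot, norm_x)
print(np.isclose(norm_x, np.linalg.norm(x)))   # True: agrees with NumPy's built-in Euclidean norm
```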
Euclidean Spaces
Theorem: Suppose x, y, z ∈ ℝn, and α is real. Then
(a) |x|≥0;
(b) |x|=0 iff x=0;
(c) |αx|=|α||x|;
(d) |x·y|≤|x||y|;
(e) |x+y|≤|x|+|y|;
(f) |x−z|≤|x−y|+|y−z|.
Based on properties (a), (b), (c) and (e) of the above theorem we define a norm on any vector space over the real or complex field.
Normed Linear Spaces
Definition: Let X be a vector space over the real or complex field K. Then a function ║·║: X→ℝ is called a norm on X if it satisfies the following properties:
(a) ║x║≥0;
(b) ║x║=0 iff x=0;
(c) ║αx║=|α|║x║;
(d) ║x+y║≤║x║+║y║.
A vector space over the real or complex field is called a linear space, and a linear space with a norm is called a normed linear space.
Normed Linear Spaces
Example: Let X = ℝn and let ║·║: X→ℝ be defined by ║x║ = |x1|+|x2|+…+|xn|, where x=(x1,x2,…,xn) ∈ X. Then X is a normed linear space.
Example: Let X = ℝn and let ║·║: X→ℝ be defined by ║x║ = max{|x1|,|x2|,…,|xn|}, where x=(x1,x2,…,xn) ∈ X. Then X is a normed linear space.
Thus there are many ways to define a norm on the same linear space.
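The point that the same linear space carries many norms can be seen numerically; a small sketch (the vector is an arbitrary choice):

```python
import numpy as np

x = np.array([3.0, -4.0, 0.0])       # an arbitrary vector in R^3

euclidean = np.linalg.norm(x)        # (x1^2 + x2^2 + x3^2)^(1/2) = 5.0
one_norm  = np.sum(np.abs(x))        # |x1| + |x2| + |x3|         = 7.0
max_norm  = np.max(np.abs(x))        # max{|x1|, |x2|, |x3|}      = 4.0

print(euclidean, one_norm, max_norm)  # three different values of ||x|| for the same x
```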
Metric Induced from Norms
For any normed linear space we have the following theorem:
Theorem: Suppose X is a normed linear space. Then for any x, y, z ∈ X we have ║x−y║ ≤ ║x−z║ + ║y−z║.
Thus if in a normed linear space X we define d(x,y) = ║x−y║, then d satisfies all the conditions of a metric. This metric is called the metric induced from the norm. It is clear that different norms on the same linear space induce different metrics.
Open Sets in ℝn
We now consider only the Euclidean norm and the metric induced from the Euclidean norm.
Definition: Let x0 ∈ ℝn and r>0 be given. We define the set S(x0,r) = {x ∈ ℝn: |x−x0|<r} and call it a neighborhood of x0 of radius r.
From the definition it is clear that the neighborhood changes with x0 and r.
Example: In ℝ1 neighborhoods are open intervals and in ℝ2 neighborhoods are open discs.
Open Sets in ℝn
[Figure: the neighborhood S(x0, r), an open disc of radius r centered at x0]
Open Sets in ℝn
Definition: Let E be a subset of the Euclidean space ℝn. A point x0 ∈ E is said to be an interior point of E if there exists a neighborhood S(x0,r) of x0 such that S(x0,r) ⊆ E.
Definition: A subset E of the Euclidean space ℝn is called an open set if every point of E is an interior point of E.
Example: For any x0 ∈ ℝn and for any r>0 the neighborhood S(x0,r) is an open set.
Example: A union of any number of neighborhoods is an open set.
Example: A nonempty finite subset of ℝn cannot be an open set.
Continuous functions
Definition: Let E be a subset of the Euclidean space ℝn. A point x0 ∈ ℝn is said to be a limit point of E if every neighborhood of x0 contains infinitely many points of E.
Definition: Let E be a subset of ℝn and let f: E→ℝm be a function. Let x0 be a point of E that is also a limit point of E. Then f is said to be continuous at x0 if, given ε>0, there exists a δ>0 such that |f(x)−f(x0)|<ε whenever |x−x0|<δ and x ∈ E. If f is continuous at every point of E then f is called continuous on E.
Continuous functions
Example: The function f(x,y) = 1−x²−y² is continuous on ℝ2.
Example: The function is not continuous at (0,0).
Directional and Partial Derivatives
Definition: Let f be a real-valued function defined on an open subset E of ℝn and let x0 ∈ E be given. We define the directional derivative of f at x0 in the direction of the unit vector u to be
lim_(t→0) [f(x0+tu) − f(x0)]/t,
provided the limit exists. It will be denoted by Du f(x0).
Directional and Partial Derivatives
Example: Let f be a suitably chosen function. Then the directional derivatives of f exist only in the four directions (1/2, 1/2).
Definition: The partial derivatives of f are defined as the directional derivatives in the directions of the standard basis vectors e1, e2, …, en. The i-th partial derivative of f at x is usually denoted by Di f(x), or ∂f/∂xi. Thus
Di f(x) = lim_(t→0) [f(x+tei) − f(x)]/t,
provided the limit exists.
Directional and Partial Derivatives
Problem: For the given function f, find the directional derivative at e2 in an arbitrary direction u.
Solution: Let u = (cos θ, sin θ) and apply the limit definition of Du f(e2).
Directional and Partial Derivatives
The following example shows that the existence of partial derivatives at a point does not imply that directional derivatives exist in all directions.
Example: Let f(x,y) = (xy)^(1/3) and u = (cos θ, sin θ). Then
[f(t cos θ, t sin θ) − f(0,0)]/t = (t² cos θ sin θ)^(1/3)/t = t^(−1/3)(cos θ sin θ)^(1/3),
and as t→0 the limit exists and equals 0 if and only if sin θ cos θ = 0, that is, if and only if u = e1, e2, −e1 or −e2.
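The behaviour of the difference quotient in this example can also be checked numerically; a rough sketch (the test directions and step sizes are arbitrary):

```python
import numpy as np

def f(x, y):
    return np.cbrt(x * y)                  # (xy)^(1/3), using the real cube root

def quotient(u, t):
    # [f(t*u1, t*u2) - f(0, 0)] / t  for a unit vector u = (u1, u2)
    return f(t * u[0], t * u[1]) / t

e1 = (1.0, 0.0)
e2 = (0.0, 1.0)
diag = (1 / np.sqrt(2), 1 / np.sqrt(2))    # a direction that is not a coordinate axis

for u in (e1, e2, diag):
    print([quotient(u, t) for t in (1e-2, 1e-4, 1e-6)])
# Along e1 and e2 the quotient is 0 for every t, so the directional derivative is 0.
# Along the diagonal the quotient grows like t^(-1/3), so the limit does not exist.
```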
Derivatives
In order to define derivatives of a function whose domain is an open subset of ℝn, let us take another look at the familiar case n=1, and let us see how to interpret the derivative in that case in a way which will naturally extend to n>1.
If f is a real function with domain (a,b) ⊆ ℝ1 and if x ∈ (a,b), then f′(x) is usually defined to be the real number
f′(x) = lim_(h→0) [f(x+h) − f(x)]/h,
provided the limit exists. Thus
(1) f(x+h) − f(x) = f′(x)h + r(h),
where the remainder r(h) is small in the sense that lim_(h→0) r(h)/h = 0.
Derivatives
Thus (1) expresses the difference f(x+h) − f(x) as the sum of the linear function that takes h to f′(x)h, plus a small remainder. We can therefore regard the derivative of f at x, not as a real number, but as the linear operator on ℝ1 that takes h to f′(x)h.
Let us next consider a function f that maps (a,b) ⊆ ℝ1 into ℝm. In that case, f′(x) was defined to be that vector y ∈ ℝm for which
lim_(h→0) [ (f(x+h) − f(x))/h − y ] = 0.
We can again rewrite this in the form
(2) f(x+h) − f(x) = hy + r(h),
where |r(h)|/h → 0 as h→0. The main term on the RHS of (2) is again a linear function of h.
Derivatives
Each y ∈ ℝm induces a linear transformation of ℝ1 into ℝm, by associating to each h ∈ ℝ1 the vector hy ∈ ℝm. This identification of ℝm with L(ℝ1,ℝm) allows us to regard f′(x) as a member of L(ℝ1,ℝm). Thus, if f is a differentiable mapping of (a,b) ⊆ ℝ1 into ℝm, and if x ∈ (a,b), then f′(x) is the linear transformation of ℝ1 into ℝm that satisfies
(3) lim_(h→0) [f(x+h) − f(x) − f′(x)h]/h = 0,
or, equivalently,
(4) lim_(h→0) |f(x+h) − f(x) − f′(x)h|/|h| = 0.
Derivatives
Motivated by the above facts, we define the derivative of a function from ℝn into ℝm.
Definition: Suppose E is an open set in ℝn, f maps E into ℝm, and x ∈ E. If there exists a linear transformation A of ℝn into ℝm such that
(5) lim_(h→0) |f(x+h) − f(x) − Ah|/|h| = 0,
then we say that f is differentiable at x, and we write f′(x) = A. If f is differentiable at every x ∈ E, we say that f is differentiable in E.
Derivatives
The following theorem shows that if a function f is differentiable at x then f′(x) is unique.
Theorem: Suppose E is an open set in ℝn, f maps E into ℝm, and x ∈ E. If A1 and A2 are linear transformations of ℝn into ℝm such that (5) holds with A=A1 and with A=A2, then A1=A2.
Derivatives
We see that if a function f: E→ℝm is differentiable at a point x ∈ E then f′(x) ∈ L(ℝn,ℝm). Every m×n matrix with real entries determines a member of L(ℝn,ℝm), and every member of L(ℝn,ℝm) has a matrix representation with respect to given bases of ℝn and ℝm. In order to keep everything simple we will always consider the standard bases. Thus when we write f′(x), we shall think of f′(x) as an m×n matrix, namely the matrix of f′(x) with respect to the standard bases.
Derivatives
The following theorem extends the chain rule of derivatives of a single variable.
Theorem: Suppose E is an open set in ℝn, f maps E into ℝm, f is differentiable at x0 ∈ E, g maps an open set containing f(E) into ℝk, and g is differentiable at f(x0). Then the mapping F of E into ℝk defined by F(x)=g(f(x)) is differentiable at x0, and F′(x0) = g′(f(x0)) f′(x0).
[Figure: the composition F = g∘f, with f taking x0 to f(x0) and g taking f(x0) to g(f(x0))]
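The matrix form of the chain rule, F′(x0) = g′(f(x0)) f′(x0), can be checked on a concrete pair of maps; the f and g below are arbitrary illustrations chosen for this sketch, not maps taken from the slides:

```python
import numpy as np

x, y = 0.5, 1.5                       # an arbitrary evaluation point
u, v = x * y, x + y ** 2              # f(x, y) = (xy, x + y^2), so (u, v) = f(x, y)

Jf = np.array([[y, x],                # f'(x, y)
               [1.0, 2 * y]])
Jg = np.array([[np.cos(u), 0.0],      # g(u, v) = (sin u, uv), so this is g'(f(x, y))
               [v, u]])

# F = g o f, i.e. F(x, y) = (sin(xy), xy(x + y^2)); its Jacobian computed directly:
JF = np.array([[y * np.cos(x * y), x * np.cos(x * y)],
               [2 * x * y + y ** 3, x ** 2 + 3 * x * y ** 2]])

print(np.allclose(JF, Jg @ Jf))       # True: F'(x0) = g'(f(x0)) f'(x0)
```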
Derivatives
Suppose E is an open set in ℝn and f maps E into ℝm. Let {e1, e2, …, en} and {u1, u2, …, um} be the standard bases of ℝn and ℝm respectively. Let the component functions of f be f1, f2, …, fm. Then for x ∈ E, f(x) = (f1(x), f2(x), …, fm(x)), and each fi is a real-valued function on E. We denote the partial derivative of fi with respect to xj by the notation Djfi. These partial derivatives are called the partial derivatives of f. If we consider i as the row index and j as the column index, then the partial derivatives of f form an m×n matrix.
Derivatives
The following theorem shows that if f is differentiable at a point x then its partial derivatives exist at x, and they determine f′(x) completely.
Theorem: Suppose f maps an open set E ⊆ ℝn into ℝm, and f is differentiable at a point x ∈ E. Then the partial derivatives (Djfi)(x) exist, and the matrix of f′(x) is the m×n matrix [(Djfi)(x)], whose i-th row is ((D1fi)(x), …, (Dnfi)(x)).
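In computations the matrix [(Djfi)(x)] can be approximated column by column from difference quotients; a minimal sketch (the map f below is an arbitrary example):

```python
import numpy as np

def f(x):
    # An arbitrary map f: R^2 -> R^2 used only for illustration
    return np.array([x[0] ** 2 * x[1], 5.0 * x[0] + np.sin(x[1])])

def jacobian_fd(f, x, eps=1e-6):
    """Central-difference approximation of the matrix [(D_j f_i)(x)]."""
    cols = []
    for j in range(x.size):
        e = np.zeros(x.size)
        e[j] = eps
        cols.append((f(x + e) - f(x - e)) / (2 * eps))   # j-th column: partials w.r.t. x_j
    return np.column_stack(cols)

x0 = np.array([1.0, 2.0])
exact = np.array([[2 * x0[0] * x0[1], x0[0] ** 2],       # the analytic partial derivatives
                  [5.0, np.cos(x0[1])]])

print(np.allclose(jacobian_fd(f, x0), exact, atol=1e-5))  # True
```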
Derivatives
The following theorem shows that differentiability at a point is sufficient for a function to be continuous at that point.
Theorem: Suppose f maps an open set E ⊆ ℝn into ℝm, and f is differentiable at a point x0 ∈ E. Then f is continuous at x0.
Derivatives
The following example shows that the existence of partial derivatives at a point is not enough for a function to be differentiable, or even continuous, there.
Example: There is a function f for which both partial derivatives fx and fy exist at (0,0) but which is not continuous at (0,0).
Derivatives
Definition: A differentiable mapping f of an open set E ⊆ ℝn into ℝm is said to be continuously differentiable in E if f′ is a continuous mapping of E into L(ℝn,ℝm). More explicitly, it is required that to every ε>0 corresponds a δ>0 such that ║f′(y) − f′(x)║ < ε if y ∈ E and |x−y| < δ. If this is so, we also say that f is a 𝒞′-mapping, or that f ∈ 𝒞′(E).
Derivatives
The following theorem shows that the existence of continuous partial derivatives is a sufficient condition for a function to be continuously differentiable.
Theorem: Suppose f maps an open set E ⊆ ℝn into ℝm. Then f ∈ 𝒞′(E) if and only if the partial derivatives Djfi exist and are continuous on E for 1≤i≤m, 1≤j≤n.
Derivatives
The following example shows that continuity of the partial derivatives at a point is not a necessary condition for a function to be differentiable there.
Example: There is a function f such that fx and fy exist for all (x,y) and f is differentiable at every point on the line x=y, but the partial derivatives fx and fy are not continuous on the line x=y.
Derivatives
The existence of a derivative for a real-valued function of one variable is a fact of considerable interest. Geometrically it says that a tangent line exists. However, the fact that a real-valued function of several variables has partial derivatives is not of much interest by itself. For one thing, the existence of directional derivatives in the directions of the standard basis vectors e1, e2, …, en does not imply that directional derivatives exist in all directions. Moreover, the function need not have a tangent hyperplane even if there is a derivative in every direction. It can be shown that most of the basic properties of differentiable functions of one variable remain true for differentiable functions of several variables.
Derivatives
Now we will see the geometrical similarities:
Theorem: Suppose a real function f defined on (a,b) ⊆ ℝ1 is differentiable at a point x0 ∈ (a,b). Then the tangent line to the curve y=f(x) at (x0, f(x0)) has the equation y = f(x0) + f′(x0)(x−x0).
[Figure: the curve y=f(x) on (a,b) with its tangent line at the point (x0, f(x0))]
Derivatives
Theorem: Suppose a real function f defined on an open set E ⊆ ℝn is differentiable in E and x0 ∈ E is a point with f(x0)=c. Then the tangent hyperplane to the hypersurface z=f(x) at the point (x0, f(x0)) has the equation z = f(x0) + ∇f(x0)·(x − x0).
Derivatives
Theorem: Suppose a real function f defined on an open set E ⊆ ℝ2 is differentiable in E and (x0,y0) ∈ E is a point on the curve f(x,y)=c. Then the gradient vector ∇f = (fx, fy) of f at (x0,y0) represents the direction of the normal to the curve f(x,y)=c at (x0,y0).
[Figure: the level curve f(x,y)=c with the gradient vector at (x0,y0) normal to it]
Derivatives
Theorem: Suppose a real function f defined on an open set E ⊆ ℝn is differentiable at a point x0 ∈ E. Then the gradient vector ∇f(x0) = (D1f(x0), D2f(x0), …, Dnf(x0)) of f at x0 represents the direction of the normal to the hypersurface f(x)=c at x0.
Jacobian
Suppose f maps an open set E ⊆ ℝn into ℝn, and x0 ∈ E. Let f = (f1, …, fn), so that f1, …, fn are real-valued functions with domain E. We suppose that all first-order partial derivatives (Djfi)(x0) exist for 1≤i,j≤n. In this case we say that f possesses first-order partial derivatives at x0. The determinant det[(Djfi)(x0)] is called the Jacobian of the function f at the point x0 and is denoted by Jf(x0). Another common notation for the Jacobian is ∂(f1, …, fn)/∂(x1, …, xn).
Jacobian
Definition: A function f is said to be one-to-one, or invertible, if f(x1) and f(x2) are distinct whenever x1 and x2 are distinct points of Df.
Example: The linear map f(x,y) = (x−y, x+y) is one-to-one on the domain Df = ℝ2.
Example: The map f(x,y) = (x²−y², 2xy) is not one-to-one on the domain Df = ℝ2, since f(x,y) = f(−x,−y).
Definition: A function f may fail to be one-to-one, but be one-to-one on a subset S of Df. By this we mean that f(x1) and f(x2) are distinct whenever x1 and x2 are distinct points of S. In this case, f is not invertible, but if fS is defined on S by fS(x) = f(x), x ∈ S, and left undefined for x ∉ S, then fS is invertible. We say that fS is the restriction of f to S.
Jacobian
Definition: If a function f is one-to-one on a neighborhood of x0, we say that f is locally invertible at x0.
Example: The map f(x,y) = (e^x cos y, e^x sin y) is not one-to-one on the domain Df = ℝ2, since f(x,y) = f(x, y+2π). But this map is one-to-one on S = {(x,y): −∞ < x < ∞, 0 ≤ y < 2π}, and it is locally invertible at every point of the plane.
Remark: The above example shows that local invertibility at every point of the domain of a function does not imply that the function is invertible.
Jacobian
Definition: A function f: ℝn → ℝn is said to be regular on an open set S if f is one-to-one and continuously differentiable on S, and Jf(x) ≠ 0 for x ∈ S.
Example: If f(x,y) = (x−y, x+y), then Jf(x,y) = det[1 −1; 1 1] = 2 ≠ 0 for all (x,y) ∈ ℝ2. So f is regular on ℝ2.
Example: If f(x,y) = (x+y, 2x+2y), then Jf(x,y) = det[1 1; 2 2] = 0 for all (x,y) ∈ ℝ2. So f is not regular on any open subset of ℝ2.
Jacobian
Example: If f(x,y) = (e^x cos y, e^x sin y), then Jf(x,y) = det[e^x cos y −e^x sin y; e^x sin y e^x cos y] = e^(2x) ≠ 0 for all (x,y) ∈ ℝ2. So f is regular on any open set S on which f is one-to-one.
Remark: The above example shows that a nonzero Jacobian on the entire plane does not imply that the function is one-to-one on the entire plane.
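A short numerical check of this last example (the sample points are arbitrary): the Jacobian determinant equals e^(2x) everywhere, yet the map is not one-to-one on the plane.

```python
import numpy as np

def jacobian_det(x, y):
    # f(x, y) = (e^x cos y, e^x sin y)
    J = np.array([[np.exp(x) * np.cos(y), -np.exp(x) * np.sin(y)],
                  [np.exp(x) * np.sin(y),  np.exp(x) * np.cos(y)]])
    return np.linalg.det(J)

for (x, y) in [(0.0, 0.0), (1.0, 2.0), (-3.0, 5.0)]:
    print(np.isclose(jacobian_det(x, y), np.exp(2 * x)))    # True: Jf = e^(2x) > 0

# Still not one-to-one: f(x, y) = f(x, y + 2*pi)
x, y = 1.0, 2.0
p1 = np.array([np.exp(x) * np.cos(y), np.exp(x) * np.sin(y)])
p2 = np.array([np.exp(x) * np.cos(y + 2 * np.pi), np.exp(x) * np.sin(y + 2 * np.pi)])
print(np.allclose(p1, p2))                                   # True
```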
Inverse function theorem
The inverse function theorem states, roughly speaking, that a continuously differentiable mapping f is invertible in a neighborhood of any point x at which the linear transformation f′(x) is invertible.
Inverse function theorem
Theorem (Inverse function theorem): Suppose f is a continuously differentiable mapping of an open set E ⊆ ℝn into ℝn, f′(a) is invertible for some a ∈ E, and b = f(a). Then
(a) there exist open sets U and V in ℝn such that a ∈ U, b ∈ V, f is one-to-one on U, and f(U) = V;
(b) if g is the inverse of f [which exists by (a)], defined in V by g(f(x)) = x (x ∈ U), then g is continuously differentiable in V and g′(y) = {f′(g(y))}⁻¹ (y ∈ V).
Hypothesis: (i) E ⊆ ℝn is open. (ii) f: E → ℝn. (iii) f ∈ 𝒞′(E). (iv) a ∈ E is such that Jf(a) ≠ 0. (v) b = f(a).
Conclusion: (i) There exist open sets U and V in ℝn such that a ∈ U ⊆ E, b ∈ V ⊆ f(E), f is one-to-one on U, and f(U) = V. (ii) If g is the inverse of f defined in V, then g ∈ 𝒞′(V) and g′(y) = {f′(g(y))}⁻¹ (y ∈ V).
[Figure: f maps the neighborhood U ⊆ E of a onto V = f(U) ⊆ f(E), a neighborhood of b = f(a); g: V → U is the local inverse]
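The formula g′(y) = {f′(g(y))}⁻¹ can be verified numerically on a concrete regular map; the sketch below uses f(x,y) = (e^x cos y, e^x sin y) with the explicit local inverse g(u,v) = (½ log(u²+v²), atan2(v,u)) near b, which is an illustrative choice and not an example from the slides:

```python
import numpy as np

a = np.array([0.3, 0.7])              # an arbitrary point; Jf(a) = e^(2*0.3) != 0
b = np.array([np.exp(a[0]) * np.cos(a[1]),
              np.exp(a[0]) * np.sin(a[1])])             # b = f(a)

# f(x, y) = (e^x cos y, e^x sin y); its derivative (Jacobian matrix) at a:
Jf_a = np.exp(a[0]) * np.array([[np.cos(a[1]), -np.sin(a[1])],
                                [np.sin(a[1]),  np.cos(a[1])]])

# A local inverse near b is g(u, v) = (0.5*log(u^2 + v^2), atan2(v, u)); its derivative at b:
u, v = b
r2 = u ** 2 + v ** 2
Jg_b = np.array([[u / r2,  v / r2],
                 [-v / r2, u / r2]])

print(np.allclose(Jg_b, np.linalg.inv(Jf_a)))   # True: g'(b) = {f'(g(b))}^(-1) = {f'(a)}^(-1)
```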
Example: Let a = (1,2,1) and let f be a continuously differentiable mapping of an open subset of ℝ3 into ℝ3 with Jf(a) ≠ 0 and f(a) = (4,2,5). In this case, it is difficult to describe U or to find g = (f_U)⁻¹ explicitly; however, we know that f(U) is a nbd. of b = f(a) = (4,2,5), that g(b) = a = (1,2,1), and that g′(b) = {f′(a)}⁻¹.
Therefore g(y) ≈ g(b) + g′(b)(y − b) = a + {f′(a)}⁻¹(y − b) for y near b. Thus we have approximated g near b = (4,2,5).
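Since the slide's 3-dimensional example is not reproduced here, the same idea can be illustrated with the 2-dimensional map used above: near b the local inverse g is approximated by its linearization g(y) ≈ a + {f′(a)}⁻¹(y − b). A minimal sketch:

```python
import numpy as np

# f(x, y) = (e^x cos y, e^x sin y), a = (0.3, 0.7), b = f(a)   (illustrative values)
a = np.array([0.3, 0.7])
b = np.array([np.exp(a[0]) * np.cos(a[1]), np.exp(a[0]) * np.sin(a[1])])
Jf_a = np.exp(a[0]) * np.array([[np.cos(a[1]), -np.sin(a[1])],
                                [np.sin(a[1]),  np.cos(a[1])]])

def g_exact(q):
    # The exact local inverse near b (polar-coordinate formulas)
    u, v = q
    return np.array([0.5 * np.log(u ** 2 + v ** 2), np.arctan2(v, u)])

def g_approx(q):
    # g(y) ~ a + {f'(a)}^(-1) (y - b); use a linear solve instead of forming the inverse
    return a + np.linalg.solve(Jf_a, q - b)

y = b + np.array([0.01, -0.02])        # a point near b
print(g_exact(y))
print(g_approx(y))                      # agrees with g_exact(y) to within O(|y - b|^2)
```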
Implicit function theorem
If f is a continuously differentiable real function in the plane, then the equation f(x,y)=0 can be solved for y in terms of x in a nbd. of any point (a,b) at which f(a,b)=0 and ∂f/∂y ≠ 0. Similarly, x can be solved in terms of y near (a,b) if ∂f/∂x ≠ 0. For a simple example which illustrates the need for assuming ∂f/∂y ≠ 0, consider f(x,y) = x²+y²−1.
[Figure: the curve f(x,y)=0, the unit circle in the xy-plane]
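For f(x,y) = x²+y²−1 the implicit function theorem also gives the derivative of the solved function: wherever ∂f/∂y ≠ 0, dy/dx = −(∂f/∂x)/(∂f/∂y) = −x/y. A small numeric check on the upper branch (the sample point is an arbitrary choice):

```python
import numpy as np

def y_branch(x):
    # The branch of x^2 + y^2 - 1 = 0 with y > 0
    return np.sqrt(1.0 - x ** 2)

a = 0.6                    # arbitrary point with f_y = 2y != 0
b = y_branch(a)            # b = 0.8, so (a, b) lies on the circle

implicit_slope = -a / b    # dy/dx = -f_x/f_y = -(2x)/(2y) = -x/y = -0.75
h = 1e-6
numeric_slope = (y_branch(a + h) - y_branch(a - h)) / (2 * h)

print(implicit_slope, numeric_slope)   # both approximately -0.75
# At (1, 0) and (-1, 0), f_y = 0 and no branch y = y(x) exists, as the circle picture suggests.
```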
Implicit function theorem
We now consider mappings from ℝn+m to ℝm. It will be convenient to denote points in ℝn+m by (x,u) = (x1,…,xn, u1,…,um), where x ∈ ℝn and u ∈ ℝm. To motivate the problem we are interested in, we first ask whether the linear system of m equations in m+n variables
ai1x1 + ⋯ + ainxn + bi1u1 + ⋯ + bimum = 0 (i = 1,…,m)
determines u1,…,um uniquely in terms of x1,…,xn.
By rewriting the system in matrix form as
(1) Ax + Bu = 0,
where A = [aij] is the m×n matrix of the coefficients of the x-variables and B = [bij] is the m×m matrix of the coefficients of the u-variables, we see that equation (1) can be solved uniquely for u in terms of x if the square matrix B is nonsingular. In this case the solution is u = −B⁻¹Ax.
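A minimal numerical sketch of this linear case (the matrices A, B and the vector x are arbitrary illustrative data): when B is nonsingular, u = −B⁻¹Ax, computed below with a linear solve rather than an explicit inverse.

```python
import numpy as np

# m = 2 equations, n = 3 x-variables, m = 2 u-variables (arbitrary example data)
A = np.array([[1.0, 2.0, 0.0],
              [0.0, 1.0, 3.0]])          # m x n
B = np.array([[2.0, 1.0],
              [1.0, 1.0]])               # m x m, det(B) = 1, so B is nonsingular

x = np.array([1.0, -1.0, 2.0])

u = np.linalg.solve(B, -A @ x)           # u = -B^(-1) A x
print(u)                                  # [ 6. -11.]
print(np.allclose(A @ x + B @ u, 0.0))    # True: (x, u) satisfies Ax + Bu = 0
```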