All warfare is based on deception, Sun tzu, The Art of War.
A function of two variables f: ℝ x ℝ → ℝ assigns to each ordered pair in its domain a unique real number, e.g., Area = $\frac{1}{2}b·h$, z = f(x, y) = 2x + 3y, f(x, y) = x2 + y2, ex+y, etc.
Partial derivatives are derivatives of a function of multiple variables, say f(x1, x2, ···, xn) with respect to one of those variables, with the other variables held constant. They measure how the function changes as one variable changes, while keeping the others constant. $\frac{\partial f}{\partial x}(x_0, y_0) = \lim_{\Delta x \to 0} \frac{f(x_0+\Delta x, y_0)-f(x_0, y_0)}{\Delta x}$
Geometrically, the partial derivative $\frac{\partial f}{\partial x}(x_0, y_0)$ at a point (x0, y0) can be interpreted as the slope of the tangent line to the curve formed by the intersection of the surface defined by f(x,y) and the plane y = y0. Similarly, $\frac{\partial f}{\partial y}(x_0, y_0)$ represents the slope of the tangent line to the curve formed by the intersection of the surface defined by f(x,y) and the plane x = x0
$\frac{dw}{ds}\bigg|_{\vec{u}} = ∇w·\vec{u} = |∇w|·|\vec{u}|cos(θ) = |∇w|·cos(θ)$ where θ is the angle between the gradient and the given unit vector.
The gradient is the direction in which the function increases fastest (the direction of the steepest ascent) at a given point, and |∇w| = $\frac{dw}{ds}\bigg|_{\vec{u}=dir(∇w)}$.
The directional derivative is minimized when cos(θ) = -1 ↭ θ = 180 ↭ $\vec{u}$ is in the opposite direction of the gradient ∇w. Futhermore, $\frac{dw}{ds}\bigg|_{\vec{u}} = 0$ ↭ cos(θ) = 0 ↭ θ = 90° ↭ $\vec{u}$ ⊥ ∇w.
The Chain Rule. Let w = f(x, y, z) where x = x(t), y = y(t), and z = z(t) are functions of another variable t, then the derivate of f with respect to t is given by $\frac{dw}{dt} = f_x\frac{dx}{dt} +f_y\frac{dy}{dt}+f_z\frac{dz}{dt}, \frac{dw}{dt} = w_x\frac{dx}{dt} +w_y\frac{dy}{dt}+w_z\frac{dz}{dt} = ∇w·\frac{d\vec{r}}{dt}~$ where ∇w = ⟨wx, wy, wz⟩ and $\frac{d\vec{r}}{dt}=⟨\frac{d\vec{x}}{dt}, \frac{d\vec{y}}{dt}, \frac{d\vec{z}}{dt}⟩$
The gradient vector is defined as follows: ∇w = ⟨wx, wy, wz⟩ = $⟨\frac{\partial w}{\partial x}, \frac{\partial w}{\partial y}, \frac{\partial w}{\partial z}⟩$. And $\frac{d\vec{r}}{dt}$ is the vector of derivatives of x, y, and z with respect to t: $\frac{d\vec{r}}{dt} = ⟨\frac{dx}{dt}, \frac{dy}{dt}, \frac{dz}{dt}⟩$.
The gradient vector is a vector that points in the direction of the steepest increase of the function at a given point. It is perpendicular (orthogonal) to the level surfaces of the function. This means that the gradient points in the direction of the steepest ascent or increase, and there is no change in the function’s value along the level surface, making the gradient perpendicular to it.
Lagrange multipliers are a powerful method for finding the local maxima and minima of a function, say f(x, y, z), where the variables involved x, y, and z are not entirely independent, but instead they are constraint by an equation, g(x, y, z) = c. This method leverages the insight that at points where the function f(x, y, z) achieves an extremum (either a maximum or a minimum) subject to additional requirement or constraint g(x, y, z) = c, the gradient of f must be parallel to the gradient of g.
This relationship can be expressed mathematically as: ∇f = λ∇g where λ is the Lagrange multiplier.
Here, λ is known as the Lagrange multiplier. The gradients ∇f and ∇g represent the directions of the steepest ascent for the functions f and g, respectively. The condition that these gradients are parallel implies that at an extremum, the function f cannot increase or decrease in any direction that would remain within the constraint g(x, y, z) = c.
Let’s apply the method of Lagrange multipliers to a specific problem: finding the point on the hyperbola defined by the equation xy = 3 that is closest to the origin.
Our goal is to minimize the distance from a point (x, y) on the hyperbola to the origin (0, 0). The distance from (x, y) to the origin is given by the function: f(x, y) = $\sqrt{x^2+y^2}$.
However, because the square root function is monotonically increasing, minimizing f(x, y) is equivalent to minimizing its square: f(x, y) = x2 + y2 (Figure C and D).
The constraint is that the point must lie on the hyperbola g(x, y) = xy = 3. 💡It is like finding the highest or lowest point on a mountain or hill subject to the fact you can only walk along a trail.
Observing at the figure, at the minimum, the level curve of f is tangent to the parabola g. Lagrange multipliers involve finding points (x, y) where the level curves of f and g are tangent, and this is due to the fact that gradients are perpendicular to the contour lines of their respective functions. Therefore, the gradient of f is parallel to the gradient of g, expressed as ∇f // ∇g, meaning they are proportional to each other ↭ ∇f = λ∇g where λ is the Lagrange multiplier.
The gradient of a function at $\vec{a}$ represents the direction in which f increases at the greatest rate so, if one writes down a contour diagram with each contour representing a level set of f (levels of equal value), ∇f($\vec{a}$) represents the direction from $\vec{a}$ where the contours are least sparse.
In Calculus 1, to find the local maxima and minima of a function f(x), we look for points where the derivative f′(x) = 0. At these points, the tangent line to the curve y = f(x) is horizontal, indicating a potential extremum (critical point), e.g., f(x) = x2 +4x +4, f’(x) = 2x + 4 = 0 ⇒x = -2 and we need to verify it by the second derivative test.
When finding a minimum or maximum of a function f(x, y, z) subject to a constraint g(x, y, z) = c, we look for points where f does not increase or decrease when moving along the level curve or surface defined by g(x, y, z) = c. Essentially, the rate of change of f in any direction $\vec{a}$ tangent to the level curve g = c must be zero at a constrained extremum. This is because if it were not zero, one could move slightly in the direction of increase or decrease while staying on the level curve or surface g = c, indicating that the current point is not actually an extremum. Mathematically, this is represented as $\frac{df}{ds}\bigg|_{\vec{a}} = 0$ (Figure E)
$\frac{df}{ds}\bigg|_{\vec{a}} = ∇f·\vec{a} = 0$ ⇒ $\vec{a}$ is perpendicular to $∇f$, $\vec{a} ⊥ ∇f$ ⇒ ∇f is perpendicular to the level set of g = c. Since ∇g is also perpendicular to the level set of g = c, it follows ∇f || ∇g.
When extending Calculus 1’s critical points to functions of multiple variables, we seek points where the gradient ∇f is zero to find local extrema. However, when there are constraints, such as in the case of Lagrange multipliers, we look for points where the gradient of f is parallel to the gradient of the constraint function g. This parallelism reflects that the rate of change of f in the direction allowed by the constraint is zero, similar to how the derivative is zero at extrema in one-dimensional calculus.
Therefore, our problem is reformulated as a system of equations,
$\begin{cases} \frac{∂f}{∂x} = λ\frac{∂g}{∂x} \\ \frac{∂f}{∂y} = λ\frac{∂g}{∂y} \\ g = c \end{cases}$
This yields critical points that need to be evaluated to determine if they are maxima, minima, or saddle points, taking into account the constraint g = c. The method does not tell whether a critical point is a minimum, maximum, or saddle point.
$\begin{cases} f_x = λg_x \\ f_y = λg_y \\ g = c\end{cases}$
In our particular example, f(x, y) = x^2 + y^2, g = xy,
$\begin{cases} 2x = λy \\ 2y = λx \\ xy = 3\end{cases}$
$\begin{cases} 2x - λy = 0 \\ λx -2y = 0 \\ xy = 3\end{cases}$
$(\begin{smallmatrix}2 & -λ\\ λ & -2\end{smallmatrix})(\begin{smallmatrix}x\\ y\end{smallmatrix}) = (\begin{smallmatrix}0\\ 0\end{smallmatrix})$.
The trivial solution (0, 0) does not satisfy the constraint (0·0 ≠ 3). We have other solutions only if det(M) = 0 ↭ $|\begin{smallmatrix}2 & -λ\\ λ & -2\end{smallmatrix}|=-4+λ^2=0 ↭ λ^2 = 4 ↭ λ = ± 2$
The method of Lagrange multipliers has provided us with potential extremum points, but it does not tell us whether these points correspond to a minimum, maximum, or saddle point. In this particular example, since we’re dealing with a distance minimization problem, both points $(\sqrt{3},\sqrt{3})$ and $(-\sqrt{3},-\sqrt{3})$ are minima because they yield the same minimum distance from the origin on the hyperbola.
We need to find the dimensions x, y, and z of the largest rectangular box that lies in the first octant (where x, y, and z are all positive) with three of its faces on the coordinate planes xy-plane, yz-plane, and zx-plane, and one vertex on the plane 2x + 3y +5z = 90. The goal is to maximize the volume of this box.
The volume of the box is given by the function f(x ,y, z) = xyz, and it must satisfy the constraint g(x, y, z) =2x + 3y + 5z −90 = 0.
In this problem, we are tasked with finding the maximum and minimum values of the function f(x, y) = xy +1 given that x and y are constrained by the equation g(x, y) = x2+y2-1 = 0. This constraint represents a circle with a radius of 1 centered at the origin. To solve this problem, we will use the method of Lagrange multipliers, which helps find extrema of functions subject to constraints.
In this problem, we are tasked with finding the maximum and minimum values of the function f(x, y, z) = 3x2+y2-2z2 under the constraint g(x, y) = 3x +2y -8z = -50. The constraint represents a plane in 3D space. To solve this problem, we will use the method of Lagrange multipliers, which helps find extrema of functions subject to constraints.
To confirm whether this point represents a minimum, it can be checked that all other points that satisfies the constrain g(x, y, z) = 3x +2y-8z = -50, e.g., (4, -3, 7), (0, 11, 9), their values is less than -100 (f(4, -3, 7) = f(0, 11, 9) = -41 > -100)
We are tasked with constructing a pyramid that has a specific triangular base and a predetermined volume. Our objective is to minimize the total surface area of the pyramid while maintaining these constraints. (Figure i, ii).
Understanding the problem. The volume V of pyramid with a triangular base is given by 1⁄3Areabase·h and both Areabase, the area of the triangular base, and h, the height of the pyramid are fixed or known, hence the height h can be determined and expressed in terms of the volume and the base area h = $\frac{V}{Area_{base}}$
Surface Area of the Pyramid. The total surface area of the pyramid consists of the area of the triangular base (which is fixed) and the areas of the three triangular faces (sides) that connect the apex to each edge of the base. Let u1, u2, and u3 be the distances from Q to the sides of the triangular base (P1, P2, and P3) where Q(x, y, 0) is the point on the base of the pyramid and P(x, y, h) is the peak.
We need to minimize the total surface area of the pyramid. The surface area consists of the areas of the three triangular sides (the area of the base is fixed). The area of each triangular side can be expressed as: $A_{side, i} = \frac{1}{2}a_i\sqrt{u_i^2+h^2}$ where ai is the length of the i-th side of the base and $\sqrt{u_i^2+h^2}$ is the slant height of the triangular face. Notice that there is a right triangle with cathetus ui and h, and the hypotenuse is $\sqrt{u_i^2+h^2}$.
Therefore, the total side surface area is $f(u_1, u_2, u_3) = \frac{1}{2}a_1\sqrt{u_1^2+h^2}+\frac{1}{2}a_2\sqrt{u_2^2+h^2}+\frac{1}{2}a_3\sqrt{u_3^2+h^2} = $.
Setting up the Optimization Problem. The constraint is the fixed area of the triangular base, Areabase = g(u1, u2, u3) = constant. Each of these sides have base, height: $(a_1,u_1), (a_2,u_2),~ and~ (a_3,u_3)$ respectively, so the total Areabase = $\frac{1}{2}a_1u_1 + \frac{1}{2}a_2u_2 + \frac{1}{2}a_3u_3$
Applying Lagrange Multipliers To minimize the surface area subject to the constraint on the base area, we apply the method of Lagrange multipliers. This method involves finding the points where the gradients of the surface area function f(u1, u2, u3) = S and the constraint function g(u1, u2, u3) are parallel. Mathematically, this condition is expressed as: ∇f = λ ∇fg.
Compute the Gradients. Let’s compute the partial derivatives (gradients) of f and g. $\frac{∂f}{u_1} = \frac{1}{2}a_1\frac{u_1}{\sqrt{u_1^2+h^2}}, \frac{∂f}{u_2} = \frac{1}{2}a_2\frac{u_2}{\sqrt{u_2^2+h^2}}, \frac{∂f}{u_3} = \frac{1}{2}a_3\frac{u_3}{\sqrt{u_3^2+h^2}}$. $\frac{∂g}{u_1} = \frac{1}{2}a_1, \frac{∂g}{u_2} = \frac{1}{2}a_2, \frac{∂g}{u_3} = \frac{1}{2}a_3$
Setting the System of Equations. Using the gradients calculated above, we set up the system of equations based on the relationship ∇f = λ∇g: $\frac{∂f}{u_1} = \frac{1}{2}a_1\frac{u_1}{\sqrt{u_1^2+h^2}} = λ\frac{1}{2}a_1 ↭ \frac{u_1}{\sqrt{u_1^2+h^2}} = λ,$ and similarly, $\frac{u_2}{\sqrt{u_2^2+h^2}} = λ, \frac{u_3}{\sqrt{u_3^2+h^2}} = λ ⇒ \frac{u_1}{\sqrt{u_1^2+h^2}} = \frac{u_2}{\sqrt{u_2^2+h^2}} = \frac{u_3}{\sqrt{u_3^2+h^2}}$
Solve the system of equations: $λ = \frac{u_1}{\sqrt{u_1^2+h^2}} = \frac{u_2}{\sqrt{u_2^2+h^2}} = \frac{u_3}{\sqrt{u_3^2+h^2}}$. Since the distance u1, u2, and u3 are proportional to each other, and all sides of the pyramid are symmetric in this configuration, it follows that: u1 = u2 = u3⇒The solution implies that Q is located at the incenter of the triangular base.
Recall that the incenter is the point where the three interior angle bisectors of a triangle meet. The incenter is equidistant from the sides of the triangle.
We are given a closed cylindrical jar with a fixed volume of 2000 cm³. The objective is to find the dimensions (radius r and height h) that will minimize the surface area of the jar.