The Joys Of Minimalism

Algebra Level 5

f a ( x , y ) = a x 2 + 4 a y 2 + 20 x y + 3 x + 6 y + 7 \large f_a(x,y)=ax^2+4ay^2+20xy+3x+6y+7

Find the smallest value of the parameter a a such that f a ( x , y ) f_a(x,y) attains a global minimum.

Enter 666 if you come to the conclusion that no such smallest value of a a exists.


This problem was commissioned by Comrade Pi Han Goh.


The answer is 5.

This section requires Javascript.
You are seeing this because something didn't load right. We suggest you, (a) try refreshing the page, (b) enabling javascript if it is disabled on your browser and, finally, (c) loading the non-javascript version of this page . We're sorry about the hassle.

7 solutions

Relevant wiki: Hessian Matrix

The function is : f a ( x , y ) = a x 2 + 4 a y 2 + 20 x y + 3 x + 6 y + 7 \large f_a(x,y)=ax^2+4ay^2+20xy+3x+6y+7

Finding the first partial derivatives gives us : { 2 a x + 20 y + 3 = 0 8 a y + 20 x + 6 = 0 \begin{cases} 2ax+20y+3=0 \\ 8ay+20x+6 = 0 \end{cases}

Taking the Hessian we see,

Δ 2 f a ( x , y ) = ( 2 a 20 20 8 a ) \Delta^2f_a(x,y) = \begin{pmatrix} 2a & 20 \\ 20 & 8a \end{pmatrix}

For attaining a global minima it must be Positive Semi-Definite which implies in turn f f should be convex throughout the domain.

The principal minors must be non-negative , 16 a 2 400 0 16a^2-400\ge 0 & 2 a , 8 a 0 2a,8a\ge 0 . We get on solving a 5 a\ge 5

{ For a=5, f 5 ( x , y ) = ( x + 2 y ) ( 5 x + 10 y + 3 ) + 7 For a>5, f a ( x , y ) is continously increasing for all positive a \begin{cases} \text{For a=5, } f_{5}(x,y)=(x+2y)(5x+10y+3)+7 \\ \text{For a>5, }f_{a}(x,y)\text{ is continously increasing for all positive a} \end{cases}

There are critical points in this case as the coefficient vector of the linear terms [ 3 , 6 ] T [3,6]^T is in the range of the Hessian. So a = 5 a=5 gives local as well as global minimum for the function as the function f f is convex & it's Hessian is positive semi-definite.

You are doing good work (+1), but I'm still trying to fully understand your solution.

Let me ask you, if we were to replace the term 3 x 3x in f a ( x , y ) f_a(x,y) by 4 x 4x , what would your answer be?

Otto Bretscher - 5 years, 1 month ago

Log in to reply

I see that this tiny change brings some new facts. Now I need to learn these from you that :

1) Taking first partial derivative we get the equations as : { 2 a x + 20 y + 4 = 0 8 a y + 20 x + 6 = 0 \begin{cases} 2ax+20y+4=0 \\ 8ay+20x+6 = 0 \end{cases} . Proceeding like same we obtain a 5 a\ge 5 . But, the equations are inconsistent. There do not exists x , y x,y that satisfy these.

2) In the original problem the equations satisfied the property of having infinite number of solutions ( x , y ) (x,y) . So as f f is convex & increasing through the domain , the entire equation depends on a a and we get the smallest such a a as 5.

What I am not able to understand is If the equations are inconsistent then what should we say ? There aren't any stationary points and the function attains it's global minimum at a = 5 a=5

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

I'm getting ready for work (it's 6:30 am at the US East coast)... I will answer fully in the evening unless somebody else does.

Let me just give you a simple analogy: The function g a ( x ) = a x 2 + 7 g_a(x)=ax^2+7 was a global minimum for a 0 , a\geq 0, but the function h a ( x ) = a x 2 + 3 x + 7 h_a(x)=ax^2+3x+7 has a minimum for a > 0 a > 0 only.

The Hessian being positive semi-definite does not guarantee the existence of a minimum; you have to consider the linear terms also... you need a critical point.

Otto Bretscher - 5 years, 1 month ago

Log in to reply

@Otto Bretscher Oh I see sir. I am trying to modify my solution a bit if possible considering this fact that the linear terms are also necessary. I will be waiting for your solution. And yeah it took me a while to understand that line since it's already evening here ;)

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

@Aditya Narayan Sharma You can easily make your solution complete by adding a little something to the paragraph "So a = 5 a=5 gives local as well as global minimum for the function as the function is convex & it's Hessian is positive semi-definite." You need to add that there are critical points in this case (because the coefficient vector [ 3 , 6 ] T [3, 6]^T of the linear terms is in the image (or range) of the Hessian).

Otto Bretscher - 5 years, 1 month ago

Log in to reply

@Otto Bretscher Ok then I am adding that step in & I think I've understood why it was incomplete. I need to study on it harder . But how do we know they're in the range ?

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

@Aditya Narayan Sharma Well, you solve the linear system, as you said, or you observe that [ 3 , 6 ] T [3,6]^T is parallel to the columns of the Hessian.

One more tiny point, just to make your solution flawless: You need to include 8 a 8a in your list of the principal minors when you test for positive semi-definiteness.

Otto Bretscher - 5 years, 1 month ago

Log in to reply

@Otto Bretscher I have done that correction sir . But when do we actually need to work on the linear terms and how would we ? As you said that the linear terms must be considered so what will be the effect of considering them ? I mean how will it affect the minima or maxima .

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

@Aditya Narayan Sharma The sentence "So a = 5 a=5 gives local as well as global minimum for the function as the function is convex & it's Hessian is positive semi-definite" is misleading. As we discussed, the Hessian being positive semi-definite does not by itself guarantee the existence of a minimum.

More later. Obviously, the linear terms may affect the value of the minimum and the point(s) where it is attained... consider the simple example p ( x ) = a x 2 + b x + c p(x)=ax^2+bx+c again

Otto Bretscher - 5 years, 1 month ago

Log in to reply

@Otto Bretscher So should I remove that line (So a = 5 a=5 gives local ....) or it is fine as it is ? I've added that line referring to coefficient vector of linear terms already.

Yes in the example it is obvious that the linear term affects the minimum, but in our problem it doesn't. I learnt a whole lot of facts today.

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

@Aditya Narayan Sharma I would reverse the order of the two sentences in that paragraph. First state that there are critical points in the case a = 5 a=5 . Now, since the graph is convex, the function must attain its global minimum at those critical points.

Otto Bretscher - 5 years, 1 month ago

Log in to reply

@Otto Bretscher Oh I got it. I reversed them and it looks good now. I think it's okay now. Thanks sir again !

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

@Aditya Narayan Sharma Yes great! I'm about to teach a class on determinants... I will mention the Hessian as an example that the students have seen in their calculus class.

I wish all my (college) students would understand this stuff as well as you do ;)

Otto Bretscher - 5 years, 1 month ago

Log in to reply

@Otto Bretscher That will be great :D . I think my use of Hessian Inspired you to give the example ;)

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

@Aditya Narayan Sharma Exactly! See, I'm an algebraist by training (and by disposition), I find algebra "cleaner" than calculus, so, I'm thinking of the Hessian as twice the matrix of the quadratic form... the reasoning is equivalent.

Note that the Topic for this problem is Algebra ;)

Otto Bretscher - 5 years, 1 month ago

Log in to reply

@Otto Bretscher Your C a l g e b r a \mathbf{Calgebra} view is great ! The hessian as twice as the matrix of the quadratic form. Wow ! I would like to continue discussion on this . when you become free actually at the evening when it would be (12:00 PM) here and my mom would be shouting at me to go to bed :P .

Considering the topic as algebra the partial derivatives which is the most necessary step for the problem must be avoided. Can we use partial derivatives here ? But the topic is algebra. What should be the approach then ?

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

@Aditya Narayan Sharma On Mondays we have a colloquium on 4 pm, so, I will not get home before 6 pm my time. But I may have some free time during the day, unless the students keep pestering me with questions about determinants ;)

Otto Bretscher - 5 years, 1 month ago

Log in to reply

@Otto Bretscher So I will be awake by 7 PM(In your zone) as that would be nearly 5 AM here. The students must keep you pestering with questions, as I have done here one after another ;) and continuing to do so .

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

@Aditya Narayan Sharma I like and enjoy your line of questioning.... your curiosity and insight

Otto Bretscher - 5 years, 1 month ago

Log in to reply

@Otto Bretscher Thanks Comrade ! I hope you won't mind me calling you that . That word is an I n s p i r a t i o n \mathbf{Inspiration} from you ;)

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

@Aditya Narayan Sharma I'm always honoured to be called a Comrade! Thanks Comrade!

Otto Bretscher - 5 years, 1 month ago

Log in to reply

@Otto Bretscher Yeah definitely we had comrade !, So better I should wait for the next problem to have a conversation with you :)

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

@Aditya Narayan Sharma As Comrade @Khang Nguyen Thanh points out, a simple way to examine the case f 5 ( x , y ) f_5(x,y) is to complete the squares. To proceed systematically, it is sensible to multiply with 5 first (or with a a in general) and then collect all the terms involving x x :

5 f 5 ( x , y ) = 25 x 2 + 100 y 2 + 100 x y + 15 x + 30 y + 35 = ( 5 x + 10 y + 3 2 ) 2 + 35 9 4 5f_5(x,y)=25x^2+100y^2+100xy+15x+30y+35=(5x+10y+\frac{3}{2})^2+35-\frac{9}{4} 131 4 \geq \frac{131}{4}

There are quite a few ways to tackle this problem!

Otto Bretscher - 5 years, 1 month ago

@Aditya Narayan Sharma I think we are pretty much done with this problem... are we? We had a good dialogue indeed!

Otto Bretscher - 5 years, 1 month ago

We have: f a ( x , y ) = 5 x 2 + 20 x y + 20 y 2 + 3 ( x + 2 y ) + 7 + ( a 5 ) ( x 2 + 4 y 2 ) = 5 ( x + 2 y ) 2 + 3 ( x + 2 y ) + 7 + ( a 5 ) ( x 2 + 4 y 2 ) = 5 ( x + 2 y + 3 10 ) 2 + 131 20 + ( a 5 ) ( x 2 + 4 y 2 ) \begin{aligned} f_a(x,y)&=5x^2+20xy+20y^2+3(x+2y)+7+(a-5)(x^2+4y^2)\\ &=5(x+2y)^2+3(x+2y)+7+(a-5)(x^2+4y^2)\\ &=5\left(x+2y+\dfrac{3}{10}\right)^2+\dfrac{131}{20}+(a-5)(x^2+4y^2) \end{aligned}

If a < 5 a<5 , we can choose y = k ; x = 2 k 3 10 y=k; x=-2k-\dfrac{3}{10} and k + k\to+\infty .

This implies f a ( x , y ) = ( a 5 ) ( ( 2 k + 3 10 ) 2 + 4 k 2 ) + f_a(x,y)=(a-5)\left(\left(2k+\dfrac{3}{10}\right)^2+4k^2\right)\to +\infty when k + k\to+\infty

If a = 5 a=5 , we get f a ( x , y ) = 5 ( x + 2 y + 3 10 ) 2 + 131 20 131 20 f_a(x,y)=5\left(x+2y+\dfrac{3}{10}\right)^2+\dfrac{131}{20}\ge\dfrac{131}{20}

The equality holds when x + 2 y + 3 10 = 0 x+2y+\dfrac{3}{10}=0 .

So a = 5 \boxed{a=5} is the smallest value satisfied.

Your first step gives you ( 15 + a ) y 2 (15+a)y^2 , not 4 a y 2 4ay^2 as required.

Otto Bretscher - 5 years, 1 month ago

Log in to reply

I corrected it.

Khang Nguyen Thanh - 5 years, 1 month ago

Log in to reply

Yes, this works! (+1) A skillful completion of squares.

Otto Bretscher - 5 years, 1 month ago
Pi Han Goh
Apr 28, 2016

We first prove that a > 0 a> 0 : Suppose otherwise, that is suppose there exists a minimum value when a 0 a\leq 0 , then setting y = 0 y=0 shows that f a ( x , 0 ) = a x 2 + 3 x + 7 f_a(x,0) = ax^2 + 3x+7 is either a straight line or a parabola that concave downwards. Either way, neither them has a minimum value. Thus, a > 0 a> 0 must be true.

We can rewrite the matrix of the quadratic part of f a ( x , y ) f_a(x,y) as

A a = [ a 10 10 4 a ] A_a = \begin{bmatrix}{a} && {10} \\ {10} && {4a}\end{bmatrix}

By Cayley-Hamilton theorem , the eigenvalues of A a A_a satisfy the equation λ I 2 A = 0 | \lambda I_2 - A | = 0 , or equivalently,

0 = λ a 10 10 λ 4 a 0 = ( λ a ) ( λ 4 a ) 100 0 = λ 2 + λ ( 5 a ) + ( 4 a 2 100 ) . 0 = \begin{vmatrix}{\lambda - a} && {-10} \\ {-10} && {\lambda - 4a}\end{vmatrix} \quad \Leftrightarrow \quad 0= (\lambda - a)(\lambda - 4a)-100 \quad \Leftrightarrow \quad 0 = \lambda^2 + \lambda(-5a) + (4a^2-100) \; .

Applying the quadratic formula , we can solve for λ \lambda to get λ = 5 a ± 9 a 2 + 400 2 \lambda = \dfrac{5a\pm \sqrt{9a^2+400}}2 .

The necessary requirement for the function f a ( x , y ) f_a(x,y) to have a minimum value, is to have all its eigenvalues to be strictly non-negative.

Let λ 1 < λ 2 \lambda _1 < \lambda_2 denote these two eigenvalues. Because we have shown that a > 0 a> 0 , then λ 2 0 \lambda_2\geq 0 . We are left to determine the range of a a such that λ 1 0 \lambda_1 \geq 0 as well. So,

5 a 9 a 2 + 400 2 0 ( 25 a ) 2 9 a 2 + 400 a 5 . \dfrac{5a - \sqrt{9a^2+400}}2 \geq 0 \quad \Leftrightarrow \quad (25a)^2 \geq 9a^2 + 400 \quad \Leftrightarrow \quad a \geq 5 \; .

Thus, we know that there is a minimum value when a > 5 a>5 . What's left to prove is to determine whether a = 5 a=5 gives a minimum f a ( x , y ) f_a(x,y) :

f 5 ( x , y ) = 5 x 2 + 20 y 2 + 20 x y + 3 x + 6 y + 7 . f_5(x,y) = 5x^2 + 20y^2 + 20xy + 3x+6y + 7 \; .

We then find the kernel of this quadratic form:

[ 5 10 0 10 20 0 ] R 2 R 2 2 R 1 [ 5 10 0 0 0 0 ] ( x , y ) = ( 2 t , t ) , t R . \left [ \begin{array} { c c | c} 5 & 10 & 0 \\ 10 & 20 & 0 \end{array}\right ] \stackrel{R_2\to R_2 - 2R_1}{\huge \longrightarrow}\left [ \begin{array} { c c | c} 5 & 10 & 0 \\ 0 & 0 & 0 \end{array}\right ] \quad \Leftrightarrow \quad (x,y) = (-2t,t), t \in \mathbb R \; .

Now f 5 ( 2 t , t ) = 7 f_5(-2t,t) = 7 , showing that f 5 ( x , y ) f_5(x,y) has a minimum value. And we're done! We successfully proved that f a ( x , y ) f_a(x,y) has a minimum value when a 5 a\geq \boxed5 .

Yes, indeed, very nicely done! You got it, Comrade!

As you can see from the various solutions here, there are quite a few ways to show that f 5 ( x , y ) f_5(x,y) has a global minimum.

The most straightforward approach is to complete the squares, as Comrade @Khang Nguyen Thanh did.

We can also point out that the coefficient vector of the linear terms, [ 3 , 6 ] T [3,6]^T , is in the image (or range or column space) of the matrix, which guarantees the existence of a critical point; that was Comrade @Aditya Sharma 's approach.

All in all, we have some interesting and high-quality solutions here.

Otto Bretscher - 5 years, 1 month ago

Log in to reply

Me luv thiss question

Pi Han Goh - 5 years, 1 month ago

Log in to reply

As you wish , Comrade

Otto Bretscher - 5 years, 1 month ago

Log in to reply

@Otto Bretscher Is the expression f ( x , y , z , w ) = x 2 + 9 y 2 + 4 z 2 + w 2 6 x y + 4 x z 2 x w 12 y z + 6 y w 4 z w + x 3 y + 2 z + w f(x,y,z,w)=x^2+9y^2+4z^2+w^2-6xy+4xz-2xw-12yz+6yw-4zw+x-3y+2z+w

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

@Aditya Narayan Sharma Yes, isn't that what I typed? It's easy to get one of them wrong...

Otto Bretscher - 5 years, 1 month ago

So far so good... keep up the good work, Comrade!

One detail: There is no need to invoke Hamilton-Cayley here: The eigenvalues are the solutions of det ( λ I 2 A ) = 0 \det(\lambda I_2-A)=0 by definition .

Otto Bretscher - 5 years, 1 month ago

Log in to reply

Okay doneeeeeee!

By definition? Really? Okay then....

You got a typo there: it's Cayley-Hamilton, not Hamiltion-Cayley. That's like attributing Leibniz for discovering calculus. Ahahahahaha!

Pi Han Goh - 5 years, 1 month ago

The matrix for a = 5 a=5 is [ 5 10 10 20 ] \begin{bmatrix} 5 & 10\\ 10 & 20 \end{bmatrix}

Otto Bretscher - 5 years, 1 month ago

You need to fix a typo @Pi Han Goh ,

5 a 9 a 2 + 400 2 > 0 \dfrac{5a - \sqrt{9a^2+400}}2 > 0 would be 5 a 9 a 2 + 400 2 0 \dfrac{5a - \sqrt{9a^2+400}}2 \ge 0 . Else all is well.

Aditya Narayan Sharma - 5 years, 1 month ago

Log in to reply

FIXEEED!!!!!!

Pi Han Goh - 5 years, 1 month ago
Michael Mendrin
Apr 25, 2016

The 2nd order equation

a x 2 + 4 a y 2 + 20 x y + 3 x + 6 y + 7 = k a{x}^{2}+4a{y}^{2}+20xy+3x+6y+7=k

becomes a family of hyperbolas for a < 5 \left| a \right| <5 , because then

B 2 4 A C = 20 2 4 1 a 4 a = 400 16 a 2 > 0 {B}^{2}-4AC={20}^{2}-4\cdot 1a\cdot 4a=400-16{a}^{2}>0

Hence, we first seek if there is a value for a 5 a\ge 5 such that this reduces to a pair of lines. To find that, letting the equation equal k k , we solve for y y

y = 1 8 a ( 6 20 x ± ( ( 400 16 a 2 ) x 2 + ( 240 48 a ) x + 16 a k 112 a + 36 ) y=\dfrac { 1 }{ 8a } \left( -6-20x\pm \sqrt { (\left( 400-16{ a }^{ 2 } \right) { x }^{ 2 }+\left( 240-48a \right) x+16ak-112a+36 } \right)

We see that if a = 5 a=5 , then the radical becomes independent of x x , and the problem is answered. For a < 5 a<-5 , the 2nd order equation becomes a family of ellipses with a global maximum and no global minimum.

I detect a computational error in the 4th line where you claim that 16 a 2 = 4 a 2 16a^2=4a^2 . Correcting this will somewhat alter your argument... we will take it from there.

Your work shows that the equation f 5 ( x , y ) = k f_5(x,y)=k has solutions if k 131 20 k\geq \frac {131}{20} , which proves that there is a minimum in this case. But why does f a ( x , y ) f_a(x,y) fail to have a minimum when a < 5 a<5 ?

Otto Bretscher - 5 years, 1 month ago

Log in to reply

Okay, fixed. For 5 < a < 5 -5<a<5 , we have a family of hyperbolas. More specifically, we have a hyperbolic paraboloid, which doesn't have a global minimum anywhere. For a = 5 a=5 , then we have a parabolic cylinder, which does have a global minimum, which is in the form of a line. I'll come back to this later and look for more problems and clean up my solution, and maybe expand on it.

Michael Mendrin - 5 years, 1 month ago
Dharanya Lavanya
Apr 30, 2016

For the function (which is a second degree equation in 2 variables) to have a global minimum and a to be smallest , I think it must represent a parabola. So, H^2= A.B where 2H is the coefficient of xy , B is the coefficent of y^2 and A is the coefficient of x^2 . Therefore, a comes out to be 5.

Please improve my solution, I know I've answered it vaguely.

Sal Gard
Jun 1, 2016

Despite not knowing about Hessian matrices, here is my solution. Taking partials, 2ax+20y=-3, 8ay+20x=-6. Now substitute y in terms of x and a and one gets 2a^2x-15-50x+3a=0. Tracking the discrimant for a to real in terms of x yields eventually x=-3/20. When we track this discriminant, we are finding the smallest x such that there is a corresponding real a, as this will occur in an increasing multivariable function. Plug this back into the equation 2ax+20y=-3, with y substituted, yielding a^2-10a+25=0. And the solution is obvious.

Moderator note:

You have shown the calculations for there to be a local minimum. Why must this be the global minimum also? For example, y = x ( x 1 ) ( x + 1 ) y = x(x-1)(x+1) has a local minimum but doesn't have a global minimum.

In this case, the result still holds true because of the classification of quadratic forms.

You are losing me in the second line when you say "Tracking the discriminant..." Please be more explicit. How did you find x = 3 20 x=-\frac{3}{20} and what is the significance of that value?

Also, there are typos in both of your partials in the first line.

I would have loved to solve the problems you recently posted, including the one dedicated to me, if only the numbers involved weren't that huge. You are "number crazy" indeed! I don't see any way to do those in a reasonable amount of time without cheating and resorting to technology ;)

Otto Bretscher - 5 years ago

Log in to reply

Sorry. I am twelve. I have a lot to learn in life and I am in that stage of experimentation. Thank you for your consideration.

Sal Gard - 5 years ago

Log in to reply

You are doing great for your age! Keep up that enthusiasm and curiosity!

Otto Bretscher - 5 years ago

You have shown the calculations for there to be a local minimum. Why must this be the global minimum also? For example, y = x ( x 1 ) ( x + 1 ) y = x(x-1)(x+1) has a local minimum but doesn't have a global minimum.

In this case, the result still holds true because of the classification of quadratic forms.

Calvin Lin Staff - 5 years ago
Arnav Goel
May 4, 2016

I DONE IT IN A DIFFERENT WAY

CLEARLY THE EQUATION IS HOMOGENOUS OF 2 DEGREE FOR MIN (global min) ITS DETERMINANT MUST BE ZERO (pair of straight lines) solving determinant will give directly 5

You need to pay attention to the linear terms as well, as we have discussed in other solutions.

Otto Bretscher - 5 years, 1 month ago

0 pending reports

×

Problem Loading...

Note Loading...

Set Loading...