Maximize an Integral involving a probability distribution

Calculus Level 5

Let $p:[0, \infty)\to (0,\infty)$ be a continuous function such that $\int_{0}^{\infty}p(x) \, dx=1$ Over all such functions $p(x)$ , define the integral $I_p$ as follows $I_p= \int_{0}^{\infty} e^{-4x}\ln(p(x)) \, dx$ Find $\max_p I_p$

Hint : The inequality $e^x \geq 1+x$ might come handy.

This problem can be solved by using knowledge of high-school calculus only. Good luck!

The answer is 0.0965736.

3 solutions

Abhishek Sinha
Dec 7, 2015

Using the inequality $e^z \geq 1+z$ , for any function $p(x)$ satisfying the given properties, we have $\ln\frac{p(x)}{4 e^{-4x}} \leq \frac{p(x)}{4e^{-4x}}-1$ Multiplying both sides by $4e^{-4x}$ , we have $4 e^{-4x} \ln\frac{p(x)}{4 e^{-4x}} \leq p(x)-4e^{-4x}$ Integrating both sides in the interval $[0,\infty)$ , we have $\int_{0}^{\infty} 4 e^{-4x} \ln\frac{p(x)}{4 e^{-4x}} dx \leq \int_{0}^{\infty}\big( p(x)-4e^{-4x}\big) dx=1-1=0$ i.e., $4\int_{0}^{\infty} e^{-4x} \ln(p(x)) dx \leq \int_{0}^{\infty} 4 e^{-4x} \ln (4 e^{-4x})dx= \ln(4)-1$ Thus, we have the following upper-bound on $I_p$ $I_p \leq \frac{1}{4} (\ln(4)-1) \hspace{15pt} (*)$ By tracing the above series of inequalities backwards, we see that the upper-bound is achievable iff we have equality in the inequality we started with, i.e., $\frac{p(x)}{4 e^{-4x}}=1, \forall x \in [0,\infty)$ . i.e., the Exponential Distribution $p(x)=4e^{-4x}, x\geq 0$ achieves the upper-bound (*) on $I_p$ .

Nice solution! Yours is one of those proofs where you need to know what the answer is before you can construct the proof. You could guess $p(x) = 4e^{-4x}$ by maximizing $I_p$ over all negative exponentials $p_k(x) = ke^{-kx}$ , I suppose.

Alternatively, the Calculus of Variations could be used to extremize $I_p$ over all positive continuous functions $p$ whose integral is $1$ , and this approach would construct the fact that the maximum occurs for $p(x) = 4e^{-4x}$ .

Suppose that $p(x)$ is the function that minimizes $I$ . Let $q$ be any continuous function of compact support with integral equal to $0$ . Then there exists $k > 0$ such that $p + \lambda q$ is a positive function for all $|\lambda| < k$ , and we deduce that $F_q(\lambda) \,=\, I_{p +\lambda q}$ must have a turning point at $\lambda = 0$ , and hence $F_q'(0) \; = \; \int_0^\infty \frac{e^{-4x}}{p(x)} q(x)\,dx \; = \; 0 \;.$ Since $F_q'(0) = 0$ for all continuous functions $q$ of compact support that integrate to $0$ , we deduce that $\frac{e^{-4x}}{p(x)}$ must be constant (if not, it is fiddly but possible to construct a function $q(x)$ of compact support with integral $0$ for which $F_q'(0)$ is nonzero). Thus $p(x)$ must be a multiple of $e^{-4x}$ , and we are done.

Not High School calculus, of course, but useful. The Calculus of Variations can be used to solve all sorts of extremal problems of this sort.

Mark Hennings - 5 years, 6 months ago

Yes, this is inspired. The ``black-magic" that I actually used in constructing this proof is the non-negativity of KL divergence . I usually reserve the sledge-hammer method of calculus of variations for more difficult problems where the integral involves derivatives of unknown variables.

Abhishek Sinha - 5 years, 6 months ago

Plinio Sd
Dec 15, 2015

I did a discrete approach. Let us define the function $p_L(x)$ as $p_n$ , if $nL \leq x < (n+1)L$ .

Then, we must maximize $\begin{aligned} I_{p_{L}} &= \sum_{n=0}^{+\infty} \int_{nL}^{(n+1)L} e^{-4x} \ln(p_n) \\ &= \dfrac{1}{4}(1-e^{4L}) \sum_{n=0}^{+\infty} e^{-4nL}\ln(p_n) \end{aligned},$

with the constraint, $\sum_{n=0}^{+\infty} p_n L = 1.$

We can solve this problem using Lagrange multipliers. Writing the Lagrangian,

$\mathcal{L} = \dfrac{1}{4}(1-e^{4L}) \sum_{n=0}^{+\infty} e^{-4nL}\ln(p_n) + \lambda \left( \sum_{n=0}^{+\infty} p_n L - 1 \right),$

we know that $\dfrac{\partial \mathcal{L}}{\partial p_n} = 0$ , i.e,

$\begin{aligned} & \dfrac{1}{4} (1-e^{-4L}) e^{-4nL}\dfrac{1}{p_n} + \lambda L = 0\\ \Rightarrow & p_n = -\dfrac{1-e^{-4L}}{4\lambda L} e^{-4nL}. \end{aligned}$

Now we use the constraint to determine $\lambda$ ,

$\begin{aligned} & \sum_{n=0}^{+\infty} -\dfrac{1-e^{-4L}e^{-4nL}}{4\lambda L}L = 1 \\ \Rightarrow & \lambda = - \dfrac{1-e^{-4L}}{4}\sum_{n=0}^{+\infty} e^{-4nL} = -\dfrac{1}{4}. \\ \end{aligned}$

Therefore, $p_n = \dfrac{1-e^{-4L}}{L} e^{-4nL}$ .

Now we use the function $p_L(x)$ as we defined and make $L \to 0$ . This way, we find that $p(x) = 4 e^{-4x},$ which is continuous. Note that $p_L(x)$ is not continuous, so when we take the $\lim_{L \to 0} p_L(x) = p(x)$ , we may not necessarily find a continuous function $p(x)$ .

Finally, it is easy to calculate $I_p$ and we find $\dfrac{1}{4}(\ln(4) - 1)$ .

That's a really interesting approach, working from first principles. However, to be completely rigorous, we need to justify why $I_{p_n}\to I_p$ . Which convergence theorem can help us establish this ?

Abhishek Sinha - 5 years, 6 months ago

Well, that's a good question. I would try to use the dominated convergence theorem to exchange the limit of $L$ with the integral. We can use the theorem, because there is a function $g(x)$ , which is integrable, such that $|e^{-4x}\ln(p_L(x))| \leq g(x), \forall x \in [0, \infty), \forall L \in (0,L_0)$ . For example, $g(x) = 16 (x+1) e^{-x}$ .

Plinio SD - 5 years, 6 months ago

Jeremi Litarowicz
Dec 15, 2015

If $p(x)$ maximises $I_p$ then any change to $p(x)$ will either decrease or not change $I_p$ .

So, we define a function: $p_2(x)=\begin{cases}p(x)-dy, & \text{if } x=\sigma \\ p(x)+dy, & \text{if } x=d \\ p(x), & \text{otherwise}\end{cases}$

Note that $\int_{0}^{\infty} p(x)\,dx=\int_{0}^{\infty} p_2(x)\,dx$ .

Now we look at the relationship between $I_p$ and $I_{p_2}$ : $I_{p_2}=I_p+dx[e^{-4\sigma}(\ln(p(\sigma)-dy)-\ln(\sigma))+e^{-4d}(\ln(p(d)+dy)-\ln(d))] = I_p+dx[-e^{-4\sigma}\frac{dy}{p(\sigma)}+e^{-4d}\frac{dy}{p(d)}]$

Given our condition of $I_p$ being maximal, we know that $I_{p_2}-I_p\leq0$ . This implies that: $dy dx[-\frac{e^{-4\sigma}}{p(\sigma)}+\frac{e^{-4d}}{p(d)}]\leq0$

Since the sign of $dy dx$ is not specified, we have that: $-\frac{e^{-4\sigma}}{p(\sigma)}+\frac{e^{-4d}}{p(d)}=0$ $e^{4\sigma}p(\sigma)=e^{4d}p(d)$

Since the two sides are independent, we have that: $e^{4\sigma}p(\sigma)=\lambda=e^{4d}p(d)$ $p(\sigma)=\lambda e^{-4\sigma}$

Now we solve for $\lambda$ : $\int_{0}^{\infty} p(x)\,dx = \int_{0}^{\infty} \lambda e^{-4x}\,dx = \lambda/4 = 1$ $\lambda=4$

Substituting back into $I_p$ we get: $I_p = \int_{0}^{\infty} e^{-4x}\ln(4 e^{-4x})\,dx = \int_{0}^{\infty} e^{-4x}(\ln(4) -4x)\,dx = \boxed{\frac{1}{4}(\ln{4}-1)}$

Really awesome approach!

Abhishek Sinha - 5 years, 4 months ago

Maximize an Integral involving a probability distribution

This problem can be solved by using knowledge of high-school calculus only. Good luck!

The answer is 0.0965736.

3 solutions

0 pending reports