| Version 2 |
Version 1 |
| Since the Legendre transform is the maximum of a function, Lagrange multipliers should be the natural choice for handling constraints. However, there is a problem due to the fact that we are mostly interested in the maximum's functional dependence on the transform parameter, and not on its value.\\\\ |
Since the Legendre transform is the maximum of a function, Lagrange multipliers should be the natural choice for handling constraints. However, there is a problem due to the fact that we are mostly interested in the maximum's functional dependence on the transform parameter, and not on its value.\\\\ |
| \textbf{THE LEGENDRE-LAGRANGE PROBLEM}\\ |
\textbf{THE LEGENDRE-LAGRANGE PROBLEM}\\ |
|
|
| Let $f(\overline{x})$ be a function of the real n-vector $\overline{x}$ and $g(\overline{p})$ its Legendre transform defined by |
Let $f(\overline{x})$ be a function of the real n-vector $\overline{x}$ and $g(\overline{p})$ its Legendre transform defined by |
| \begin{align} |
\begin{align} |
| g(\overline{p}) \;=\;\overline{p}.\overline{x}-f(\overline{x})\quad\quad p_i\;=\;\frac{\partial f}{\partial x_i} |
g(\overline{p}) \;=\;\overline{p}.\overline{x}-f(\overline{x})\quad\quad p_i\;=\;\frac{\partial f}{\partial x_i} |
| \end{align} |
\end{align} |
| The vector $\overline{p}$ is a function of $\overline{x}$, or conversely, $\overline{x}$ is a function of $\overline{p}$, so that $g(\overline{p})$ is a function of $\overline{p}$ alone. |
The vector $\overline{p}$ is a function of $\overline{x}$, or conversely, $\overline{x}$ is a function of $\overline{p}$, so that $g(\overline{p})$ is a function of $\overline{p}$ alone. |
| The Legendre transform is also alternatively defined as the maximum of the function $\overline{p}.\overline{x}-f(\overline{x})$. This maximum is reached for $\overline{x}$ satifying the second set of equations in (1). |
The Legendre transform is also alternatively defined as the maximum of the function $\overline{p}.\overline{x}-f(\overline{x})$. This maximum is reached for $\overline{x}$ satifying the second set of equations in (1). |
| Suppose now that the components $x_i$ are not all independent, but are linked by the constraint: |
Suppose now that the components $x_i$ are not all independent, but are linked by the constraint: |
| \begin{align} |
\begin{align} |
| h(\overline{x})\;=\;0 |
h(\overline{x})\;=\;0 |
| \end{align} |
\end{align} |
| This equation defines one of the components, $x_\alpha$ for example, as a function of all the others. Putting this function into (1) would give us the transform $g(\overline{p})$ as a function of the vector $\overline{p}$ with n-1 components. It would be nice if we could instead use a Lagrange multiplier $k$ and compute the maximum of the function $$\overline{p}.\overline{x}-f(\overline{x})+kh(\overline{x})$$ with all its n components. But then, how does the constraint (2) on $\overline{x}$ translate to $\overline{p}$? The answer is amazingly simple, as we shall see next.\\\\ |
This equation defines one of the components, $x_\alpha$ for example, as a function of all the others. Putting this function into (1) would give us the transform $g(\overline{p})$ as a function of the vector $\overline{p}$ with n-1 components. It would be nice if we could instead use a Lagrange multiplier $k$ and compute the maximum of the function $$\overline{p}.\overline{x}-f(\overline{x})+kh(\overline{x})$$ with all its n components. But then, how does the constraint (2) on $\overline{x}$ translate to $\overline{p}$? The answer is amazingly simple, as we shall see next.\\\\ |
| \textbf{DIRECT COMPUTATION OF THE TRANSFORM}\\ |
\textbf{DIRECT COMPUTATION OF THE TRANSFORM}\\ |
|
|
| We are going first to compute the Legendre transform the hard way, without the help of a multiplier, by considering $x_\alpha$ as a function of the other components: |
We are going first to compute the Legendre transform the hard way, without the help of a multiplier, by considering $x_\alpha$ as a function of the other components: |
| \begin{align} |
\begin{align} |
| p_i=\frac{df}{dx_i}=\frac{\partial f}{\partial x_i}+\frac{\partial f}{\partial x_\alpha}\frac{\partial x_\alpha}{\partial x_i}\quad\quad \frac{dh}{dx_i}=\frac{\partial h}{\partial x_i}+\frac{\partial h}{\partial x_\alpha}\frac{\partial x_\alpha}{\partial x_i}=0\quad\quad i\neq\alpha |
p_i=\frac{df}{dx_i}=\frac{\partial f}{\partial x_i}+\frac{\partial f}{\partial x_\alpha}\frac{\partial x_\alpha}{\partial x_i}\quad\quad \frac{dh}{dx_i}=\frac{\partial h}{\partial x_i}+\frac{\partial h}{\partial x_\alpha}\frac{\partial x_\alpha}{\partial x_i}=0\quad\quad i\neq\alpha |
| \end{align} |
\end{align} |
| Taking the value of \Large $\frac{\partial x_\alpha}{\partial x_i}$ \normalsize from the second set of equations (3) and putting it into the first set, we get: |
Taking the value of \Large $\frac{\partial x_\alpha}{\partial x_i}$ \normalsize from the second set of equations (3) and putting it into the first set, we get: |
| \begin{align} |
\begin{align} |
| p_i\;=\;\frac{\partial f}{\partial x_i}-\Phi\frac{\partial h}{\partial x_i}\quad \mbox{with} \quad \Phi\;=\;\frac{\frac{\partial f}{\partial x_\alpha}}{\frac{\partial h}{\partial x_\alpha}}\quad\mbox{and}\quad i\neq\alpha |
p_i\;=\;\frac{\partial f}{\partial x_i}-\Phi\frac{\partial h}{\partial x_i}\quad \mbox{with} \quad \Phi\;=\;\frac{\frac{\partial f}{\partial x_\alpha}}{\frac{\partial h}{\partial x_\alpha}}\quad\mbox{and}\quad i\neq\alpha |
| \end{align} |
\end{align} |
| From definition (1), the Legendre transform $g$ is therefore: |
From definition (1), the Legendre transform $g$ is therefore: |
| \begin{align} |
\begin{align} |
| g(\overline{p})\;=\;\sum_{i\neq\alpha} x_i\left( \frac{\partial f}{\partial x_i} - \Phi\frac{\partial h}{\partial x_i}\right)-f(\overline{x}) |
g(\overline{p})\;=\;\sum_{i\neq\alpha} x_i\left( \frac{\partial f}{\partial x_i} - \Phi\frac{\partial h}{\partial x_i}\right)-f(\overline{x}) |
| \end{align}\\ |
\end{align}\\ |
| \textbf{BACK TO THE LAGRANGE MULTIPLIER}\\ |
\textbf{BACK TO THE LAGRANGE MULTIPLIER}\\ |
|
|
|
In the first equation (4), if we set $$i=\alpha$$ we get $$p_\alpha = 0$$ and conversely, getting the value of $\Phi$. So, in (5), we may remove the condition $$i\neq\alpha$$ and sum over all the n components of $\overline{p}$. The constraint $$p_\alpha = 0$$ reduces the number of independent components to n-1. In fact, we are back to the traditional Lagrange method with the multiplier $\Phi$ and an additional constraint. We even have the choice between n such constraints. They generate up to n functionally different transforms corresponding to the n possible forms of the function $\overline{x}$, according to which component $x_\alpha$ we choose to eliminate.
|
In the first equation (4), if we set $i=\alpha$, we get $p_\alpha = 0$. Conversely, ignoring the the second equation (4), if we set $p_\alpha = 0$, we get the value of $\Phi$. So, in (5), we may remove the condition $i\neq\alpha$ and sum over all the n components of $\overline{p}$. The condition $p_\alpha = 0$ reduces the number of independent components to n-1. In fact, we are back to the traditional Lagrange method with the multiplier $\Phi$ and the additional constraint $$p_\alpha = 0$$ We even have the choice between n such constraints. They generate up to n functionally different transforms corresponding to the n possible forms of the function $\overline{x}$, according to which component $x_\alpha$ we choose to eliminate.
|
| The method extends easily to the case of more than one constraint: $h_1=h_2=...h_m=0$. We use m multipliers $\Phi_1, \Phi_2...\Phi_m$. They are computed by equating to zero any set of m components from $\overline{p}$. |
The method extends easily to the case of more than one constraint: $h_1=h_2=...h_m=0$. We use m multipliers $\Phi_1, \Phi_2...\Phi_m$. They are computed by equating to zero any set of m components from $\overline{p}$. |
| \begin{thebibliography}{1} |
\begin{thebibliography}{1} |
| \bibitem {A} \PMlinkexternal{Fersanz at PM - Legendre Transform}{http://planetmath.org/encyclopedia/LegendreTransform.html}\\ |
\bibitem {A} \PMlinkexternal{Fersanz at PM - Legendre Transform}{http://planetmath.org/encyclopedia/LegendreTransform.html}\\ |
| This link is actually broken but, hopefully, should be operative soon. |
This link is actually broken but, hopefully, should be operative soon. |
| \bibitem {B} \PMlinkexternal{Wikipedia - Legendre transformation}{http://en.wikipedia.org/wiki/Legendre_transformation} |
\bibitem {B} \PMlinkexternal{Wikipedia - Legendre transformation}{http://en.wikipedia.org/wiki/Legendre_transformation} |
| \end{thebibliography} |
\end{thebibliography} |