PlanetMath (more info)
 Math for the people, by the people. Sponsor PlanetMath
Encyclopedia | Requests | Forums | Docs | Wiki | Random | RSS  
Login
create new user
name:
pass:
forget your password?
Main Menu
Owner confidence rating: High Entry average rating: No information on entry rating
[parent] proof of Birkhoff-von Neumann theorem (Proof)

First, we prove the following lemma:
Lemma:
A convex combination of doubly stochastic matrices is doubly stochastic.

Proof:
Let $\{A_i\}_{i=1}^m$ be a collection of $n \times n$ doubly-stochastic matrices, and suppose $\{\lambda_i\}_{i=1}^m$ is a collection of scalars satisfying $\sum_{i=1}^m \lambda_i=1$ and $\lambda_i \geq 0$ for each $i=1,\ldots,m$ We claim that $A=\sum_{i=1}^m \lambda_iA_i$ is doubly stochastic.
Take any $i \in \{1,\ldots,m\}$ Since $A_i$ is doubly stochastic, each of its rows and columns sum to 1. Thus each of the rows and columns of $\lambda_iA_i$ sum to $\lambda_i$
By the definition of elementwise summation, given matrices $N=M_1+M_2$ the sum of the entries in the $i$ column of $N$ is clearly the sum of the sums of entries of the $i$ columns of $M_1$ and $M_2$ respectively. A similar result holds for the $j$ row.
Hence the sum of the entries in the $i$ column of $A$ is the sum of the sums of entries of the $i$ columns of $\lambda_kA_k$ for each $i$ that is, $\sum_{k=1}^m \lambda_k=1$ The sum of entries of the $j$ row of $A$ is the same. Hence $A$ is doubly stochastic.

Observe that since a permutation matrix has a single nonzero entry, equal to 1, in each row and column, so the sum of entries in any row or column must be 1. So a permutation matrix is doubly stochastic, and on applying the lemma, we see that a convex combination of permutation matrices is doubly stochastic.

This provides one direction of our proof; we now prove the more difficult direction: suppose $B$ is doubly-stochastic. Define a weighted graph $G=(V,E)$ with vertex set $V=\{r_1,\ldots,r_n,c_1,\ldots,c_n\}$ edge set $E$ where $e_{ij}=(r_i,c_i) \in E$ if $B_{ij} \neq 0$ and edge weight $\omega$ where $\omega(e_{ij})=B_{ij}$

Clearly $G$ is a bipartite graph, with partitions $R=\{r_1,\ldots,r_n\}$ and $C=\{c_1,\ldots,c_n\}$ since the only edges in $E$ are between $r_i$ and $c_j$ for some $i,j \in \{1,\ldots,n\}$ Furthermore, since $B_{ij} \geq 0$ then $\omega(e)>0$ for every $e \in E$

For any $A \subset V$ define $N(A)$ the neighbourhood of $A$ to be the set of vertices $u \in V$ such that there is some $v \in A$ such that $(u,v) \in E$

We claim that, for any $v \in V$ $\sum_{u \in N(\{v\})} \omega(u,v)=1$ Take any $v \in V$ either $v \in R$ or $v \in C$ Since $G$ is bipartite, $v \in R$ implies $N(\{v\}) \subset C$ and $v \in C$ implies $N(\{v\}) \subset R$ Now, $$ \begin{array}{c} {\displaystyle v=r_i \implies \sum_{u \in N(r_i)} \omega(r_i,u)=\sum_{\stackrel{j=1,}{e_{ij} \in E}}^n \omega(r_i,c_j) = \sum_{\stackrel{j=1,}{B_{ij} \neq 0}}^n B_{ij}= \sum_{j=1}^n B_{ij}=1 } \\ {\displaystyle v=c_j \implies \sum_{u \in N(c_j)} \omega(u,c_j)=\sum_{\stackrel{i=1,}{e_{ij} \in E}}^n \omega(r_i,c_j) = \sum_{\stackrel{i=1,}{B_{ij} \neq 0}}^n B_{ij}= \sum_{i=1}^n B_{ij}=1 } \end{array} $$ since $B$ is doubly stochastic. Now, take any $A \subset R$ We have $$ \sum_{\stackrel{v \in A}{w \in N(A)}} \omega(v,w) = \sum_{v \in A} \sum_{w \in N(\{v\})} \omega(v,w) = \sum_{v \in A} 1 = |A| $$ Let $B=N(A)$ But then clearly $A \subset N(B)$ by definition of neighbourhood. So $$ |N(A)|=|B| = \sum_{\stackrel{v \in B}{w \in N(B)}} \omega(v,w) \geq \sum_{\stackrel{v \in B}{w \in A}} \omega(v,w) = \sum_{\stackrel{w \in A}{v \in N(A)}} \omega(v,w) = |A| $$ So $|N(A)| \geq |A|$ We may therefore apply the graph-theoretic version of Hall's marriage theorem to $G$ to conclude that $G$ has a perfect matching.

So let $M \subset E$ be a perfect matching for $G$ Define an $n \times n$ matrix $P$ by $$ P_{ij} = \left\{ \begin{array}{ll} 1 & \text{if } e_{ij} \in M \\ 0 & \text{otherwise} \end{array} \right.$$

Note that $B_{ij}=0$ implies $P_{ij}=0$ if $B_{ij}=0$ then $(r_i,c_j) \not \in E$ so $(r_i,c_j) \not \in M$ which implies $P_{ij}=0$

Further, we claim that $P$ is a permutation matrix:
Let $i$ be any row of $P$ Since $M$ is a perfect matching of $G$ there exists $e_0 \in M$ such that $r_i$ is an end of $e_0$ Let the other end be $c_j$ for some $i$ then $P_{ij}=1$
Suppose $i_i,i_2 \in \{1,\ldots,n\}$ with $i_1 \neq i_2$ and $P_{i_1,j}=P_{i_2,j}=1$ for some $j$ This implies $(r_{i_1},c_j),(r_{i_2},c_j) \in M$ but this implies the vertex $i$ is the end of two distinct edges, which contradicts the fact that $M$ is a matching.
Hence, for each row and column of $P$ there is exactly one nonzero entry, whose value is 1. So $P$ is a permutation matrix.

Define $\displaystyle{\lambda=\min_{i,j \in \{1,\ldots,n\}} \{ B_{ij} \mid P_{ij} \neq 0\}}$ We see that $\lambda>0$ since $B_{ij} \geq 0$ and $P_{ij} \neq 0 \implies B_{ij} \neq 0$ Further, $\lambda=B_{pq}$ for some $p,q$

Let $D=B-\lambda P$ If $D=0$ then $\lambda=1$ and $B$ is a permutation matrix, so we are done. Otherwise, note that $D$ is nonnegative; this is clear since $\lambda P_{ij} \leq \lambda \leq B_{ij}$ for any $B_{ij} \neq 0$

Notice that $D_{pq}=B_{pq}-\lambda P_{pq} = \lambda - \lambda \cdot 1 = 0$

Note that since every row and column of $B$ and $P$ sums to 1, that every row and column of $D=B-\lambda P$ sums to $1-\lambda$ Define $B'=\frac{1}{1-\lambda}D$ Then every row and column of $B'$ sums to 1, so $B'$ is doubly stochastic.

Rearranging, we have $B=\lambda P + (1-\lambda) B'$ Clearly $B_{ij}=0$ implies that $P_{ij}=0$ which implies that $B'_{ij}=0$ so the zero entries of $B'$ are a superset of those of $B'$ But notice that $B'_{pq}=\frac{1}{1-\lambda} D_{pq}=0$ so the zero entries of $B'$ are a strict superset of those of $B$

We have decomposed $B$ into a convex combination of a permutation matrix and another doubly stochastic matrix with strictly more zero entries than $B$ Thus we may apply this procedure repeatedly on the doubly stochastic matrix obtained from the previous step, and the number of zero entries will increase with each step. Since $B$ has at most $n^2$ nonzero entries, we will obtain a convex combination of permutation matrices in at most $n^2$ steps.

Thus $B$ is indeed expressible as a convex combination of permutation matrices.

References

G. Birkhoff, ``Tres observaciones sobre el algebra lineal,'' Univ. Nac. Tucumán Rev, Ser. A, no. 5, pp147-151. (1946)




"proof of Birkhoff-von Neumann theorem" is owned by Andrea Ambrosio. [ full author list (2) | owner history (1) ]
(view preamble | get metadata)

View style:


This object's parent.
Log in to rate this entry.
(view current ratings)

Cross-references: expressible, number, strictly, strict, superset, clear, matching, perfect matching, Hall's marriage theorem, implies, vertices, neighbourhood, partitions, bipartite graph, weight, edge, vertex, graph, permutation matrix, similar, sum, columns, rows, scalars, collection, proof, matrices, doubly stochastic, convex combination

This is version 11 of proof of Birkhoff-von Neumann theorem, born on 2002-11-20, modified 2006-09-05.
Object id is 3611, canonical name is ProofOfBirkoffVonNeumannTheorem.
Accessed 8924 times total.

Classification:
AMS MSC15A51 (Linear and multilinear algebra; matrix theory :: Stochastic matrices)

Pending Errata and Addenda
None.
[ View all 4 ]
Discussion
Style: Expand: Order:
forum policy

No messages.

Interact
post | correct | update request | add example | add (any)