Appendix: A Combinatorial Construction of the Schubert Polynomials

Notes on Schubert Polynomials
Appendix

Arun Ram
Department of Mathematics and Statistics
University of Melbourne
Parkville, VIC 3010 Australia
aram@unimelb.edu.au

Last update: 2 July 2013

A Combinatorial Construction of the Schubert Polynomials

by Nantel Bergeron

In this appendix, we shall give a combinatorial rule based on diagrams for the construction of the Schubert polynomials. A different algorithm had been conjectured (and proved in the case of vexillary permutations) by A. Kohnert. We shall give, at the end of this appendix, a sketch of how one can show the equivalence of the two rules. I wish to acknowledge my indebtedness to Mark Shimozono for the stimulating exchanges regarding this work.

Combinatorial construction

Here a "diagram" will be any finite nonempty set of lattice points $(i, j)$ in the positive quadrant $(i \geq 1, j \geq 1)$. For example, the diagram $D(w)$ of a permutation $w$ is a diagram in the above sense. Let $D$ be any diagram. We denote by $D_{(r, r+1)}$ the diagram $D$ restricted to the rows $r$ and $r+1$. Let $j(r, D) = (j_{1}, j_{2}, \dots, j_{k})$ be the columns of $D$, listed in increasing order, in which there is exactly one element of $D_{(r, r+1)}$ per column. Choose a column $j_{i} \in j(r, D)$.

Assume first that $(r+1, j_{i}) \in D_{(r, r+1)}$. If $i = k$ or if $(r, j_{i+1}) \in D_{(r, r+1)}$, let $D_{1}$ be the diagram obtained from $D$ by replacing the element $(r+1, j_{i})$ by $(r, j_{i})$.

Now suppose instead that $(r, j_{i}) \in D_{(r, r+1)}$. We say that the point $(r, j_{i})$ is $r$-fixed with respect to $D(w)$ if the number of elements of $D$ in the column $j_{i}$ and in the rows $r' > r$ is equal to the number of elements of $D(w)$ in the same area. Now if $i = 1$ (and if there is no $r$-fixed element with respect to $D(w)$ in $D$) or if $(r+1, j_{i-1}) \in D_{(r, r+1)}$, let $D_{1}$ be the diagram obtained from $D$ by replacing the element $(r, j_{i})$ by $(r+1, j_{i})$.

In both cases we say that the diagram $D_{1}$ is obtained from $D$ by a "B-move" (with respect to $D(w)$). For example let $D$ be such that $D_{(r, r+1)}$ is the following:

For this case $j (r, D) = (2, 5, 8, 9) .$ We can perform on this diagram a B-move in column 2, 5 or 9 and obtain, respectively, the following diagrams:

[Figure: the three diagrams obtained by a B-move in column 2, 5 and 9 respectively.]

The element in column 8 is not allowed to move since $(r + 1, 5) \notin D_{(r, r + 1)}$. Let $Ω(w)$ denote the set of all diagrams (including $D(w)$) obtainable from $D(w)$ by any sequence of B-moves.

Next for $D \in Ω (w)$ let $x^{D}$ denote the monomial $x_{1}^{a_{1}} x_{2}^{a_{2}} x_{3}^{a_{3}} \dots$ where $a_{i}$ is the number of elements of $D$ in the $i^{th}$ row. For any permutation $w$ we shall have the following theorem:

\begin{matrix} (B.1) & 𝔖_{w} = \sum_{D \in Ω (w)} x^{D} . \end{matrix}
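The monomial $x^{D}$ only records how many elements of $D$ lie in each row. The following is a minimal Python sketch of that bookkeeping. It assumes the usual definition of the diagram of a permutation, which is not restated in this appendix: $(i, j) \in D(w)$ when $j < w(i)$ and $j$ occurs to the right of position $i$ in $w$. The function names `rothe_diagram` and `exponent_vector` are illustrative only.

```python
from collections import Counter

def rothe_diagram(w):
    """Return D(w) as a set of 1-indexed lattice points (i, j).

    Assumed definition (not stated in this appendix):
    (i, j) is in D(w) when j < w(i) and j occurs to the right of position i.
    """
    n = len(w)
    position = {value: index for index, value in enumerate(w, start=1)}
    return {(i, j)
            for i in range(1, n + 1)
            for j in range(1, w[i - 1])
            if position[j] > i}

def exponent_vector(diagram, n):
    """Return (a_1, ..., a_n), where a_i counts the elements of the diagram in row i."""
    rows = Counter(i for i, _ in diagram)
    return tuple(rows[i] for i in range(1, n + 1))

# For the longest element of S_3 the exponent vector is delta = (2, 1, 0).
print(exponent_vector(rothe_diagram((3, 2, 1)), 3))  # (2, 1, 0)
```

Under this assumed definition the diagram of the longest element is the staircase, so its monomial is $x^{δ}$.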

To prove this we will proceed by reverse induction on $ℓ(w)$. If $w = w_{0}$ (the longest element of $S_{n}$) then (B.1) holds since $Ω(w_{0})$ contains only the element $D(w_{0})$ and $x^{D(w_{0})} = x^{δ}$. On the other hand from (4.3), $𝔖_{w_{0}} = x^{δ}$. Now if $w \neq w_{0}$ then let $r = \min \{ i : w(i) < w(i + 1) \}$. From (4.2) we have

\begin{matrix} (B.2) & 𝔖_{w} = \partial_{r} 𝔖_{w s_{r}} . \end{matrix}

Let $v = w s_{r} .$ By the induction hypothesis equation (B.1) holds for $𝔖_{v} .$ The induction step will be to "apply" the operator $\partial_{r}$ to the diagrams in $Ω (v) .$ To this end we need more tools.
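The choice of $r$ and the passage from $w$ to $v = w s_{r}$ can be sketched in a few lines of Python. This is only an illustration: permutations are written as tuples of values in one-line notation, positions are 1-indexed as in the text, and the helper names `first_ascent` and `swap_at` are hypothetical.

```python
def first_ascent(w):
    """Return r = min{i : w(i) < w(i+1)} (1-indexed), or None when w is decreasing."""
    for i in range(1, len(w)):
        if w[i - 1] < w[i]:
            return i
    return None

def swap_at(w, r):
    """Return w s_r: the permutation w with the values in positions r and r+1 exchanged."""
    v = list(w)
    v[r - 1], v[r] = v[r], v[r - 1]
    return tuple(v)

w = (6, 3, 9, 5, 1, 2, 11, 8, 4, 7, 10)
r = first_ascent(w)
v = swap_at(w, r)
print(r)  # 2
print(v)  # (6, 9, 3, 5, 1, 2, 11, 8, 4, 7, 10)
```

Since $w(r) < w(r+1)$, the permutation $v$ has one more inversion than $w$, which is what makes the reverse induction on length work.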

For the moment let us fix $D \in Ω(v)$. Let $a = a_{r}(D)$ and $b = a_{r+1}(D)$ be respectively the number of elements of $D$ in the $r$th and $(r+1)$st rows. We have

\begin{matrix} (B.3) & \partial_{r} x^{D} = \partial_{r} ( \dots x_{r}^{a} x_{r + 1}^{b} \dots ) = \begin{cases} \sum_{k = 0}^{a - b - 1} \dots x_{r}^{a - k - 1} x_{r + 1}^{b + k} \dots & \text{if } a > b, \\ 0 & \text{if } a = b, \\ - \sum_{k = 0}^{b - a - 1} \dots x_{r}^{a + k} x_{r + 1}^{b - k - 1} \dots & \text{if } a < b . \end{cases} \end{matrix}
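The action of $\partial_{r}$ on a monomial can be checked mechanically. The sketch below assumes the standard definition $\partial_{r} f = (f - s_{r} f)/(x_{r} - x_{r+1})$ and expands each monomial into $|a - b|$ terms, with sign $+$ when $a > b$ and $-$ when $a < b$. Polynomials are stored as dictionaries from exponent tuples to integer coefficients; that representation and the name `divided_difference` are choices made for this illustration.

```python
from collections import defaultdict

def divided_difference(poly, r):
    """Apply the divided difference in x_r, x_{r+1} (r is 1-indexed), monomial by monomial.

    poly maps exponent tuples (a_1, ..., a_n) to integer coefficients.
    """
    result = defaultdict(int)
    for exps, coeff in poly.items():
        a, b = exps[r - 1], exps[r]
        sign = 1 if a > b else -1
        for k in range(abs(a - b)):  # no terms at all when a == b
            new = list(exps)
            if a > b:
                new[r - 1], new[r] = a - k - 1, b + k
            else:
                new[r - 1], new[r] = a + k, b - k - 1
            result[tuple(new)] += sign * coeff
    return {e: c for e, c in result.items() if c != 0}

# x_1^3 x_2 is sent to x_1^2 x_2 + x_1 x_2^2.
print(divided_difference({(3, 1): 1}, 1))  # {(2, 1): 1, (1, 2): 1}
```

A symmetric input such as $x_{1}^{3} x_{2} + x_{1} x_{2}^{3}$ is sent to zero, because the terms coming from the two monomials cancel in pairs.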

This suggests we define the operator $\partial_{r}$ directly on the diagram $D$. For this we need only concentrate our attention on the rows $r$ and $r+1$ of $D$. Let $j(r, D) = (j_{1}, j_{2}, \dots, j_{p})$. Notice that in all columns $j < w(r)$ of $D_{(r, r+1)}$ there are exactly two elements and in column $w(r) = j_{1}$ of $D_{(r, r+1)}$ there is exactly one element in position $(r, j_{1})$.

We shall now reduce the sequence of indices $j(r, D)$ according to the following rule. Let $J_{(0)} = (j_{2}, j_{3}, \dots, j_{p})$. Remove from $J_{(0)}$ all pairs $j_{k}, j_{k+1}$ for which $(r, j_{k}) \in D$ and $(r+1, j_{k+1}) \in D$. Let us denote the resulting sequence by $J_{(1)}$. Repeat this process recursively on $J_{(1)}$ until no such pair can be found. Let us denote by $f(r, D) = (f_{1}, f_{2}, \dots, f_{q})$ the final sequence. By construction, the sequence $f(r, D)$ is such that if $(r, f_{k}) \in D$ then $(r, f_{k+1}) \in D$. Let $\mathrm{up}(r, D)$ be the minimal $k$ such that $(r, f_{k}) \in D$. If $(r+1, f_{q}) \in D$ then set $\mathrm{up}(r, D) = q + 1$.

We are now in a position to define the operation of $\partial_{r}$ on the diagram $D$. To this end let us first assume that $a > b$. This means that we have $a - b$ more elements in row $r$ than in row $r+1$. Discarding the element $(r, w(r))$ lowers this excess to $a - b - 1$, and removing pairs does not change it, so the sequence $f(r, D)$ has $a - b - 1$ more entries in row $r$ than in row $r+1$. Hence $q - \mathrm{up}(r, D) + 1 \geq a - b - 1$ for $q$ the length of $f(r, D)$. The equality holds if and only if $\mathrm{up}(r, D) = 1$. In the case $a > b$ the operator $\partial_{r}$ on the diagram $D$ is defined by the map

\begin{matrix} (B.4a) & \partial_{r} : D \to \{ D_{0}, D_{1}, D_{2}, \dots, D_{a - b - 1} \} \end{matrix}

where $D_{0}$ is identical to $D$ except that we remove the element in position $(r, w(r))$, and for $k = 1, 2, \dots, a - b - 1$ we successively set $D_{k}$ to be identical to $D_{k-1}$ except that the element $(r, f_{\mathrm{up}(r, D) + k - 1})$ is replaced by $(r+1, f_{\mathrm{up}(r, D) + k - 1})$. Now if $a < b$ we have $\mathrm{up}(r, D) - 1 \geq b - a + 1$ (with equality if and only if $\mathrm{up}(r, D) = q + 1$). So $\mathrm{up}(r, D) - 1 > b - a$. In this case the operator $\partial_{r}$ on the diagram $D$ is defined by the map

\begin{matrix} (B.4b) & \partial_{r} : D \to \{ D_{0}, D_{1}, D_{2}, \dots, D_{b - a - 1} \} \end{matrix}

where $D_{0}$ is identical to $D$ except that we remove the element in position $(r, w(r))$ and the element $(r+1, f_{\mathrm{up}(r, D) - 1})$ is replaced by $(r, f_{\mathrm{up}(r, D) - 1})$. For $k = 1, 2, \dots, b - a - 1$ we successively set $D_{k}$ to be identical to $D_{k-1}$ except that the element $(r+1, f_{\mathrm{up}(r, D) - k - 1})$ is replaced by $(r, f_{\mathrm{up}(r, D) - k - 1})$. Finally if $a = b$ then

\begin{matrix} (B.4c) & \partial_{r} : D \to \{ \} . \end{matrix}

With this definition of $\partial_{r}$ we have that

\begin{matrix} (B.5) & \partial_{r} x^{D} = \pm \sum_{D_{i} \in \partial_{r} D} x^{D_{i}}, \end{matrix}

with the positive sign in case (B.4a), and the negative sign in case (B.4b). For (B.4c) the result of (B.5) is zero.
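The reduction of $j(r, D)$ to $f(r, D)$ and the maps (B.4a) to (B.4c) can be sketched as follows. Diagrams are Python sets of 1-indexed pairs $(i, j)$, and `wr` stands for $w(r)$. The pair removal is done with a stack, which gives the same final sequence as removing adjacent pairs repeatedly. The code takes for granted that the first column of $j(r, D)$ is $w(r)$, as holds for $D \in Ω(v)$, and does not check it. The sample rows below are a hypothetical diagram chosen to have $a = 7$, $b = 4$ and $j(r, D) = (3, 5, 7, 8, 10)$; the doubly occupied column 4 is an arbitrary choice.

```python
def reduce_singles(diagram, r):
    """Return (f, up): the reduced sequence f(r, D) and the 1-indexed up(r, D)."""
    columns = sorted({j for (i, j) in diagram if i in (r, r + 1)})
    singles = [j for j in columns if ((r, j) in diagram) != ((r + 1, j) in diagram)]
    stack = []  # columns of J_(0) that are still unmatched
    for j in singles[1:]:  # drop j_1, assumed to be w(r)
        if stack and (r, stack[-1]) in diagram and (r + 1, j) in diagram:
            stack.pop()  # remove a pair: an element in row r followed by one in row r+1
        else:
            stack.append(j)
    up = next((k for k, j in enumerate(stack, start=1) if (r, j) in diagram),
              len(stack) + 1)
    return stack, up

def partial_r(diagram, r, wr):
    """Return the list [D_0, D_1, ...] described in (B.4a)-(B.4c)."""
    a = sum(1 for (i, _) in diagram if i == r)
    b = sum(1 for (i, _) in diagram if i == r + 1)
    if a == b:
        return []
    f, up = reduce_singles(diagram, r)
    current = set(diagram) - {(r, wr)}
    results = []
    if a > b:
        results.append(frozenset(current))  # D_0
        for k in range(1, a - b):
            j = f[up + k - 2]  # f_{up + k - 1} as a 0-based index
            current = (current - {(r, j)}) | {(r + 1, j)}
            results.append(frozenset(current))
    else:
        for k in range(0, b - a):
            j = f[up - k - 2]  # f_{up - k - 1} as a 0-based index
            current = (current - {(r + 1, j)}) | {(r, j)}
            results.append(frozenset(current))
    return results

# Hypothetical rows r = 2 and r + 1 = 3, with w(r) = 3.
D = {(2, j) for j in (1, 2, 3, 4, 5, 8, 10)} | {(3, j) for j in (1, 2, 4, 7)}
print(reduce_singles(D, 2))  # ([8, 10], 1)
counts = [(sum(i == 2 for i, _ in E), sum(i == 3 for i, _ in E)) for E in partial_r(D, 2, 3)]
print(counts)  # [(6, 4), (5, 5), (4, 6)]
```

The row counts $(6, 4), (5, 5), (4, 6)$ are the exponents of $x_{r}, x_{r+1}$ in the three terms of $\partial_{r} x_{r}^{7} x_{r+1}^{4}$, in agreement with (B.5).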

We shall now show that

\begin{matrix} (B.6) & \partial_{r} \text{ maps } Ω (v) \text{ into } Ω (w) . \end{matrix}

Proof.

The reader will notice that in $D(v)$ the rectangle defined by the rows $1, 2, \dots, r+1$ and the columns $1, 2, \dots, w(r) - 1$ is filled with elements. None of these elements can B-move. Hence these elements are fixed in any diagram $D \in Ω(v)$. The same applies to all elements in column $w(r)$; they are packed in the smallest rows and there are no elements in the rows strictly greater than $r$. Now let $D$ be a diagram in $Ω(v)$ and assume that $\partial_{r} D = \{ D_{0}, D_{1}, \dots, D_{m} \}$ is non-empty. The remark above implies that the element in position $(r, w(r))$ does not affect the sequence of B-moves from $D(v)$ to $D$. Hence we can apply the same sequence of B-moves to $D(v) - \{ (r, w(r)) \}$ and obtain $D_{0}$. Moreover $D(v) - \{ (r, w(r)) \}$ is obtainable from $D(w)$ by a simple sequence of B-moves in rows $r, r+1$: one successively B-moves all the elements in row $r+1$ and columns given by $j(r, D(w))$. This gives that $D_{0}$ is obtainable from $D(w)$ by a sequence of B-moves, that is, $D_{0} \in Ω(w)$. Now from the construction of $\partial_{r} D$, $D_{k}$ $(k > 0)$ is obtained from $D_{k-1}$ by exactly one B-move. Hence $\partial_{r} D \subset Ω(w)$.

$□$

It is appropriate at this point to give an example. Let $w = (6, 3, 9, 5, 1, 2, 11, 8, 4, 7, 10) .$ Hence $r = 2$ and $v = (6, 9, 3, 5, 1, 2, 11, 8, 4, 7, 10) .$ We have depicted below the diagrams $D (w)$ and $D (v) .$ In our example the fixed elements described above are colored in grey and the element in position $(r, w (r))$ is colored black.

[Figure: the diagrams $D(w)$ and $D(v)$.]

Now let $D$ be the following diagram of $Ω (v) .$

[Figure: the diagram $D$.]

Here, $a_{r}(D) = 7$, $a_{r+1}(D) = 4$ and $j(r, D) = (3, 5, 7, 8, 10)$. The reduced sequence $f(r, D)$ is $(8, 10)$ and $\mathrm{up}(r, D) = 1$. Hence $\partial_{r} D = \{ D_{0}, D_{1}, D_{2} \}$ where

[Figure: the diagrams $D_{0}$, $D_{1}$ and $D_{2}$.]

To prove (B.1) the first step is to find a subset of $Ω (v)$ such that when we operate with $\partial_{r}$ we obtain $Ω (w) .$ To this end let

\begin{matrix} Ω_{0} (v) = \{ D \in Ω (v) : a_{r} (D) > a_{r + 1} (D) \text{ and } \mathrm{up} (r, D) = 1 \} . \end{matrix}

We have

\begin{matrix} (B.7) & Ω (w) = \bigcup_{D \in Ω_{0} (v)} \partial_{r} D \quad \text{(disjoint union)}. \end{matrix}

Proof.

It is clear from the construction that the subsets $\partial_{r} D$ are disjoint when $D \in Ω_{0}(v)$. By (B.6) we only have to prove that for any $D' \in Ω(w)$ there is a $D \in Ω_{0}(v)$ such that $D' \in \partial_{r} D$. To see that, reduce the sequence $j(r, D') = (j_{1}, \dots, j_{p})$ by removing recursively all pairs $j_{k}, j_{k+1}$ for which $(r, j_{k}) \in D'$ and $(r+1, j_{k+1}) \in D'$. Denote the final sequence by $f'(r, D') = (f'_{1}, f'_{2}, \dots)$. Let $D$ be the bubble diagram obtained from $D'$ by adding an element in position $(r, w(r))$ and successively B-moving all elements in positions $(r+1, f'_{i}) \in D'$. We have that $D \in Ω(v)$. To see this one applies to $D(v)$ the sequence of B-moves from $D(w)$ to $D - \{ (r, w(r)) \}$. Of course one should ignore any B-move in rows $r, r+1$ performed on the original elements of $D(v)$ in row $r$. But by the choice of $r$, the other B-moves apply almost directly and the resulting diagram is precisely $D$. Moreover since $f(r, D) = f'(r, D')$ and $\mathrm{up}(r, D) = 1$ we have $D \in Ω_{0}(v)$ and $D' \in \partial_{r} D$.

$□$

We shall now investigate the effect of $\partial_{r}$ on $Ω_{1} (v) = Ω (v) - Ω_{0} (v) .$ More precisely we have

\begin{matrix} (B.8) & \sum_{D \in Ω_{1} (v)} \partial_{r} x^{D} = 0 . \end{matrix}

Proof.

There are two classes of diagrams in $Ω_{1}(v)$. The first class contains the diagrams $D$ for which $a_{r}(D) = a_{r+1}(D)$. In this case it is trivial that $\partial_{r} x^{D} = 0$. The other class is formed by the diagrams $D$ such that $a_{r}(D) \neq a_{r+1}(D)$ and $\mathrm{up}(r, D) > 1$. In this case we shall construct an involution, $D \to D'$, such that $\partial_{r} x^{D} + \partial_{r} x^{D'} = 0$. Let $f(r, D) = (f_{1}, f_{2}, \dots, f_{q})$, $a = a_{r}(D)$ and $b = a_{r+1}(D)$. We first define the involution for the case $a > b$. Since $\mathrm{up}(r, D) > 1$ we must have $q - \mathrm{up}(r, D) + 1 \geq a - b$. So let $D'$ be identical to $D$ except that the elements in positions $(r, f_{\mathrm{up}(r, D)}), (r, f_{\mathrm{up}(r, D) + 1}), \dots, (r, f_{\mathrm{up}(r, D) + a - b - 1})$ are B-moved to the positions $(r+1, f_{\mathrm{up}(r, D)}), (r+1, f_{\mathrm{up}(r, D) + 1}), \dots, (r+1, f_{\mathrm{up}(r, D) + a - b - 1})$. It is clear that $D' \in Ω(v)$. But $f(r, D') = f(r, D)$ and $\mathrm{up}(r, D') > \mathrm{up}(r, D) > 1$, hence $D' \in Ω_{1}(v)$. Moreover we have $a_{r}(D') = b$ and $a_{r+1}(D') = a$, hence $\partial_{r} x^{D} + \partial_{r} x^{D'} = 0$. The case $a < b$ is similar to the previous one.

$□$

The proof of (B.1) is now completed by combining (B.2), (B.5), (B.7) and (B.8). More precisely, using the induction hypothesis, we have

\begin{matrix} 𝔖_{w} & = & \partial_{r} 𝔖_{v} & (B.2) \\ & = & \sum_{D \in Ω (v)} \partial_{r} x^{D} & \\ & = & \sum_{D \in Ω_{0} (v)} \partial_{r} x^{D} & (B.8) \\ & = & \sum_{D \in Ω_{0} (v)} \sum_{D_{i} \in \partial_{r} D} x^{D_{i}} & (B.5) \\ & = & \sum_{D' \in Ω (w)} x^{D'} . & (B.7) \end{matrix}

Kohnert's construction

Let $D$ be any diagram. Choose $(i, j) \in D$ such that $(i, j') \notin D$ for all $j' > j$, that is, the rightmost element of row $i$. Let us suppose that there is a point $(i', j) \notin D$ with $i' < i$. Then let $h < i$ be the largest integer such that $(h, j) \notin D$ and let $D_{1}$ denote the diagram obtained from $D$ by replacing $(i, j)$ by $(h, j)$. We say that $D_{1}$ is obtained from $D$ by a "K-move". Now let $K(D)$ denote the set of all diagrams (including $D$ itself) obtainable from $D$ by any sequence of K-moves. Kohnert's conjecture states that for any permutation $w$ we have

\begin{matrix} (B.9) & 𝔖_{w} = \sum_{D \in K (D (w))} x^{D} . \end{matrix}
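The set $K(D(w))$ is easy to enumerate for small permutations. The sketch below makes two assumptions that should be read as such: the K-move is applied to the rightmost element of a row (the usual form of Kohnert's rule), and $D(w)$ is the diagram with $(i, j) \in D(w)$ when $j < w(i)$ and $j$ occurs to the right of position $i$ in $w$. All function names are illustrative.

```python
from collections import Counter

def rothe_diagram(w):
    """Assumed definition of D(w): (i, j) with j < w(i) and j to the right of position i."""
    position = {value: index for index, value in enumerate(w, start=1)}
    return frozenset((i, j)
                     for i in range(1, len(w) + 1)
                     for j in range(1, w[i - 1])
                     if position[j] > i)

def kohnert_moves(diagram):
    """Yield every diagram obtained from `diagram` by one K-move."""
    for i in {row for (row, _) in diagram}:
        j = max(col for (row, col) in diagram if row == i)  # rightmost element of row i
        empty_above = [h for h in range(1, i) if (h, j) not in diagram]
        if empty_above:
            h = max(empty_above)  # largest h < i with (h, j) not in the diagram
            yield frozenset((diagram - {(i, j)}) | {(h, j)})

def kohnert_closure(diagram):
    """Return the set of all diagrams reachable by sequences of K-moves."""
    start = frozenset(diagram)
    seen, stack = {start}, [start]
    while stack:
        current = stack.pop()
        for nxt in kohnert_moves(current):
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen

def monomials(diagrams, n):
    """Count the exponent vectors (a_1, ..., a_n) of the monomials x^D."""
    return Counter(tuple(sum(1 for (i, _) in D if i == row) for row in range(1, n + 1))
                   for D in diagrams)

w = (1, 4, 3, 2)
print(len(kohnert_closure(rothe_diagram(w))))  # 5
```

For $w = (1, 4, 3, 2)$ the five diagrams give the monomials $x_{2}^{2} x_{3}$, $x_{1} x_{2} x_{3}$, $x_{1} x_{2}^{2}$, $x_{1}^{2} x_{3}$ and $x_{1}^{2} x_{2}$.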

A. Kohnert has proved (B.9) for the case where $w$ is a vexillary permutation but the general case was still open. For the interested reader here is a sketch of how one may prove (B.9).

We have noticed by computer that $Ω(w) = K(D(w))$. The idea then is to show both inclusions by induction. The inclusion $K(D(w)) \subset Ω(w)$ is the easier one. We only have to show that any K-move of an element $(i, j)$ to $(h, j)$ can be simulated using B-moves. For this we proceed by induction on $i - h$. If $i - h = 1$ then the K-move is simply one B-move. Now if $i - h > 1$, we first perform the sequence of B-moves in rows $h, h+1$ necessary to B-move the element $(h+1, j)$ to $(h, j)$. Then using the induction hypothesis we can K-move $(i, j)$ to $(h+1, j)$. Finally we reverse the first sequence of B-moves in rows $h, h+1$. That shows $K(D(w)) \subset Ω(w)$.

The other inclusion needs a lot more work. For $D \in K(D(w))$ and $i$ any row of $D$ let $B_{i}(D)$ denote the set of all diagrams (including $D$) obtainable from $D$ by any sequence of B-moves in the rows $i, i+1$ only. It is clear that if $i$ is big enough then $B_{i}(D) \subset K(D(w))$. We may then proceed by reverse induction on $i$. Now for a fixed $i$, notice that $B_{i}(D(w))$ is obtainable from $D(w)$ using only K-moves. Let $Ω_{0}$ denote the set of all diagrams obtainable from $B_{i}(D(w))$ by any sequence of K-moves for which no element crosses the border between the rows $i+1$ and $i+2$. A simple inductive algorithm may be used here to show that for any $D \in Ω_{0}$ we have $B_{i}(D) \subset Ω_{0}$. Next let $Ω_{k}$ denote the set of all diagrams of $K(D(w))$ which have $k$ more elements than $D(w)$ in the rows $1, 2, \dots, i+1$. For almost all the cases it is fairly easy to show (using induction on $k$ and the induction hypothesis on $i$) that for $D \in Ω_{k}$ we have $B_{i}(D) \subset Ω_{k}$. But some of the cases are really hard to formalize! Once completed, this would show that $Ω(w) \subset K(D(w))$ since $K(D(w)) = \bigcup_{k} Ω_{k}$.

Notes and References

This is a typed excerpt of the book Notes on Schubert Polynomials by I. G. Macdonald.
