This course studies the combinatorial and algebraic structure of partially ordered sets and lattices, with an emphasis on how local order data controls global enumeration and topology. It develops the basic language of finite posets, then moves into the central invariants that measure their size and shape, such as chains, antichains, ranks, and interval structure. From there, the course introduces Möbius functions, incidence algebras, and inversion formulas as algebraic tools for counting and organizing combinatorial information.
The main themes are decomposition, duality, and structure: how posets can be broken into chains and antichains, how lattice operations and closure operators encode algebraic behavior, and how geometric lattices and matroids connect order theory to geometry and independence. Later chapters broaden the perspective to Eulerian posets, flag enumeration, order complexes, and topological properties such as shellability and Cohen-Macaulayness, showing how combinatorial posets can carry deep geometric and homological information. The final chapters bring these ideas back to enumeration through $P$-partitions and order polynomials, culminating in a synthesis of local interval behavior and global counting principles.
# Introduction
This opening chapter sets the direction for the course. We study finite partially ordered sets as objects that can be drawn, counted, multiplied, inverted, and topologised. The guiding theme is that order relations encode algebraic structure: once a set has a partial order, it has intervals, incidence functions, chains, rank data, and often a lattice operation. Chapters 1 through 5 develop these ideas through finite-poset constructions, decomposition theorems, incidence algebras, Möbius inversion, and lattices; here we fix the viewpoint and introduce the examples that will reappear throughout the course.
The course begins from undergraduate algebra, basic combinatorics, linear algebra, elementary graph theory, and familiarity with generating functions. Later chapters use a small amount of commutative algebra, simplicial topology, matroid theory, representation theory, and homological language; when those topics enter, the note treats them as structured tools for finite posets rather than as assumed background. The central objects are finite, so the emphasis is not on set-theoretic pathology. Instead, finite posets provide a common language for enumeration, representation by diagrams, inversion formulas, lattice theory, and simplicial topology.
## What Does A Poset Remember?
What information is retained when we forget the labels of a collection and keep only the comparisons among its elements? A partial order records which elements are below which others, but it also records many derived objects: chains, antichains, intervals, ideals, filters, and linear extensions. These objects are not decoration; they are the basic data from which the algebraic and enumerative invariants of the course are built.
[definition: Finite Poset]
A finite poset is a pair $(P, \le)$ where $P$ is a finite set and $\le$ is a binary relation on $P$ satisfying, for all $x,y,z \in P$,
\begin{align*}
x \le x, \qquad x \le y \text{ and } y \le x \implies x = y, \qquad x \le y \text{ and } y \le z \implies x \le z.
\end{align*}
[/definition]
The three axioms are reflexivity, antisymmetry, and transitivity. In finite examples, the order is often represented by a Hasse diagram, where only cover relations are drawn and all other comparisons are recovered by transitivity.
[example: Divisibility Poset]
Let $P=\{1,2,3,4,6,12\}$ and define $a \le b$ to mean $a \mid b$. The relation is reflexive because $a=1\cdot a$, so $a\mid a$ for every $a\in P$. It is antisymmetric because if $a\mid b$ and $b\mid a$, then $b=ra$ and $a=sb$ for positive integers $r,s$, hence $a=sra$, so $sr=1$, and therefore $r=s=1$ and $a=b$. It is transitive because if $a\mid b$ and $b\mid c$, then $b=ra$ and $c=tb$ for positive integers $r,t$, so $c=t(ra)=(tr)a$, hence $a\mid c$.
The minimum element is $1$, since
\begin{align*}
1\mid 1,\ 1\mid 2,\ 1\mid 3,\ 1\mid 4,\ 1\mid 6,\ 1\mid 12.
\end{align*}
The maximum element is $12$, since
\begin{align*}
1\mid 12,\ 2\mid 12,\ 3\mid 12,\ 4\mid 12,\ 6\mid 12,\ 12\mid 12.
\end{align*}
The element $2$ lies below $4,6,12$ because $4=2\cdot 2$, $6=3\cdot 2$, and $12=6\cdot 2$. The elements $4$ and $6$ are incomparable: $4\nmid 6$ because $6/4$ is not an integer, and $6\nmid 4$ because $4/6$ is not an integer.
For the interval from $2$ to $12$, an element $z\in P$ belongs to $[2,12]$ exactly when $2\mid z$ and $z\mid 12$. Checking the six elements of $P$, the possibilities are $2,4,6,12$, while $1$ fails $2\mid 1$ and $3$ fails $2\mid 3$. Thus
\begin{align*}
[2,12]=\{2,4,6,12\}.
\end{align*}
This example turns divisibility calculations into order-theoretic data: extrema, comparable pairs, incomparable pairs, and intervals are all read from the same relation $a\mid b$.
[/example]
This example already illustrates the main habit of the course: translate a familiar combinatorial or algebraic situation into order language, then use the order to compute. The next basic example is more symmetric and will serve as the testing ground for many theorems.
[example: Boolean Lattice]
Let $B_n$ be the set of all subsets of $\{1,\dots,n\}$ ordered by inclusion. For a subset $A\subseteq \{1,\dots,n\}$, its rank is $\rho(A)=|A|$, so the rank-$k$ elements are exactly the subsets with $k$ elements:
\begin{align*}
\{A\in B_n:\rho(A)=k\}=\{A\subseteq \{1,\dots,n\}:|A|=k\}.
\end{align*}
There are $\binom{n}{k}$ such subsets, because choosing an element of rank $k$ is the same as choosing which $k$ of the $n$ elements belong to it.
A maximal chain from $\varnothing$ to $\{1,\dots,n\}$ must increase the subset size by one at each step:
\begin{align*}
\varnothing=A_0\subset A_1\subset A_2\subset \cdots \subset A_n=\{1,\dots,n\},
\end{align*}
with $|A_i|=i$ for every $i$. Since $A_i$ has one more element than $A_{i-1}$, there is a unique element $a_i\in \{1,\dots,n\}$ such that
\begin{align*}
A_i=A_{i-1}\cup\{a_i\}.
\end{align*}
Thus the chain determines the ordered list $(a_1,\dots,a_n)$, and every element of $\{1,\dots,n\}$ appears exactly once in that list. Conversely, any permutation $(a_1,\dots,a_n)$ defines a maximal chain by
\begin{align*}
A_i=\{a_1,\dots,a_i\}.
\end{align*}
Therefore the maximal chains in $B_n$ are in bijection with the $n!$ permutations of $\{1,\dots,n\}$.
This single poset records binomial coefficients through its rank sizes and permutations through its maximal chains, which is why it reappears in symmetric chain decompositions and in the Sperner problem.
[/example]
The Boolean lattice shows why posets are not merely diagrams. Its ranks form Pascal's triangle, its maximal chains encode permutations, and its antichains lead to extremal set theory. Many later results will be first proved or tested in $B_n$ before being transported to more general graded posets.
## Why Intervals Matter?
How can local order data determine global algebraic formulas? The answer begins with intervals. Most invariants in the course are computed not from the whole poset at once, but from the subposets lying between two comparable elements.
[definition: Interval In A Poset]
Let $(P, \le)$ be a poset and let $x,y \in P$ with $x \le y$. The interval from $x$ to $y$ is
\begin{align*}
[x,y] := \{z \in P : x \le z \le y\}.
\end{align*}
[/definition]
Intervals are the local pieces on which incidence functions live. To turn this local viewpoint into algebra, we need a basic incidence function that records exactly where comparisons occur and that can later be inverted.
[definition: Zeta Function Of A Finite Poset]
Let $(P, \le)$ be a finite poset. The zeta function of $P$ is the function
\begin{align*}
\zeta_P : \{(x,y) \in P \times P : x \le y\} \to \mathbb{Z}
\end{align*}
given by
\begin{align*}
\zeta_P(x,y) := 1 \quad \text{for } x \le y.
\end{align*}
[/definition]
The zeta function is the order-theoretic analogue of the constant function $1$ on intervals. Once summation over lower elements is written using $\zeta_P$, the natural next question is whether that summation operation can be reversed in the same order-theoretic language.
For a finite poset, this reversal is multiplication in the incidence algebra of functions on comparable pairs, where convolution is the product obtained by summing over intermediate elements of an interval. The result below is the finite-poset analogue of inverting a triangular summation matrix: it says that every cumulative function on lower ideals has a unique local inverse, and that inverse is governed by a new invariant $\mu_P$ attached to intervals of the order.
The theorem below says that the inverse exists and is again controlled interval by interval. Thus Möbius inversion is not an extra structure imposed on a poset; it is the algebraic inverse of the most basic incidence function attached to the order.
[quotetheorem:7126]
[citeproof:7126/algebraic-combinatorics-ii-posets-and-lattices]
This theorem is the first sign that posets support algebra. The proof is linear algebra, but the inverse matrix is controlled by the interval structure of $P$, so algebraic inversion becomes a combinatorial computation. The finiteness hypothesis ensures that every displayed sum is finite and that the triangular matrix is an ordinary finite matrix; for an infinite poset such as $(\mathbb{Z}, \le)$, the expression $\sum_{x \le y} f(x)$ may involve infinitely many terms and need not make sense in an arbitrary commutative ring. The theorem also does not compute $\mu_P$ by itself; later incidence-algebra methods give recursive formulas interval by interval. Nor does the result require $P$ to be a lattice or graded poset, so it is an inversion theorem for order relations rather than for meet and join operations.
[example: Boolean Möbius Values]
Let $A\subseteq B$ in $B_n$, and write $m=|B\setminus A|$. We show that $\mu_{B_n}(A,B)=(-1)^m$ by checking the inverse relation for the zeta function. If $A=B$, then $m=0$, so
\begin{align*}
(-1)^{|B\setminus A|}=(-1)^0=1.
\end{align*}
Now suppose $A\subsetneq B$. Every subset $C$ with $A\subseteq C\subseteq B$ is uniquely of the form $C=A\cup S$ for a subset $S\subseteq B\setminus A$. For such a $C$,
\begin{align*}
|C\setminus A|=|S|.
\end{align*}
Therefore
\begin{align*}
\sum_{A\subseteq C\subseteq B} (-1)^{|C\setminus A|}=\sum_{S\subseteq B\setminus A}(-1)^{|S|}.
\end{align*}
Grouping the subsets $S\subseteq B\setminus A$ by their size gives
\begin{align*}
\sum_{S\subseteq B\setminus A}(-1)^{|S|}=\sum_{j=0}^{m}\binom{m}{j}(-1)^j.
\end{align*}
By the [binomial theorem](/theorems/750),
\begin{align*}
\sum_{j=0}^{m}\binom{m}{j}(-1)^j=(1+(-1))^m=0^m=0,
\end{align*}
because $m>0$. Thus the proposed values give $1$ on the diagonal and give total sum $0$ over every nontrivial interval, so they satisfy the Möbius inverse relations. Hence
\begin{align*}
\mu_{B_n}(A,B)=(-1)^{|B\setminus A|}.
\end{align*}
The sign is positive when $B$ is obtained from $A$ by adding an even number of elements and negative when it is obtained by adding an odd number, which is exactly the sign pattern used in inclusion-exclusion.
[/example]
The example explains why Möbius inversion is not an isolated trick. Inclusion-exclusion is the Boolean case of a much broader inverse operation, and the course will repeatedly identify familiar formulas as Möbius inversion on a suitable poset.
## How Do Chains And Antichains Measure Size?
If a poset is not totally ordered, what does it mean for it to be large in a vertical or horizontal direction? Chains measure vertical length, while antichains measure horizontal spread. The tension between these two notions leads to the decomposition theorems in the early part of the course.
[definition: Chain And Antichain]
Let $(P, \le)$ be a poset. A chain in $P$ is a subset $C \subset P$ such that any two elements of $C$ are comparable. An antichain in $P$ is a subset $A \subset P$ such that no two distinct elements of $A$ are comparable.
[/definition]
Chains and antichains are dual ways of extracting simple structure from a complicated order. A chain decomposition covers the poset by totally ordered pieces, while the largest antichain measures how many pieces are necessary; the next theorem says these two measurements agree in finite posets.
[quotetheorem:8083]
[citeproof:8083]
[Dilworth's theorem](/theorems/8083) turns a structural question about partial orders into a matching problem. The finiteness assumption matters for the stated form because both sides are phrased using a maximum and a minimum. For example, take levels $A_n$ of size $n$ for $n \in \mathbb{N}$, and order every element of $A_n$ below every element of $A_m$ when $n < m$, with no comparisons inside a level; the antichains have arbitrarily large finite size but no maximum finite size, so the displayed equality is not a well-formed finite statement. The theorem also does not describe which chain cover is canonical, nor does it say that every largest antichain is compatible with every minimum chain cover. Its forward use is that width can be attacked by constructing chain decompositions, which is the route into Mirsky's theorem, Sperner theory, and rank-based inequalities.
[example: Antichains In The Boolean Lattice]
Write $[n]=\{1,\dots,n\}$, and let $\binom{[n]}{k}$ denote the set of all $k$-element subsets of $[n]$. If $A,B\in \binom{[n]}{k}$ and $A\subseteq B$, then $|A|=k=|B|$, so the inclusion cannot be proper; hence $A=B$. Thus no two distinct elements of $\binom{[n]}{k}$ are comparable, and every rank level is an antichain.
The size of the $k$th rank level is $\binom{n}{k}$. To locate the largest level, compare consecutive sizes:
\begin{align*}
\frac{\binom{n}{k+1}}{\binom{n}{k}}=\frac{n!}{(k+1)!(n-k-1)!}\cdot \frac{k!(n-k)!}{n!}=\frac{n-k}{k+1}.
\end{align*}
This ratio is greater than $1$ exactly when $n-k>k+1$, equivalently $k<\frac{n-1}{2}$, and it is less than $1$ exactly when $k>\frac{n-1}{2}$. Therefore the binomial coefficients increase up to the middle rank and then decrease. If $n=2r$, the maximum occurs at $k=r=\lfloor n/2\rfloor=\lceil n/2\rceil$; if $n=2r+1$, the two middle values satisfy
\begin{align*}
\binom{2r+1}{r+1}=\frac{(2r+1)!}{(r+1)!r!}=\frac{(2r+1)!}{r!(r+1)!}=\binom{2r+1}{r},
\end{align*}
so the maximum occurs at both $k=r=\lfloor n/2\rfloor$ and $k=r+1=\lceil n/2\rceil$.
Thus the largest rank level has size $\binom{n}{\lfloor n/2\rfloor}$. By *[Sperner's Theorem](/theorems/2585)*, no antichain in $B_n$ has more elements than this, so the middle rank levels are extremal antichains, not merely examples of antichains.
[/example]
The Boolean lattice again supplies the model calculation. Later, the normalized matching property and the [LYM inequality](/theorems/2586) will explain why rank levels dominate antichain size in a much more systematic way.
## When Does A Poset Become A Lattice?
What extra structure appears when any two elements have a best common lower bound and a best common upper bound? This question moves the course from general posets to lattices. Lattices connect order theory with algebra because their operations behave like meet and join versions of multiplication and addition.
[definition: Lattice]
A lattice is a poset $(L, \le)$ together with operations
\begin{align*}
\wedge, \vee : L \times L \to L
\end{align*}
such that for every $x,y \in L$, the elements $x \wedge y$ and $x \vee y$ satisfy both of the following conditions:
For every $z \in L$,
\begin{align*}
z \le x \text{ and } z \le y \iff z \le x \wedge y.
\end{align*}
For every $z \in L$,
\begin{align*}
x \le z \text{ and } y \le z \iff x \vee y \le z.
\end{align*}
[/definition]
The element $x \wedge y$ is the meet of $x$ and $y$, and $x \vee y$ is their join. The definition packages two universal properties: meet is the greatest lower bound, and join is the least upper bound.
[example: Divisor Lattice]
Let $n$ be a positive integer, and let $L$ be the set of positive divisors of $n$, ordered by divisibility. Write the prime factorisation as
\begin{align*}
n=p_1^{e_1}p_2^{e_2}\cdots p_r^{e_r}.
\end{align*}
Every divisor $a\in L$ has a unique form
\begin{align*}
a=p_1^{\alpha_1}p_2^{\alpha_2}\cdots p_r^{\alpha_r}
\end{align*}
with $0\le \alpha_i\le e_i$ for each $i$. If
\begin{align*}
b=p_1^{\beta_1}p_2^{\beta_2}\cdots p_r^{\beta_r},
\end{align*}
then $a\mid b$ exactly when $\alpha_i\le \beta_i$ for every $i$.
For $a,b\in L$, define
\begin{align*}
d=p_1^{\min(\alpha_1,\beta_1)}p_2^{\min(\alpha_2,\beta_2)}\cdots p_r^{\min(\alpha_r,\beta_r)}.
\end{align*}
Since $\min(\alpha_i,\beta_i)\le \alpha_i$ and $\min(\alpha_i,\beta_i)\le \beta_i$, we have $d\mid a$ and $d\mid b$. If $z\mid a$ and $z\mid b$, writing $z=p_1^{\gamma_1}\cdots p_r^{\gamma_r}$ gives $\gamma_i\le \alpha_i$ and $\gamma_i\le \beta_i$, hence $\gamma_i\le \min(\alpha_i,\beta_i)$ for every $i$, so $z\mid d$. Thus $d$ is the greatest common lower bound of $a$ and $b$, and
\begin{align*}
a\wedge b=\gcd(a,b).
\end{align*}
Similarly, define
\begin{align*}
m=p_1^{\max(\alpha_1,\beta_1)}p_2^{\max(\alpha_2,\beta_2)}\cdots p_r^{\max(\alpha_r,\beta_r)}.
\end{align*}
Because $\alpha_i\le \max(\alpha_i,\beta_i)$ and $\beta_i\le \max(\alpha_i,\beta_i)$, we have $a\mid m$ and $b\mid m$. If $a\mid z$ and $b\mid z$, with $z=p_1^{\gamma_1}\cdots p_r^{\gamma_r}$, then $\alpha_i\le \gamma_i$ and $\beta_i\le \gamma_i$, so $\max(\alpha_i,\beta_i)\le \gamma_i$ for every $i$, hence $m\mid z$. Therefore $m$ is the least common upper bound of $a$ and $b$, and
\begin{align*}
a\vee b=\operatorname{lcm}(a,b).
\end{align*}
Under the exponent-vector identification
\begin{align*}
a\longleftrightarrow (\alpha_1,\dots,\alpha_r),
\end{align*}
the divisor lattice is the product of the chains $\{0,1,\dots,e_i\}$. Meet and join are computed coordinatewise by $\min$ and $\max$. For each coordinate,
\begin{align*}
\min(\alpha,\max(\beta,\gamma))=\max(\min(\alpha,\beta),\min(\alpha,\gamma)).
\end{align*}
This identity follows by checking the two cases $\alpha\le \max(\beta,\gamma)$ and $\alpha>\max(\beta,\gamma)$; in the second case both sides equal $\max(\beta,\gamma)$. The dual identity with $\min$ and $\max$ interchanged is checked the same way, so meet distributes over join and join distributes over meet. Thus the positive divisors of $n$, ordered by divisibility, form a distributive lattice whose operations are $\gcd$ and $\operatorname{lcm}$.
[/example]
The divisor lattice shows how a lattice can encode familiar algebraic operations. To connect lattice elements back to the underlying poset, we need a way to replace downward-closed subsets by their boundary antichains.
[quotetheorem:8084]
[citeproof:8084]
This bijection is a small result with large consequences. Finiteness is used to ensure that every nonempty ideal has maximal elements and that every element of an ideal lies below one of them; in the poset $(\mathbb{N}, \le)$, the whole set $\mathbb{N}$ is an order ideal with no maximal element. The hypothesis that the subset is an order ideal is also essential: the subset $\{1,3\} \subset \{1<2<3\}$ has maximal element $3$, but it is not recovered as all elements below $3$. The theorem classifies order ideals by their boundary antichains, but it does not classify arbitrary subsets, filters, or lattice congruences. It converts a downward-closed family into a boundary object, and similar conversions between global subsets and extremal elements appear repeatedly in the theory of distributive lattices.
## Where Topology Enters
How can a finite poset carry topological information? The bridge is the order complex, which turns chains into simplices. This allows topological properties of simplicial complexes to be read from order relations.
[definition: Order Complex]
Let $(P, \le)$ be a finite poset. The order complex $\Delta(P)$ is the simplicial complex whose vertices are the elements of $P$ and whose simplices are the finite chains in $P$.
[/definition]
The order complex replaces comparisons by faces. A chain of $k+1$ elements becomes a $k$-simplex, so the topology of $\Delta(P)$ records how chains fit together.
[example: Proper Part Of The Boolean Lattice]
Let $\overline{B}_n=B_n\setminus\{\varnothing,\{1,\dots,n\}\}$, where $B_n$ is ordered by inclusion. Its elements are exactly the nonempty proper subsets of $[n]=\{1,\dots,n\}$, so the vertices of $\Delta(\overline{B}_n)$ are exactly those subsets. A $k$-simplex of $\Delta(\overline{B}_n)$ is a chain of $k+1$ distinct elements of $\overline{B}_n$, hence has the form
\begin{align*}
A_0\subset A_1\subset \cdots \subset A_k
\end{align*}
with each $A_i$ a nonempty proper subset of $[n]$.
Now view $[n]$ as the vertex set of an $(n-1)$-simplex $\Sigma$. The faces of the boundary $\partial\Sigma$ are exactly the nonempty proper subsets of $[n]$: nonempty subsets give nonempty faces, and the full subset $[n]$ is excluded because it is the whole simplex rather than a boundary face. In the barycentric subdivision of $\partial\Sigma$, vertices correspond to faces of $\partial\Sigma$, and simplices correspond to chains of faces under inclusion. Therefore its $k$-simplices are precisely the chains
\begin{align*}
A_0\subset A_1\subset \cdots \subset A_k
\end{align*}
of nonempty proper subsets of $[n]$, which are exactly the $k$-simplices of $\Delta(\overline{B}_n)$.
Thus $\Delta(\overline{B}_n)$ is the barycentric subdivision of $\partial\Sigma$. Since $\partial\Sigma$ is the boundary of an $(n-1)$-simplex, it is homeomorphic to $S^{n-2}$, and barycentric subdivision does not change the underlying [topological space](/page/Topological%20Space). Hence $\Delta(\overline{B}_n)$ is homeomorphic to an $(n-2)$-sphere.
[/example]
This topological viewpoint is needed for shellability and Cohen-Macaulay posets. The course will use combinatorial conditions, such as lexicographic shellings, to prove strong homological consequences for order complexes.
## How The Course Is Organised
Which tools will be developed, and in what order? The first lectures build the basic language of finite posets, including Hasse diagrams, ranks, linear extensions, products, sums, duality, and induced subposets. The next block studies chains and antichains through Dilworth's theorem, Mirsky's theorem, Sperner theory, and the LYM inequality.
The middle of the course develops Möbius functions and incidence algebras. Zeta functions, convolution, Möbius inversion, characteristic polynomials, and examples from subspace lattices and partition lattices form the algebraic core. After that, the notes turn to lattices: modular, distributive, complemented, geometric, and semimodular lattices, including Birkhoff's representation theorem for finite distributive lattices.
The final part introduces the topology and enumeration attached to posets. We study order complexes, shellability, Cohen-Macaulay conditions, Eulerian posets, $h$-vectors, and $P$-partitions. The course ends by connecting finite posets to quasisymmetric functions, descent sets, and order polynomials.
[remark: Recurring Examples]
The Boolean lattice, divisor lattice, face posets of simplices, Young's lattice truncated by size, partition lattices, and subspace lattices over finite fields will recur throughout the course. Each example supports several viewpoints: diagrammatic, enumerative, algebraic, and topological. Returning to the same examples makes it possible to compare the different invariants without changing the underlying object each time.
[/remark]
The aim is not to collect unrelated facts about ordered sets. The aim is to learn a small number of durable techniques: pass to intervals, invert in an incidence algebra, decompose by chains, compare ranks, represent lattices through special elements, and convert chains into simplices. These techniques are the connective tissue of algebraic combinatorics in the poset setting.
# 1. Finite Posets and Their Invariants
Finite posets are the basic objects of this course: finite sets equipped with a relation that records when one object is contained in, divides, refines, or precedes another. The first task is to learn which data of a poset are structural, which are enumerative, and how the standard constructions behave. Later chapters will build incidence algebras, Möbius functions, lattice operations, and topological invariants on top of the language introduced here.
## From Order Relations to Diagrams
How much information is needed to recover a finite order? A relation may list many comparisons, but finite posets have a smaller graphical skeleton: the cover relations. This section introduces the vocabulary used throughout the course and separates the order itself from the diagrammatic conventions used to draw it.
[definition: Partially Ordered Set]
A partially ordered set, or poset, is a pair $(P, \le)$ consisting of a set $P$ and a relation $\le$ on $P$ such that, for all $x,y,z \in P$,
a. $x \le x$,
b. if $x \le y$ and $y \le x$, then $x=y$,
c. if $x \le y$ and $y \le z$, then $x \le z$.
[/definition]
The three conditions are reflexivity, antisymmetry, and transitivity. We usually write $P$ for the poset once the order is fixed, and write $x<y$ when $x\le y$ and $x\ne y$. The first examples show how familiar containment and divisibility relations become posets with many comparisons implicit in the notation.
[example: Divisibility Poset]
Let $P=\{d\in \mathbb Z_{>0}:d\mid 12\}$, ordered by divisibility, so $a\le b$ means that $b=ak$ for some $k\in\mathbb Z_{>0}$. Since
\begin{align*}
12=1\cdot 12=2\cdot 6=3\cdot 4,
\end{align*}
the positive divisors are
\begin{align*}
P=\{1,2,3,4,6,12\}.
\end{align*}
The element $1$ is minimal because $1\mid d$ for every $d\in P$, and $12$ is maximal because $d\mid 12$ for every $d\in P$.
For example, $2<6$ since $6=2\cdot 3$, and $6<12$ since $12=6\cdot 2$, so $2<6<12$. Similarly, $3<6$ since $6=3\cdot 2$, and $3<12$ since $12=3\cdot 4$. The elements $4$ and $6$ are incomparable: $4\nmid 6$ because $6/4=3/2$ is not an integer, and $6\nmid 4$ because $4/6=2/3$ is not an integer. This example shows that divisibility gives many comparable pairs, but not every two divisors must be comparable.
[/example]
The divisibility example contains comparisons such as $1<2<6<12$, so listing both $1<6$ and $1<12$ repeats information forced by transitivity. This motivates the cover relation: it extracts the immediate comparisons needed to draw a finite poset without all transitive consequences.
[definition: Cover Relation]
Let $P$ be a poset and let $x,y\in P$. We say that $y$ covers $x$, written $x\lessdot y$, if $x<y$ and there is no $z\in P$ with $x<z<y$.
[/definition]
The cover relation is not itself transitive, so it is not a replacement order relation. If we keep only covers, we must know that no comparison has been lost permanently: whenever $x<y$, there should be a finite ladder of immediate steps from $x$ to $y$. In a finite poset this ladder exists because one can choose a shortest chain between the two endpoints, and shortestness forces each step to be a cover. That is the obstruction the formal statement resolves, and it is what makes Hasse diagrams faithful rather than merely suggestive.
[quotetheorem:8085]
[citeproof:8085]
This theorem explains why Hasse diagrams are central in finite examples: they remove redundant comparisons while preserving all order information. The finiteness hypothesis is doing real work; for example, the usual order on $\mathbb Q$ has no cover relations at all, because between any two distinct rationals lies another rational, so its cover graph cannot recover the order. The theorem does not say that a drawing of the Hasse diagram has a unique planar shape, only that the abstract cover relation determines the finite order. The next local pieces of a poset are intervals, because many later constructions attach algebraic data not to individual elements but to comparable pairs.
[definition: Interval]
Let $P$ be a poset and let $x,y\in P$ with $x\le y$. The interval from $x$ to $y$ is
\begin{align*}
[x,y] = \{z\in P : x\le z\le y\}.
\end{align*}
[/definition]
Intervals are the subposets on which many invariants are computed. Incidence algebras and Möbius functions, introduced in Chapters 3 and 4, are built by assigning numbers to intervals. The Boolean lattice gives the basic model, because every interval in it is again a smaller Boolean lattice.
[example: Boolean Intervals]
Let $B_n$ be the Boolean lattice of all subsets of $\{1,\dots,n\}$, ordered by inclusion, and suppose $A\subseteq C$. The interval from $A$ to $C$ is
\begin{align*}
[A,C]=\{S\subseteq \{1,\dots,n\}: A\subseteq S\subseteq C\}.
\end{align*}
We show that this interval has the same order structure as the Boolean lattice on the set $C\setminus A$.
Define
\begin{align*}
\varphi:[A,C]\to B_{C\setminus A}
\end{align*}
by $\varphi(S)=S\setminus A$. If $S\in[A,C]$, then $A\subseteq S\subseteq C$, so every element of $S\setminus A$ lies in $C\setminus A$; hence $\varphi(S)\subseteq C\setminus A$.
Define the reverse map
\begin{align*}
\psi:B_{C\setminus A}\to [A,C]
\end{align*}
by $\psi(T)=A\cup T$. If $T\subseteq C\setminus A$, then $A\subseteq A\cup T$, and since $A\subseteq C$ and $T\subseteq C\setminus A\subseteq C$, we have $A\cup T\subseteq C$. Thus $\psi(T)\in[A,C]$.
For $S\in[A,C]$, we have $A\subseteq S$, so
\begin{align*}
\psi(\varphi(S))=A\cup(S\setminus A)=S.
\end{align*}
For $T\subseteq C\setminus A$, no element of $T$ lies in $A$, so
\begin{align*}
\varphi(\psi(T))=(A\cup T)\setminus A=T.
\end{align*}
Thus $\varphi$ and $\psi$ are inverse bijections. Finally, if $S_1\subseteq S_2$ in $[A,C]$, then $S_1\setminus A\subseteq S_2\setminus A$; conversely, if $S_1\setminus A\subseteq S_2\setminus A$, then every element of $S_1$ is either in $A$ or in $S_1\setminus A$, hence lies in $A\cup(S_2\setminus A)=S_2$. Therefore $S_1\subseteq S_2$ if and only if $\varphi(S_1)\subseteq\varphi(S_2)$, so $[A,C]\cong B_{|C\setminus A|}$. The interval between two subsets is therefore just a smaller Boolean lattice whose coordinates are the elements that may be added to $A$ while staying inside $C$.
[/example]
## Ideals, Filters, and Linear Extensions
A finite poset usually contains many incomparable elements, so we often study it through compatible subsets or compatible total orderings. Which subsets are closed under moving downward or upward, and how can a partial order be listed linearly without violating its comparisons?
[definition: Order Ideal]
Let $P$ be a poset. An order ideal is a subset $I\subset P$ such that, whenever $x\in I$, $y\in P$, and $y\le x$, we have $y\in I$.
[/definition]
An ideal records a possible collection of elements already chosen if prerequisites must be chosen first. Once downward-closed subsets have been named, the upward-closed version is needed because reversing all comparisons should give a parallel language.
[definition: Filter]
Let $P$ be a poset. A filter is a subset $F\subset P$ such that, whenever $x\in F$, $y\in P$, and $x\le y$, we have $y\in F$.
[/definition]
Ideals and filters are interchanged by reversing the order, so they encode complementary closure conditions. To describe finite ideals without listing every element below the boundary, we need the notion of a set of mutually incomparable maximal choices.
[definition: Antichain]
Let $P$ be a poset. An antichain is a subset $A\subset P$ such that no two distinct elements of $A$ are comparable.
[/definition]
Antichains record horizontal structure in a poset, while ideals record downward closure. In a finite poset these two descriptions contain the same information: an ideal is determined by its maximal boundary, and an antichain determines everything below it.
[quotetheorem:8084]
[citeproof:8084]
The theorem converts a closed-subset question into an antichain question. The finiteness hypothesis is essential: in the infinite chain $(\mathbb N,\le)$, the ideal $\mathbb N$ has no maximal element, so the map $I\mapsto \max(I)$ would send it to the empty antichain and lose all information. The theorem does not say that every subset is determined by its maximal elements; downward closure is the condition that makes the boundary description complete. This conversion will reappear in Sperner theory and in the study of distributive lattices of ideals. A small Boolean example shows the boundary description in a form that can be checked by listing subsets.
[example: Ideals In A Boolean Lattice]
In $B_3$, ordered by inclusion, let $A=\{\{1,2\},\{3\}\}$. The ideal generated by $A$ is
\begin{align*}
\langle A\rangle=\{S\subseteq \{1,2,3\}: S\subseteq \{1,2\}\text{ or }S\subseteq \{3\}\}.
\end{align*}
The subsets of $\{1,2\}$ are $\varnothing,\{1\},\{2\},\{1,2\}$, and the subsets of $\{3\}$ are $\varnothing,\{3\}$. Taking the union of these two lists gives
\begin{align*}
\langle A\rangle=\{\varnothing,\{1\},\{2\},\{1,2\},\{3\}\}.
\end{align*}
This set is downward closed: if $T\in\langle A\rangle$ and $S\subseteq T$, then $T\subseteq \{1,2\}$ or $T\subseteq \{3\}$, so $S\subseteq \{1,2\}$ or $S\subseteq \{3\}$, hence $S\in\langle A\rangle$.
The maximal elements of $\langle A\rangle$ are exactly $\{1,2\}$ and $\{3\}$. Indeed, $\varnothing\subset \{1\}$, $\varnothing\subset \{3\}$, $\{1\}\subset \{1,2\}$, and $\{2\}\subset \{1,2\}$, so none of $\varnothing,\{1\},\{2\}$ is maximal. The two remaining elements are incomparable, since neither $\{1,2\}\subseteq \{3\}$ nor $\{3\}\subseteq \{1,2\}$ holds. Thus
\begin{align*}
\max(\langle A\rangle)=\{\{1,2\},\{3\}\}=A.
\end{align*}
The ideal is therefore completely described by its antichain boundary.
[/example]
The ideal-antichain bijection linearises closed subsets by their maximal elements, but sometimes we need to linearise the whole poset. An arbitrary listing is not enough: in the divisibility poset of divisors of $12$, listing $6$ before $2$ destroys the comparison $2<6$ and cannot support an induction that depends on smaller elements already being handled. This leads to total listings that respect all required comparisons and allow induction over the elements of $P$.
[definition: Linear Extension]
Let $P$ be a finite poset. A linear extension of $P$ is a bijection $\ell:P\to \{1,\dots,|P|\}$ such that $x<y$ in $P$ implies $\ell(x)<\ell(y)$.
[/definition]
A linear extension would fail to exist if the poset forced an infinite descent or if some required comparison had to be reversed in the listing. For finite posets, the obstruction is avoided by repeatedly choosing a minimal remaining element: no smaller unlisted element is being skipped, so the partial listing remains compatible with the order. The formal result packages this recursive choice into the induction device used throughout finite poset arguments.
[quotetheorem:8086]
[citeproof:8086]
Linear extensions give a bridge from partial orders to permutations. The finiteness hypothesis supplies the elementary recursive proof; for instance, the infinite chain $(\mathbb Z,\le)$ has no minimal element, so the remove-a-minimal-element construction cannot even start. The theorem does not say that the extension is unique: an antichain with two elements has two different linear extensions. Chapter 11 refines this idea through $P$-partitions, which allow order-preserving maps into ordered sets rather than only bijective listings.
## Rank and Graded Posets
Some posets have layers, and many enumerative invariants depend on counting elements layer by layer. The obstruction is that a poset may contain two saturated paths between the same endpoints with different lengths; for example, take elements $x,a,b,c,y$ with covers $x\lessdot a\lessdot y$ and $x\lessdot b\lessdot c\lessdot y$. The guiding question is when all maximal chains from the bottom to a given element have the same length, so that a rank is well-defined.
[definition: Chain]
Let $P$ be a poset. A chain is a subset $C\subset P$ such that every two elements of $C$ are comparable.
[/definition]
Chains measure vertical structure: their lengths detect how many cover steps separate comparable elements. This motivates a rank function, which assigns a numerical height to each element in a way compatible with every immediate upward step.
[definition: Rank Function]
Let $P$ be a finite poset. A rank function on $P$ is a map $\rho:P\to \mathbb N\cup\{0\}$ such that, whenever $x\lessdot y$, we have $\rho(y)=\rho(x)+1$.
[/definition]
A rank function alone is determined only up to additive constants on connected components of the Hasse diagram. This motivates the definition of a graded poset, where minimal elements are placed in rank $0$ so that the bottom layer has a fixed meaning.
[definition: Graded Poset]
A finite poset $P$ is graded if it has a rank function $\rho:P\to \mathbb N\cup\{0\}$ and all minimal elements of $P$ have rank $0$.
[/definition]
The definition says how ranks change along covers, but it remains to check that different saturated paths between the same endpoints have the same length. The Jordan-Dedekind condition is the formal statement that rank removes this path dependence.
[quotetheorem:8087]
[citeproof:8087]
The theorem justifies speaking of the length of an interval in a graded poset. The graded hypothesis is essential: in the five-element poset with saturated chains $x\lessdot a\lessdot y$ and $x\lessdot b\lessdot c\lessdot y$, the interval $[x,y]$ has saturated chains of lengths $2$ and $3$. The theorem does not say that every finite poset admits a rank function; it says that once rank increments are compatible with covers, chain length becomes endpoint data. Once the elements are organised into ranks, the next invariant counts how many elements occur at each level and records those counts in a generating function.
[definition: Rank Generating Function]
Let $P$ be a finite graded poset with rank function $\rho$. The rank generating function of $P$ is the polynomial $R_P(q)\in \mathbb Z[q]$ defined by
\begin{align*}
R_P(q)=\sum_{x\in P} q^{\rho(x)}.
\end{align*}
Here $\mathbb Z[q]$ denotes the [polynomial ring](/page/Polynomial%20Ring) in the indeterminate $q$ with integer coefficients.
[/definition]
Equivalently, after the polynomial has been defined, it may be evaluated at integer values of $q$ to obtain a function $\mathbb Z\to \mathbb Z$. The coefficients of $R_P(q)$ are the rank numbers of $P$. This polynomial is more than decoration: it remembers the layer sizes while deliberately forgetting how elements inside adjacent layers are connected. The Boolean lattice is the central model because its ranks are counted by binomial coefficients.
[example: Boolean Rank Generating Function]
In $B_n$, the rank of a subset $S\subseteq \{1,\dots,n\}$ is $\rho(S)=|S|$. By the definition of the rank generating function,
\begin{align*}
R_{B_n}(q)=\sum_{S\subseteq \{1,\dots,n\}}q^{|S|}.
\end{align*}
Group the subsets according to their cardinality. For each $k\in\{0,\dots,n\}$, there are $\binom{n}{k}$ subsets of $\{1,\dots,n\}$ with exactly $k$ elements, so the contribution of all rank-$k$ subsets is $\binom{n}{k}q^k$. Hence
\begin{align*}
R_{B_n}(q)=\sum_{k=0}^{n}\binom{n}{k}q^k.
\end{align*}
Expanding $(1+q)^n$ by choosing either $1$ or $q$ from each of the $n$ factors, the terms with exactly $k$ choices of $q$ contribute $\binom{n}{k}q^k$, so
\begin{align*}
(1+q)^n=\sum_{k=0}^{n}\binom{n}{k}q^k.
\end{align*}
Therefore
\begin{align*}
R_{B_n}(q)=(1+q)^n.
\end{align*}
Thus the binomial coefficients appear as the rank numbers of the Boolean lattice.
[/example]
The Boolean example has independent choices of including or excluding each element. Divisor lattices show the same principle with several possible exponents in each prime direction.
[example: Divisor Lattice Rank]
Let $N=p_1^{a_1}\cdots p_r^{a_r}$, where the $p_i$ are distinct primes, and let $P$ be the positive divisors of $N$, ordered by divisibility. By unique factorization, every $d\in P$ has a unique form
\begin{align*}
d=p_1^{b_1}\cdots p_r^{b_r}
\end{align*}
with $0\le b_i\le a_i$ for each $i$. We use
\begin{align*}
\rho(d)=b_1+\cdots+b_r
\end{align*}
as the rank.
To see why this rank matches the cover relations, suppose
\begin{align*}
d=p_1^{b_1}\cdots p_r^{b_r}.
\end{align*}
If $b_j<a_j$, then
\begin{align*}
p_jd=p_1^{b_1}\cdots p_j^{b_j+1}\cdots p_r^{b_r}
\end{align*}
covers $d$: any divisor $e$ with $d\mid e\mid p_jd$ must have exponent $b_i$ in every coordinate $i\ne j$, and exponent either $b_j$ or $b_j+1$ in the $j$th coordinate, so $e=d$ or $e=p_jd$. Conversely, if $e$ covers $d$, then the exponent vector of $e$ is coordinatewise at least that of $d$. If two coordinates increased, or if one coordinate increased by at least $2$, one could increase just one exponent by $1$ to get a divisor strictly between $d$ and $e$, contradicting that $e$ covers $d$. Thus each cover increases exactly one exponent by $1$, and therefore increases $\rho$ by $1$.
By the definition of the rank generating function,
\begin{align*}
R_P(q)=\sum_{d\mid N}q^{\rho(d)}.
\end{align*}
Writing each divisor by its exponent vector gives
\begin{align*}
R_P(q)=\sum_{0\le b_1\le a_1,\dots,0\le b_r\le a_r}q^{b_1+\cdots+b_r}.
\end{align*}
Since $q^{b_1+\cdots+b_r}=q^{b_1}\cdots q^{b_r}$, choosing the exponent $b_i$ independently in the range $0,\dots,a_i$ gives
\begin{align*}
R_P(q)=\left(\sum_{b_1=0}^{a_1}q^{b_1}\right)\cdots\left(\sum_{b_r=0}^{a_r}q^{b_r}\right).
\end{align*}
Equivalently,
\begin{align*}
R_P(q)=\prod_{i=1}^{r}(1+q+\cdots+q^{a_i}).
\end{align*}
Thus the coefficient of $q^k$ counts the divisors of $N$ whose prime-exponent sum is $k$.
[/example]
## Order Complexes and Counting Chains
Chains are not only subsets of a poset; they are also faces of a simplicial complex. This topological viewpoint becomes important in shellability and Cohen-Macaulay theory, but its first use is to organise chain enumeration. The same finite-poset language also appears in arithmetic examples: divisors ordered by divisibility form intervals controlled by prime exponents, and subspace lattices over finite fields lead to $q$-binomial coefficients. These examples explain why the course treats order, counting, and topology as parts of the same finite framework rather than as separate topics.
[definition: Order Complex]
Let $P$ be a finite poset. The order complex $\Delta(P)$ is the simplicial complex whose faces are the finite chains in $P$.
[/definition]
The empty chain is the empty face, and a $k$-dimensional face corresponds to a chain with $k+1$ elements. Removing a minimum or maximum element is common when studying topology, because cone points make the complex contractible. The face poset of a simplex gives the standard geometric example.
[example: Face Poset Of A Simplex]
Let $\Sigma$ be an $(n-1)$-simplex with vertex set $\{1,\dots,n\}$. A face of $\Sigma$ is determined by its set of vertices, so the nonempty faces are exactly the nonempty subsets $F\subseteq \{1,\dots,n\}$. Ordered by containment of faces, this is the induced subposet $B_n\setminus\{\varnothing\}$ of the Boolean lattice.
The order complex $\Delta(B_n\setminus\{\varnothing\})$ has one vertex for each nonempty face $F$ of $\Sigma$, and its simplices are precisely the chains
\begin{align*}
F_0\subset F_1\subset \cdots \subset F_k
\end{align*}
of nonempty faces. In the barycentric subdivision of $\Sigma$, the vertices are also indexed by the nonempty faces of $\Sigma$: the vertex corresponding to $F$ is the barycenter of the face $F$. A set of barycenters spans a simplex in the barycentric subdivision exactly when the corresponding faces are nested by inclusion, say
\begin{align*}
F_0\subset F_1\subset \cdots \subset F_k.
\end{align*}
Thus the faces of the order complex and the simplices of the barycentric subdivision are indexed by the same chains of nonempty faces. Therefore the order complex of the face poset $B_n\setminus\{\varnothing\}$ is the barycentric subdivision of $\Sigma$.
[/example]
The order complex counts strict chains as faces, while many algebraic-combinatorial applications count weak order-preserving maps into a finite chain. This leads to zeta polynomials, which are an enumerative precursor to incidence algebra.
[definition: Zeta Polynomial]
Let $P$ be a finite poset. The zeta counting function is the map $Z_P:\mathbb Z_{>0}\to \mathbb N\cup\{0\}$ defined by letting $Z_P(m)$ be the number of weakly order-preserving maps $f:P\to \{1,\dots,m\}$, where $x\le y$ implies $f(x)\le f(y)$.
[/definition]
For fixed $P$, this counting function agrees on positive integers with a polynomial $Z_P(m)\in \mathbb Q[m]$, called the zeta polynomial of $P$; here $\mathbb Q[m]$ is the polynomial ring in the indeterminate $m$ with rational coefficients. In later chapters, this polynomial viewpoint will be related to order polytopes and $P$-partitions. The case of a chain reduces the count to weakly increasing sequences.
[example: Zeta Polynomial Of A Chain]
Let $C_n=\{c_1<\cdots<c_n\}$. A weakly order-preserving map $f:C_n\to \{1,\dots,m\}$ is determined by the values $a_i=f(c_i)$, and the order-preserving condition is exactly
\begin{align*}
1\le a_1\le a_2\le \cdots \le a_n\le m.
\end{align*}
Thus $Z_{C_n}(m)$ is the number of weakly increasing sequences of length $n$ with entries in $\{1,\dots,m\}$.
We count these sequences by converting them into strictly increasing sequences. Given
\begin{align*}
1\le a_1\le a_2\le \cdots \le a_n\le m,
\end{align*}
define
\begin{align*}
b_i=a_i+i-1.
\end{align*}
Then $b_1=a_1\ge 1$, and $b_n=a_n+n-1\le m+n-1$. Also, for each $i<n$,
\begin{align*}
b_i=a_i+i-1\le a_{i+1}+i-1<a_{i+1}+i=b_{i+1}.
\end{align*}
So the sequence $b_1,\dots,b_n$ is strictly increasing and satisfies
\begin{align*}
1\le b_1< b_2<\cdots < b_n\le m+n-1.
\end{align*}
Conversely, given a strictly increasing sequence
\begin{align*}
1\le b_1< b_2<\cdots < b_n\le m+n-1,
\end{align*}
define
\begin{align*}
a_i=b_i-i+1.
\end{align*}
Since the $b_i$ are integers and $b_i<b_{i+1}$, we have $b_i+1\le b_{i+1}$. Therefore
\begin{align*}
a_i=b_i-i+1\le b_{i+1}-i=b_{i+1}-(i+1)+1=a_{i+1}.
\end{align*}
The bounds are $a_1=b_1\ge 1$ and
\begin{align*}
a_n=b_n-n+1\le (m+n-1)-n+1=m.
\end{align*}
Thus weakly increasing sequences $a_1,\dots,a_n$ in $\{1,\dots,m\}$ are in bijection with $n$-element subsets $\{b_1<\cdots<b_n\}$ of $\{1,\dots,m+n-1\}$. Hence
\begin{align*}
Z_{C_n}(m)=\binom{m+n-1}{n}.
\end{align*}
The zeta polynomial of a chain therefore counts multisets of size $n$ chosen from an $m$-element set.
[/example]
## Operations on Finite Posets
To build new examples, we need constructions that combine old ones while preserving the order-theoretic data we care about. The main operations are product, ordinal sum, disjoint union, duality, and passage to induced subposets.
[definition: Product Of Posets]
Let $P$ and $Q$ be posets. The product poset $P\times Q$ is the Cartesian product with order
\begin{align*}
(p,q)\le (p',q') \quad \text{if and only if} \quad p\le p' \text{ in } P \text{ and } q\le q' \text{ in } Q.
\end{align*}
[/definition]
In a product, a cover step changes one coordinate while the other coordinates stay fixed, so a candidate rank should add the coordinate ranks. The issue is whether this additive rule is globally consistent and whether counting elements of a fixed total rank separates into independent choices from the two factors. The theorem below answers both questions for graded factors, giving the precise reason that rank generating functions factor for products built from independent coordinates.
[quotetheorem:8088]
[citeproof:8088]
This theorem explains the product formula for divisor lattices, since the divisor lattice of $p_1^{a_1}\cdots p_r^{a_r}$ is a product of chains. The graded hypotheses are necessary for this rank statement: take the earlier five-element poset with covers $x\lessdot a\lessdot y$ and $x\lessdot b\lessdot c\lessdot y$. Its interval $[x,y]$ has saturated chains of lengths $2$ and $3$, so no rank function can make every cover increase rank by one while assigning a single rank to $y$. If this poset is used as a factor in a product, the same incompatible chain lengths persist in that coordinate. The theorem does not say that all invariants multiply under products; it is specifically a statement about ranks and the generating function built from them. Products combine coordinates with simultaneous comparison; the next construction instead keeps two posets side by side with no cross-comparisons.
[definition: Disjoint Union Of Posets]
Let $P$ and $Q$ be posets. Their disjoint union $P\sqcup Q$ is the disjoint union of the underlying sets, ordered by the original orders inside $P$ and $Q$, with no new comparisons between elements of $P$ and elements of $Q$.
[/definition]
Disjoint union is useful when a poset decomposes into unrelated components. A different operation forces every element of one component to lie below every element of the other, producing a vertical concatenation.
[definition: Ordinal Sum]
Let $P$ and $Q$ be posets. The ordinal sum $P\oplus Q$ is the disjoint union of the underlying sets, ordered by the original orders inside $P$ and $Q$, together with the relation $p<q$ for every $p\in P$ and $q\in Q$.
[/definition]
Disjoint union places posets side by side, while ordinal sum stacks one entirely below the other. To compare statements with their order-reversed versions, we also need an operation that reverses every comparison without changing the underlying elements.
[definition: Dual Poset]
Let $P$ be a poset. The dual poset $\operatorname{Dual}(P)$ has the same underlying set as $P$, with order $x\le_{\operatorname{Dual}(P)}y$ if and only if $y\le_P x$.
[/definition]
Duality turns ideals into filters and minimal elements into maximal elements. This motivates induced subposets, because after reversing or combining orders we often restrict attention to a selected finite family while keeping the ambient comparisons intact.
[definition: Induced Subposet]
Let $P$ be a poset and let $S\subset P$. The induced subposet on $S$ is the poset with underlying set $S$ and order inherited from $P$.
[/definition]
These constructions allow a small library of examples to generate many more. They also provide tests for proposed invariants: a useful invariant should have predictable behaviour under at least some of these operations. Truncated infinite examples are especially common in algebraic combinatorics.
[example: Truncated Young Poset]
Let $Y_{\le N}$ be the set of integer partitions $\lambda=(\lambda_1,\lambda_2,\dots)$ with $\lambda_1\ge \lambda_2\ge \cdots \ge 0$ and
\begin{align*}
|\lambda|=\lambda_1+\lambda_2+\cdots \le N.
\end{align*}
We order $Y_{\le N}$ by inclusion of Young diagrams: $\lambda\le \mu$ means that every row of $\lambda$ has at most as many boxes as the corresponding row of $\mu$, equivalently $\lambda_i\le \mu_i$ for every $i$.
The rank is the size function $\rho(\lambda)=|\lambda|$. If $\mu$ is obtained from $\lambda$ by adding one box in row $j$, then $\mu_j=\lambda_j+1$, all other row lengths are unchanged, and
\begin{align*}
|\mu|=(\lambda_1+\cdots+\lambda_j+1+\cdots)-(\lambda_1+\cdots+\lambda_j+\cdots)+|\lambda|=|\lambda|+1.
\end{align*}
There is no partition $\nu$ with $\lambda<\nu<\mu$, because any strict containment $\lambda<\nu\le \mu$ must contain a box of $\mu\setminus \lambda$, and that skew difference has exactly one box; hence $\nu=\mu$. Thus adding one box gives a cover.
Conversely, suppose $\lambda\lessdot \mu$ in $Y_{\le N}$. Since $\lambda<\mu$, at least one row length increases, so $|\mu|-|\lambda|\ge 1$. If $|\mu|-|\lambda|\ge 2$, choose one box of $\mu\setminus\lambda$ whose addition to $\lambda$ still gives a Young diagram; calling the resulting partition $\nu$, we have
\begin{align*}
|\nu|=|\lambda|+1\le |\mu|-1<N+1.
\end{align*}
Hence $\nu\in Y_{\le N}$ and $\lambda<\nu<\mu$, contradicting that $\mu$ covers $\lambda$. Therefore $|\mu|-|\lambda|=1$, so every cover adds exactly one box and increases rank by $1$.
Finally, $Y_{\le N}$ is finite: a partition in $Y_{\le N}$ has at most $N$ nonzero parts, and each part is an integer between $0$ and $N$. Thus there are at most $(N+1)^N$ possible lists of parts to check, so only finitely many partitions occur. The truncated Young poset is therefore a finite ranked model of Young's lattice, keeping exactly the partitions of sizes $0,1,\dots,N$.
[/example]
# 2. Chains, Antichains, and Decomposition Theorems
These notes continue from Chapter 1's basic language of finite posets, Hasse diagrams, intervals, rank functions, and Boolean lattices. This chapter studies how a finite poset can be cut into comparable pieces and incomparable layers. The basic numerical invariants are width and height, but the deeper point is that these numbers are also optimal decomposition lengths. Dilworth's theorem and Mirsky's theorem turn extremal questions about antichains and chains into covering theorems, and the last part of the chapter specializes these ideas to ranked posets such as Boolean lattices.
## Width, Height, and Decompositions
How much of a poset can be seen along a single line, and how much can be seen across a single level? Chains and antichains measure these two opposite directions. The first task is to make this quantitative in a way that supports exact decomposition results.
[definition: Chain]
Let $(P, \le)$ be a poset. A subset $C \subset P$ is a chain if for all $x,y \in C$, either $x \le y$ or $y \le x$.
[/definition]
Chains record total comparability inside a poset. In a Hasse diagram, a finite chain is a path moving upward through comparable elements, possibly skipping intermediate elements. To measure the opposite direction we need subsets in which no two distinct elements can be placed on the same comparable thread.
[definition: Antichain]
Let $(P, \le)$ be a poset. A subset $A \subset P$ is an antichain if for all distinct $x,y \in A$, neither $x \le y$ nor $y \le x$.
[/definition]
Antichains record the obstruction to arranging the poset into a single chain. The two extremal sizes attached to chains and antichains are the invariants that decomposition theorems will identify with minimal covers.
[definition: Height and Width]
Let $P$ be a finite poset. The height of $P$ is
\begin{align*}
h(P) &= \max\{|C| : C \subset P \text{ is a chain}\}.
\end{align*}
The width of $P$ is
\begin{align*}
w(P) &= \max\{|A| : A \subset P \text{ is an antichain}\}.
\end{align*}
[/definition]
The definitions are extremal, so they ask for large objects. The covering versions ask for many small objects whose union is the whole poset.
[definition: Chain Decomposition]
Let $P$ be a finite poset. A chain decomposition of $P$ is a collection of pairwise disjoint chains $C_1,\dots,C_r \subset P$ such that
\begin{align*}
P &= C_1 \cup \cdots \cup C_r.
\end{align*}
[/definition]
A chain decomposition places each element into one comparable thread. Since a chain contains at most one element from any antichain, every antichain gives a lower bound on the number of chains required. The dual covering problem asks for horizontal layers rather than vertical threads.
[definition: Antichain Decomposition]
Let $P$ be a finite poset. An antichain decomposition of $P$ is a collection of pairwise disjoint antichains $A_1,\dots,A_s \subset P$ such that
\begin{align*}
P &= A_1 \cup \cdots \cup A_s.
\end{align*}
[/definition]
An antichain decomposition places the poset into horizontal layers. Since an antichain contains at most one element from any chain, every chain gives a lower bound on the number of antichains required.
[example: Boolean Lattice Width and Height]
In the Boolean lattice $B_n$ of all subsets of $\{1,\dots,n\}$ ordered by inclusion, each fixed-cardinality layer is an antichain: if $S,T\subseteq \{1,\dots,n\}$ have $|S|=|T|=k$ and $S\subseteq T$, then $|T\setminus S|=|T|-|S|=0$, so $S=T$. Thus no two distinct sets in the same layer are comparable.
The rank $k$ layer consists of all $k$-element subsets of an $n$-element set, so its size is $\binom{n}{k}$. To locate the largest layer, compare adjacent binomial coefficients:
\begin{align*}
\frac{\binom{n}{k+1}}{\binom{n}{k}}=\frac{n-k}{k+1}.
\end{align*}
This ratio is at least $1$ exactly when $n-k\ge k+1$, equivalently $k\le (n-1)/2$. Hence the layer sizes increase up to the middle and decrease after the middle, so the largest layer occurs at $k=\lfloor n/2\rfloor$ or, equivalently when $n$ is odd, also at $k=\lceil n/2\rceil$.
A maximal chain is obtained by adding one new element at each step, for example
\begin{align*}
\varnothing \subset \{i_1\} \subset \{i_1,i_2\} \subset \cdots \subset \{i_1,\dots,i_n\}=\{1,\dots,n\}.
\end{align*}
The sets in this chain have cardinalities $0,1,\dots,n$, so there are $n+1$ sets. Therefore $h(B_n)=n+1$, while the middle layer already shows $w(B_n)\ge \binom{n}{\lfloor n/2\rfloor}$. Later, *Sperner's theorem for Boolean lattices* proves that this lower bound is exact: $w(B_n)=\binom{n}{\lfloor n/2\rfloor}$.
[/example]
This example shows the shape of the subject: rank layers produce antichains, while saturated chains produce vertical structure. The next results explain when these apparent lower bounds are exact.
## Dilworth's Theorem and Mirsky's Theorem
Suppose a poset contains an antichain of size $m$. Then at least $m$ chains are needed to cover the poset. The problem is whether this necessary condition is also sufficient; Dilworth's theorem says that it is.
[quotetheorem:8083]
[citeproof:8083]
Dilworth's theorem converts an extremal antichain problem into a covering problem, but its hypotheses and formulation matter. Finiteness ensures that maximum antichains, minimum chain decompositions, and maximum matchings in the auxiliary bipartite graph all exist as ordinary finite objects; without this, the same sentence may not even have a maximum to refer to. For example, take a poset built as a disjoint union of finite antichains of sizes $1,2,3,\dots$, with every element in the $n$th antichain declared less than every element in the $(n+1)$st antichain. It has antichains of every finite size but no largest finite antichain, so the finite theorem cannot be applied by choosing a numerical width. The theorem also does not say that an arbitrary greedy chain cover is optimal; optimality is certified by the matching or by a maximum antichain.
[example: Divisibility Poset on Divisors]
Let $P$ be the divisors of $60=2^2\cdot 3\cdot 5$, ordered by divisibility. Every divisor has a unique form $2^a3^b5^c$ with $a\in\{0,1,2\}$ and $b,c\in\{0,1\}$, so the map
\begin{align*}
2^a3^b5^c \longmapsto (a,b,c)
\end{align*}
identifies $P$ with the product poset
\begin{align*}
\{0,1,2\}\times \{0,1\}\times \{0,1\},
\end{align*}
where $(a,b,c)\le (a',b',c')$ exactly when $a\le a'$, $b\le b'$, and $c\le c'$.
We compute the rank-$2$ layer, meaning the exponent vectors whose coordinate sum is $2$. The possibilities are
\begin{align*}
(2,0,0),\ (1,1,0),\ (1,0,1),\ (0,1,1).
\end{align*}
If $(a,b,c)\le (a',b',c')$ and both vectors have coordinate sum $2$, then
\begin{align*}
(a'-a)+(b'-b)+(c'-c)=2-2=0.
\end{align*}
Each summand is nonnegative, so each summand is $0$, and therefore $(a,b,c)=(a',b',c')$. Thus the four rank-$2$ vectors form an antichain, corresponding to the four divisors
\begin{align*}
4,\ 6,\ 10,\ 15.
\end{align*}
Hence $w(P)\ge 4$, so any chain decomposition needs at least four chains.
Four chains actually cover all twelve divisors:
\begin{align*}
1<2<4<12<60.
\end{align*}
\begin{align*}
3<6<30.
\end{align*}
\begin{align*}
5<10<20.
\end{align*}
\begin{align*}
15.
\end{align*}
These chains are disjoint, and their sizes add to
\begin{align*}
5+3+3+1=12,
\end{align*}
which is the number of divisors of $60$. Therefore they form a chain decomposition with four chains. The antichain gives the lower bound $4$, and this explicit cover attains it, so the minimum number of chains is $4$, in agreement with *Dilworth's theorem*.
[/example]
The dual covering question asks for antichains instead of chains. Since a chain of size $h(P)$ must use distinct antichain layers, the minimum number of antichains is at least $h(P)$. Mirsky's theorem says that this lower bound is always sharp for finite posets: if $h(P)$ is the maximum size of a chain in a finite poset $P$, then $P$ can be partitioned into exactly $h(P)$ antichains, and no partition into fewer antichains is possible.
Mirsky's theorem is the exact dual companion to Dilworth's theorem in the finite setting: width controls the number of chains needed to cover a poset, while height controls the number of antichains needed to cover it. The finiteness hypothesis is essential to this clean formulation. For example, the chain $(\mathbb Z,\le)$ has antichains of size at most $1$, but it has no finite maximum height, so the statement cannot be transferred naively to arbitrary infinite posets. The theorem also asserts existence of an optimal antichain decomposition, not that every rank-like grading already gives the desired decomposition.
[example: Interval Order Chain Cover]
Let $P$ be the six closed intervals
\begin{align*}
[0,2],\ [1,3],\ [3,4],\ [4,6],\ [5,7],\ [7,8],
\end{align*}
ordered by $[a,b]<[c,d]$ exactly when every point of $[a,b]$ is less than every point of $[c,d]$, equivalently when $b<c$. Thus a chain is a left-to-right list of intervals separated by strict gaps.
The intervals $[0,2]$ and $[1,3]$ are incomparable: $[0,2]<[1,3]$ would require $2<1$, and $[1,3]<[0,2]$ would require $3<0$, both false. Hence $\{[0,2],[1,3]\}$ is an antichain of size $2$, so every chain decomposition needs at least two chains.
Now check the proposed two-chain cover. Since $2<3$ and $4<5$, we have
\begin{align*}
[0,2] < [3,4] < [5,7].
\end{align*}
Since $3<4$ and $6<7$, we also have
\begin{align*}
[1,3] < [4,6] < [7,8].
\end{align*}
These two chains are disjoint and together contain all six listed intervals, so they form a chain decomposition of $P$ with two chains. The antichain gives the lower bound $2$, and the displayed decomposition attains it; equivalently, this is the optimal cover predicted by *Dilworth's theorem*.
[/example]
The interval example emphasizes that the theorem is not restricted to ranked posets. The next section returns to ranked posets, where the antichains of interest are often expected to be whole rank levels.
## Sperner Posets and the LYM Inequality
In many ranked posets, the largest antichains are rank levels. The central question is what structural condition forces this behavior. For Boolean lattices, the answer is Sperner's theorem, strengthened by a weighted inequality due to Lubell, Yamamoto, and Meshalkin.
[definition: Sperner Poset]
Let $P$ be a finite ranked poset with rank function $\rho:P\to \{0,\dots,r\}$. Write
\begin{align*}
P_i &= \{x\in P : \rho(x)=i\}.
\end{align*}
The poset $P$ is Sperner if
\begin{align*}
w(P) &= \max_{0\le i\le r} |P_i|.
\end{align*}
[/definition]
Being Sperner says that no mixed-rank antichain beats the largest rank level. This is a strong statement because an arbitrary antichain can draw elements from many ranks. To prove such a statement, we need a condition saying that subsets cannot become too sparse as they move between adjacent ranks.
[definition: Normalized Matching Property]
Let $P$ be a finite ranked poset with rank levels $P_0,\dots,P_r$. The poset $P$ has the normalized matching property if for every $0\le i<r$ and every subset $A\subset P_i$, its upper shadow
\begin{align*}
\nabla_i &: \mathcal P(P_i)\to \mathcal P(P_{i+1})
\end{align*}
given by
\begin{align*}
\nabla_i(A) &= \{y\in P_{i+1}: x<y \text{ for some } x\in A\}
\end{align*}
satisfies
\begin{align*}
\frac{|\nabla_i(A)|}{|P_{i+1}|} \ge \frac{|A|}{|P_i|}.
\end{align*}
[/definition]
The normalized matching property says that moving one rank upward does not decrease relative size. Iterating this principle should control how an antichain can be distributed across ranks. This motivates the following theorem, which turns adjacent-rank expansion into the weighted LYM bound needed for Sperner-type results.
[quotetheorem:8089]
[citeproof:8089]
The conclusion is called an LYM inequality, and it packages all rank levels into a single numerical constraint. The normalized matching hypothesis is not decorative: it prevents a small lower-rank set from having too small an upper shadow, which would let mixed-rank antichains evade rank-size accounting. For example, in a ranked poset with $P_0=\{a,b\}$, $P_1=\{c,d\}$, and only $a<c$, the normalized matching condition fails for $A=\{b\}\subset P_0$ because its upper shadow is empty. The same sparse comparability also produces the mixed-rank antichain $\{b,c,d\}$ of size $3$, larger than either rank level. Thus the theorem cannot be expected without an expansion condition. This motivates the next theorem: once such a weighted constraint is known, the ordinary Sperner bound follows by comparing every rank size to the largest one.
[quotetheorem:8090]
[citeproof:8090]
This reduction is useful because proving a weighted inequality can be easier than classifying maximum antichains directly, but the weighted hypothesis is stronger than the final Sperner conclusion. A poset can be Sperner for accidental reasons while failing the LYM inequality for some antichain, so the implication is only a sufficient criterion, not a characterization. For a concrete example, let $P_0=\{a,b\}$ and $P_1=\{c,d,e\}$, with the only cross-rank comparabilities $a<d$ and $b<e$. The largest rank has size $3$, and every antichain has size at most $3$, so the poset is Sperner. However $\{a,b,c\}$ is an antichain with LYM weight $1/2+1/2+1/3>1$, so the LYM inequality fails. Without any LYM-type control, even the Sperner conclusion can fail: take a ranked poset with rank sizes $|P_0|=2$ and $|P_1|=2$, with a single comparability $a<c$ between the ranks. Then $\{b,c,d\}$ is a mixed-rank antichain of size $3$, larger than every rank level. In Boolean lattices, the weights have a concrete interpretation through maximal chains. This motivates proving the following Boolean-lattice LYM theorem by counting how often an antichain meets all maximal chains.
[quotetheorem:8091]
[citeproof:8091]
The LYM inequality gives a strong averaged form of Sperner's theorem. It says not only that an antichain cannot be too large, but that elements in small ranks consume more of the maximal-chain budget. The antichain hypothesis is essential: if $A$ is itself a maximal chain in $B_n$, then the same sum is $\sum_{k=0}^n 1/\binom{n}{k}$, which is already greater than $1$ for $n\ge 1$. The statement is also special to the Boolean lattice in this exact form because the denominator $\binom{n}{|S|}$ counts the size of the rank containing $S$, and the proof uses the uniform count of maximal chains through each set of a fixed size. Applying the previous implication now gives the exact width of the Boolean lattice.
[quotetheorem:8092]
[citeproof:8092]
Sperner's theorem identifies the largest possible size in Boolean lattices, but it should not be read as a theorem about all ranked posets. The Boolean lattice has strong symmetry and the LYM maximal-chain count; a ranked poset with the same rank sizes but sparse comparabilities can have larger mixed-rank antichains. For instance, if $P_0=\{a,b\}$, $P_1=\{c,d\}$, and the only comparability is $a<c$, then the largest rank has size $2$ but $\{b,c,d\}$ is an antichain of size $3$. Thus the Boolean-lattice hypotheses are doing real work, and examples help show what the extremal families look like when those hypotheses hold. The middle rank is the main source of maximum antichains, and odd-dimensional Boolean lattices have two middle ranks of the same size.
[example: Maximum Antichains in Small Boolean Lattices]
In $B_4$, any two distinct $2$-element subsets are incomparable: if $S\subseteq T$ and $|S|=|T|=2$, then $|T\setminus S|=|T|-|S|=0$, so $S=T$. Thus the rank-$2$ layer is an antichain. Its size is
\begin{align*}
\binom{4}{2}=\frac{4\cdot 3}{2\cdot 1}=6.
\end{align*}
To see that no antichain in $B_4$ has size $7$, let $A\subset B_4$ be an antichain. The rank sizes in $B_4$ are
\begin{align*}
\binom{4}{0}=1,\quad \binom{4}{1}=4,\quad \binom{4}{2}=6,\quad \binom{4}{3}=4,\quad \binom{4}{4}=1.
\end{align*}
Hence $\binom{4}{|S|}\le 6$ for every $S\subseteq \{1,2,3,4\}$, so $1/\binom{4}{|S|}\ge 1/6$. If $|A|=7$, then by the *Lubell Yamamoto Meshalkin Inequality for Boolean Lattices*,
\begin{align*}
1\ge \sum_{S\in A}\frac{1}{\binom{4}{|S|}}\ge \sum_{S\in A}\frac{1}{6}=\frac{|A|}{6}=\frac{7}{6},
\end{align*}
which is impossible because $7/6>1$. Therefore the six $2$-element subsets form a maximum antichain in $B_4$.
In $B_5$, the relevant middle ranks have sizes
\begin{align*}
\binom{5}{2}=\frac{5\cdot 4}{2\cdot 1}=10
\end{align*}
and
\begin{align*}
\binom{5}{3}=\frac{5\cdot 4\cdot 3}{3\cdot 2\cdot 1}=10.
\end{align*}
Each fixed-cardinality rank is an antichain by the same containment-and-equal-size argument, so the $2$-element subsets and the $3$-element subsets each give an antichain of size $10$. Since the rank sizes in $B_5$ are $1,5,10,10,5,1$, every $S\subseteq \{1,2,3,4,5\}$ satisfies $\binom{5}{|S|}\le 10$, and the LYM inequality gives
\begin{align*}
1\ge \sum_{S\in A}\frac{1}{\binom{5}{|S|}}\ge \sum_{S\in A}\frac{1}{10}=\frac{|A|}{10}.
\end{align*}
Thus $|A|\le 10$ for every antichain $A\subset B_5$. The two middle ranks attain this bound, while any mixed-rank antichain must still spend total LYM weight at most $1$.
[/example]
The chapter's decomposition theorems and Sperner theory fit together through the same principle introduced in Chapter 1's rank and chain language: comparability imposes global accounting constraints. Dilworth and Mirsky translate the constraints into exact covers, while LYM translates them into a weighted inequality over ranks. These tools recur later when incidence algebras, rank generating functions, and lattice operations turn poset structure into algebraic data. They also connect this part of the course to matching theory in graph theory, extremal set theory through Sperner families, and the use of rank enumerators as generating functions for combinatorial objects.
# 3. Möbius Functions and Inversion
Chapters 1 and 2 treated posets through intervals, chains, antichains, ranks, and decomposition theorems. We now add the central enumerative tool of the course: Möbius inversion on a poset. The guiding question is how to recover local data from cumulative sums over order intervals, in the same way that the classical number-theoretic Möbius function recovers an arithmetic function from its divisor sums.
## Cumulative Functions on Locally Finite Posets
Suppose that a function $g$ on a poset is obtained by summing another function $f$ over everything below a point. The first problem is to describe the universal coefficients that undo this summation, independently of the particular functions $f$ and $g$. These coefficients live on intervals, so the right finiteness hypothesis is not that the whole poset is finite, but that every interval has finite size.
[definition: Locally Finite Poset]
A poset $P$ is locally finite if every interval
\begin{align*}
[x,y] := \{z \in P : x \le z \le y\}
\end{align*}
is finite.
[/definition]
Local finiteness makes interval sums finite even when $P$ itself is infinite. This is the setting for divisor posets, subspace lattices over finite fields, and many graded combinatorial posets that are infinite but finite in bounded rank.
[example: Divisibility Poset Is Locally Finite]
Let $P=\mathbb N$ be ordered by divisibility, so $a\le b$ means $a\mid b$. To check local finiteness, fix comparable elements $a\le b$, meaning $a\mid b$. Then there is a positive integer $q$ such that $b=aq$.
The interval $[a,b]$ is
\begin{align*}
[a,b]=\{c\in \mathbb N: a\mid c \text{ and } c\mid b\}.
\end{align*}
If $c\in [a,b]$, then $a\mid c$, so $c=ad$ for some $d\in\mathbb N$. Since also $c\mid b$, there is some $e\in\mathbb N$ with $b=ce$. Substituting $b=aq$ and $c=ad$ gives
\begin{align*}
aq=ade.
\end{align*}
Cancelling the positive integer $a$ gives $q=de$, so $d\mid q=b/a$.
Conversely, if $d\mid b/a$, write $b/a=de$. Then $c=ad$ satisfies $a\mid c$ and
\begin{align*}
b=a(b/a)=ade=(ad)e=ce,
\end{align*}
so $c\mid b$. Therefore
\begin{align*}
[a,b]=\{ad : d\mid b/a\}.
\end{align*}
A positive integer has only finitely many positive divisors, so this set is finite. Hence every interval in the divisibility poset is finite, even though $\mathbb N$ itself is infinite.
[/example]
The example shows that cumulative sums over comparable pairs are meaningful beyond finite posets, provided each interval is finite. To write such sums uniformly, we need a function that marks exactly the comparable pairs and contributes one copy of each allowed term. This constant interval kernel is the zeta function.
[definition: Zeta Function of a Poset]
Let $P$ be a locally finite poset, and set
\begin{align*}
I(P) := \{(x,y) \in P \times P : x \le y\}.
\end{align*}
The zeta function of $P$ is the function $\zeta_P: I(P) \to \mathbb Z$ defined by
\begin{align*}
\zeta_P(x,y) = 1 \qquad \text{for every } (x,y) \in I(P).
\end{align*}
[/definition]
The zeta function records accumulation without distinguishing which intermediate point contributed. To reverse that accumulation, we need coefficients whose total contribution cancels on every non-singleton interval.
[definition: Möbius Function of a Poset]
Let $P$ be a locally finite poset, and let $I(P)=\{(x,y) \in P \times P : x \le y\}$. The Möbius function of $P$ is the function $\mu_P: I(P) \to \mathbb Z$ determined by $\mu_P(x,x)=1$ and, for $x<y$,
\begin{align*}
\sum_{x \le z \le y} \mu_P(x,z) = 0.
\end{align*}
[/definition]
The cancellation condition defines the inverse coefficients, but it is useful only if the values can be computed from smaller intervals. This raises the next problem: isolate the unknown top value in the cancellation equation to obtain a recurrence.
[quotetheorem:8093]
[citeproof:8093]
This recurrence is often the most practical way to compute examples, because it reduces the value on $[x,y]$ to values on intervals with the same lower endpoint and smaller upper endpoint. Local finiteness is essential here: in a poset with an interval containing infinitely many elements between $x$ and $y$, the displayed recurrence would ask for an infinite sum in $\mathbb Z$, so the coefficient would not be defined by this argument. For instance, the interval $[0,1]$ in the rationally ordered set $\mathbb Q \cap [0,1]$ contains infinitely many intermediate rationals.
The recurrence does not say that the value is determined only by the length of a longest chain. Different intervals of the same rank can have different numbers of intermediate elements and hence different cancellation behaviour. What it does say is stronger and more local: $\mu_P(x,y)$ depends only on the whole interval $[x,y]$, a principle that will drive the Boolean, divisor, product, and partition-lattice computations below.
[example: Boolean Lattice Möbius Values]
Let $B_n$ be ordered by inclusion, and fix $A\subset B$. Put $S=B\setminus A$ and $r=|S|$. The map $C\mapsto C\setminus A$ sends the interval $[A,B]$ bijectively and order-preservingly onto the Boolean lattice of subsets of $S$, so there are $\binom{r}{i}$ elements $C\in[A,B]$ with $|C\setminus A|=i$.
We show by induction on $r$ that $\mu_{B_n}(A,B)=(-1)^r$. If $r=0$, then $A=B$, and $\mu_{B_n}(A,A)=1=(-1)^0$ by the definition of the Möbius function. Now assume $r>0$ and that the formula holds for all smaller rank differences. By *[Recursive Formula for the Möbius Function](/theorems/8093)*,
\begin{align*}
\mu_{B_n}(A,B)=-\sum_{A\subseteq C\subset B}\mu_{B_n}(A,C).
\end{align*}
Group the terms according to $i=|C\setminus A|$. Since $C\subset B$, the possible values are $0\le i\le r-1$, and the induction hypothesis gives $\mu_{B_n}(A,C)=(-1)^i$ for every such $C$. Therefore
\begin{align*}
\mu_{B_n}(A,B)=-\sum_{i=0}^{r-1}\binom{r}{i}(-1)^i.
\end{align*}
Using the binomial theorem,
\begin{align*}
0=(1-1)^r=\sum_{i=0}^{r}\binom{r}{i}(-1)^i.
\end{align*}
Separating the final term gives
\begin{align*}
\sum_{i=0}^{r-1}\binom{r}{i}(-1)^i=-\binom{r}{r}(-1)^r=-(-1)^r.
\end{align*}
Substituting this into the recurrence gives
\begin{align*}
\mu_{B_n}(A,B)=-\bigl(-(-1)^r\bigr)=(-1)^r.
\end{align*}
Thus the Möbius value in a Boolean interval depends only on the number of elements added when passing from $A$ to $B$, and it is the alternating sign $(-1)^{|B\setminus A|}$.
[/example]
## Möbius Inversion on Posets
The main question is now precise: if $g(y)$ records the total contribution of $f(x)$ over all $x \le y$, how do we recover $f(y)$ from the values of $g$? In linear algebra terms, the zeta function is an upper-triangular matrix indexed by the poset, and the Möbius function supplies its inverse.
[quotetheorem:8094]
[citeproof:8094]
The statement is local in the lower interval below $y$, so for finite posets the finiteness condition is automatic. For infinite locally finite posets, local finiteness of intervals alone does not make the lower set $\{x \in P : x \le y\}$ finite. For example, take a poset with elements $a_1,a_2,\dots$ and a top element $t$, with $a_i<t$ and no other comparabilities; every interval is finite, but the sum $\sum_{x \le t} f(x)$ may contain infinitely many nonzero terms.
Thus Möbius inversion is not a licence to sum over arbitrary infinite down-sets. It applies either when the relevant lower sets are finite, as in divisor sums below a fixed integer, or when the functions have finite support on those lower sets. This distinction is why the divisor example works cleanly even though the ambient divisibility poset is infinite.
[example: Classical Divisor Mobius Inversion]
Fix $n\in\mathbb N$, and suppose $g(m)=\sum_{e\mid m}f(e)$ for every positive integer $m$. In the divisibility poset, the lower elements of $n$ are exactly its positive divisors, so *[Möbius Inversion on a Locally Finite Poset](/theorems/8094)* gives
\begin{align*}
f(n)=\sum_{d\mid n}\mu_{\mathbb N}(d,n)g(d).
\end{align*}
It remains to identify the interval coefficient. For each divisor $d\mid n$, define
\begin{align*}
\phi:[d,n]\to [1,n/d],\qquad \phi(c)=c/d.
\end{align*}
If $c\in[d,n]$, then $d\mid c$ and $c\mid n$, so $c=da$ and $n=cb$ for some $a,b\in\mathbb N$. Hence $n=dab$, so $a\mid n/d$, and $\phi(c)=a$ lies in the divisor interval $[1,n/d]$. Conversely, if $a\mid n/d$, write $n/d=ab$ and set $c=da$. Then $d\mid c$ and
\begin{align*}
n=d(n/d)=dab=cb,
\end{align*}
so $c\mid n$ and $c\in[d,n]$. Thus $\phi$ has inverse $a\mapsto da$.
The map also preserves divisibility: for $c_1,c_2\in[d,n]$ with $c_i=da_i$,
\begin{align*}
c_1\mid c_2 \quad \text{if and only if} \quad da_1\mid da_2 \quad \text{if and only if} \quad a_1\mid a_2.
\end{align*}
Therefore $[d,n]$ is isomorphic to the divisor interval $[1,n/d]$, and by *Mobius Function Depends Only on the Interval*,
\begin{align*}
\mu_{\mathbb N}(d,n)=\mu_{\mathbb N}(1,n/d).
\end{align*}
Writing $\mu_{\mathrm{arith}}(q):=\mu_{\mathbb N}(1,q)$, the inversion formula becomes
\begin{align*}
f(n)=\sum_{d\mid n}\mu_{\mathrm{arith}}(n/d)g(d).
\end{align*}
Thus the poset inversion theorem is exactly the classical divisor [Möbius inversion formula](/theorems/1749).
[/example]
The divisor example uses functions on single elements, but later arguments require functions whose inputs are whole intervals. A pair-valued summation can decompose an interval $[x,y]$ by choosing an intermediate point $z$, so the natural multiplication should add contributions over all cuts $x\le z\le y$. Local finiteness is exactly what keeps this sum finite, and the resulting function space is the incidence algebra.
[definition: Incidence Algebra]
Let $P$ be a locally finite poset and let $R$ be a commutative ring. Set
\begin{align*}
I(P) := \{(x,y) \in P \times P : x \le y\}.
\end{align*}
The incidence algebra $I(P;R)$ is the function space
\begin{align*}
I(P;R) := \{\alpha: I(P) \to R\},
\end{align*}
with product
\begin{align*}
(\alpha * \beta)(x,y) = \sum_{x \le z \le y}\alpha(x,z)\beta(z,y).
\end{align*}
[/definition]
The convolution product reflects cutting an interval $[x,y]$ at an intermediate point $z$. Once summation is encoded as multiplication by $\zeta_P$, inversion becomes a question inside the incidence algebra: does the Möbius function act as the element that cancels this cumulative summation on every interval? The theorem answers that question in the pair-valued setting, where the order of convolution matters.
[quotetheorem:8095]
[citeproof:8095]
This version is useful when a quantity attached to an interval is built by summing over all possible intermediate upper endpoints. Its hypotheses are exactly what make convolution legal: without local finiteness, an interval such as $[0,1] \subset \mathbb Q \cap [0,1]$ would produce an infinite sum over intermediate rational points, so the product in the incidence algebra would not be defined over an arbitrary ring. The theorem also does not invert sums taken over unrelated decompositions of an object; the intermediate point must lie in the same interval and the summation must match the convolution order.
This interval form prepares the product and duality rules, where the whole interval structure rather than individual vertices controls the answer. It also explains why later computations focus on recognising intervals up to isomorphism: once an interval is identified, all incidence-algebra identities on that interval transfer with it.
## Structure Theorems and Computations
After the inversion theorem, the computational problem becomes: how can we evaluate $\mu_P(x,y)$ without running the recurrence from scratch on every interval? The most reusable answers come from interval isomorphisms, products, duality, and cancellation results in lattices.
[quotetheorem:8096]
[citeproof:8096]
This explains why Boolean intervals have values depending only on rank difference, and why divisor intervals reduce to the prime factorisation of a quotient. The hypothesis is interval isomorphism, not merely having the same number of ranks or the same endpoint ranks. For example, a three-element chain from bottom to top has Möbius value $0$, while the bottom-top interval in the four-element Boolean lattice $B_2$ has Möbius value $1$; both have rank difference two, but their interval posets are not isomorphic.
The theorem therefore gives a recognition principle rather than a classification theorem. Once an interval is recognised as Boolean, a divisor lattice, or a product of smaller intervals, the Möbius value transfers; without such an isomorphism, the recurrence still has to account for the actual intermediate elements. The next theorem turns product decompositions of intervals into products of Möbius values.
[quotetheorem:8097]
[citeproof:8097]
The product formula makes the divisor example transparent, but only because the divisor interval actually decomposes as a product of prime-power chains. The conclusion cannot be inferred from having the same rank or the same number of atoms: if the interval is not isomorphic to a product interval, there is no reason for the cancellation sum to factor into independent pieces. A chain of length two and the bottom-top interval of $B_2$ again show the danger, since both are rank-two intervals but their Möbius values are different.
When a genuine product decomposition is present, each coordinate contributes independently. For divisor intervals, each prime-power chain contributes either $1$, $-1$, or $0$, and multiplying the contributions gives the classical squarefree criterion.
[example: Arithmetic Mobius Values from Prime Powers]
Write
\begin{align*}
n/d=\prod_{i=1}^{k}p_i^{a_i}
\end{align*}
with distinct primes $p_i$ and exponents $a_i\ge 1$. Every element $c\in[d,n]$ has the form
\begin{align*}
c=d\prod_{i=1}^{k}p_i^{b_i}
\end{align*}
for unique integers $b_i$ satisfying $0\le b_i\le a_i$. For two such elements,
\begin{align*}
d\prod_{i=1}^{k}p_i^{b_i}\mid d\prod_{i=1}^{k}p_i^{b_i'}
\end{align*}
holds exactly when $b_i\le b_i'$ for every $i$. Thus $[d,n]$ is order-isomorphic to the product of chains
\begin{align*}
\{0<1<\cdots<a_1\}\times\cdots\times\{0<1<\cdots<a_k\}.
\end{align*}
For a chain $0<1<\cdots<a$, the recursive definition gives $\mu(0,0)=1$. If $a=1$, then
\begin{align*}
\mu(0,1)=-\mu(0,0)=-1.
\end{align*}
If $a\ge 2$, then the recurrence gives
\begin{align*}
\mu(0,a)=-\sum_{j=0}^{a-1}\mu(0,j).
\end{align*}
Using $\mu(0,0)=1$, $\mu(0,1)=-1$, and inductively $\mu(0,j)=0$ for $2\le j<a$, the sum is
\begin{align*}
\sum_{j=0}^{a-1}\mu(0,j)=1+(-1)+0+\cdots+0=0,
\end{align*}
so $\mu(0,a)=0$ for every $a\ge 2$.
By *Product Formula for Mobius Functions*, the product decomposition of the interval gives
\begin{align*}
\mu_{\mathbb N}(d,n)=\prod_{i=1}^{k}\mu_{\{0<\cdots<a_i\}}(0,a_i).
\end{align*}
Each factor is $-1$ when $a_i=1$ and $0$ when $a_i\ge 2$. Therefore, if $n/d$ is squarefree, so that every $a_i=1$, then
\begin{align*}
\mu_{\mathbb N}(d,n)=\prod_{i=1}^{k}(-1)=(-1)^k.
\end{align*}
If some prime square divides $n/d$, then some $a_i\ge 2$, so one factor in the product is $0$ and hence
\begin{align*}
\mu_{\mathbb N}(d,n)=0.
\end{align*}
This is exactly the squarefree formula for the arithmetic Möbius value $\mu_{\mathrm{arith}}(n/d)$.
[/example]
Product decompositions do not cover every symmetry of intervals. Another basic operation is to reverse all comparisons: the interval that used to run from $x$ up to $y$ becomes the corresponding interval from $y$ up to $x$ in the dual order. Since the Möbius recurrence is defined only from the internal order of an interval, the natural question is whether reversing the order changes the scalar or merely swaps the endpoints. The result records this duality precisely.
[quotetheorem:8098]
[citeproof:8098]
Duality is useful because it lets lower-interval computations be read as upper-interval computations after reversing the order. The result requires the full order-dual interval, not just a visual symmetry of the Hasse diagram; if the reversed order is not the order relation being used, the recurrence being solved is a different one. What duality does not provide is a new numerical formula: it preserves an existing Möbius value after the endpoints are swapped.
These structural rules compute many values, but lattice theory also gives cancellation tests that force a value or a sum of values to vanish. The next theorem uses joins with a fixed non-minimal element to isolate the top of a finite lattice, so it depends on having joins and on having a finite total cancellation sum.
[quotetheorem:8099]
[citeproof:8099]
[Weisner's theorem](/theorems/8099) is often used after choosing $a$ so that the condition $x\vee a=\hat{1}$ is restrictive. Each hypothesis has a role: finiteness makes the total Möbius cancellation sum finite, the lattice structure supplies the join $x\vee a$, and the condition $a>\hat{0}$ ensures that the cancellation is taken over a non-singleton interval. If $a=\hat{0}$, the displayed sum becomes only the term with $x=\hat{1}$, which is not generally $0$; in $B_1$, it equals $\mu(\hat{0},\hat{1})=-1$.
The theorem does not determine individual Möbius values by itself. It gives a vanishing identity over a selected slice of the lattice, which becomes powerful when that slice can be described combinatorially. In geometric lattices this is a standard route to [characteristic polynomial](/page/Characteristic%20Polynomial) recurrences and sign patterns for Möbius values.
[example: Weisner Cancellation in a Boolean Lattice]
Let $U=\{1,\dots,n\}$, so in the Boolean lattice $B_n$ the minimum is $\varnothing$, the maximum is $U$, and the join is union. Choose a nonempty subset $A\subset U$. For $X\subset U$, the condition $X\vee A=\hat{1}$ means
\begin{align*}
X\cup A=U.
\end{align*}
This holds exactly when every element outside $A$ already lies in $X$, that is,
\begin{align*}
U\setminus A\subseteq X.
\end{align*}
Hence each such $X$ has a unique form
\begin{align*}
X=(U\setminus A)\cup Y \qquad \text{with } Y=X\cap A\subseteq A.
\end{align*}
By *Boolean Lattice Möbius Values*, $\mu_{B_n}(\varnothing,X)=(-1)^{|X|}$. Since $U\setminus A$ and $Y$ are disjoint,
\begin{align*}
|X|=|(U\setminus A)\cup Y|=|U\setminus A|+|Y|=n-|A|+|Y|.
\end{align*}
Therefore the Weisner sum is
\begin{align*}
\sum_{X\subset U:\,X\cup A=U}\mu_{B_n}(\varnothing,X)=\sum_{Y\subseteq A}(-1)^{n-|A|+|Y|}.
\end{align*}
Factoring out the part independent of $Y$ gives
\begin{align*}
\sum_{Y\subseteq A}(-1)^{n-|A|+|Y|}=(-1)^{n-|A|}\sum_{Y\subseteq A}(-1)^{|Y|}.
\end{align*}
Grouping subsets $Y\subseteq A$ by their size $j$ gives
\begin{align*}
\sum_{Y\subseteq A}(-1)^{|Y|}=\sum_{j=0}^{|A|}\binom{|A|}{j}(-1)^j.
\end{align*}
By the binomial theorem,
\begin{align*}
\sum_{j=0}^{|A|}\binom{|A|}{j}(-1)^j=(1-1)^{|A|}.
\end{align*}
Since $A$ is nonempty, $|A|>0$, so
\begin{align*}
(1-1)^{|A|}=0.
\end{align*}
Thus
\begin{align*}
\sum_{X\subset U:\,X\vee A=\hat{1}}\mu_{B_n}(\varnothing,X)=0,
\end{align*}
which realizes Weisner's cancellation identity here as the ordinary binomial cancellation over the subsets of $A$.
[/example]
We finish with a major family whose Möbius values are less elementary than the Boolean and divisor cases but central in algebraic combinatorics. The partition lattice shows how Möbius functions encode nontrivial factorisations over blocks.
[example: Partition Lattice Mobius Values]
Let $\Pi_n$ be ordered by refinement, so $\pi \le \sigma$ means every block of $\pi$ is contained in a block of $\sigma$. Suppose the blocks of $\sigma$ are $B_1,\dots,B_k$, and for each $i$ the block $B_i$ is the union of exactly $m_i$ blocks of $\pi$. Name those $\pi$-blocks
\begin{align*}
C_{i,1},\dots,C_{i,m_i}.
\end{align*}
Every partition $\tau$ with $\pi \le \tau \le \sigma$ can merge blocks of $\pi$ only inside a single block $B_i$ of $\sigma$, because each block of $\tau$ must still be contained in some block of $\sigma$. Thus $\tau$ determines, for each $i$, a partition $\rho_i$ of $\{1,\dots,m_i\}$ by declaring $a$ and $b$ to lie in the same block of $\rho_i$ exactly when $C_{i,a}$ and $C_{i,b}$ lie in the same block of $\tau$.
This gives a map
\begin{align*}
\Phi:[\pi,\sigma]\to \Pi_{m_1}\times\cdots\times\Pi_{m_k}, \qquad \Phi(\tau)=(\rho_1,\dots,\rho_k).
\end{align*}
It is injective because the tuple $(\rho_1,\dots,\rho_k)$ records exactly which $\pi$-blocks have been merged inside each $B_i$. It is surjective because, given partitions $\rho_i\in\Pi_{m_i}$, we construct $\tau$ by merging the blocks $C_{i,a}$ and $C_{i,b}$ precisely when $a$ and $b$ lie in the same block of $\rho_i$. This constructed $\tau$ satisfies $\pi\le\tau\le\sigma$. The map is order-preserving in both directions: $\tau\le\tau'$ exactly when every merge made by $\tau$ is also made by $\tau'$, which is exactly the condition $\rho_i\le\rho_i'$ for every $i$. Hence
\begin{align*}
[\pi,\sigma]\cong \Pi_{m_1}\times\cdots\times\Pi_{m_k}.
\end{align*}
By *Mobius Function Depends Only on the Interval*, the Möbius value $\mu_{\Pi_n}(\pi,\sigma)$ equals the Möbius value from the bottom to the top of this product interval. By *Product Formula for Mobius Functions*,
\begin{align*}
\mu_{\Pi_n}(\pi,\sigma)=\prod_{i=1}^{k}\mu_{\Pi_{m_i}}(\hat{0},\hat{1}).
\end{align*}
Using the standard top-interval value in the partition lattice,
\begin{align*}
\mu_{\Pi_m}(\hat{0},\hat{1})=(-1)^{m-1}(m-1)!,
\end{align*}
each factor becomes
\begin{align*}
\mu_{\Pi_{m_i}}(\hat{0},\hat{1})=(-1)^{m_i-1}(m_i-1)!.
\end{align*}
Substituting these factors into the product gives
\begin{align*}
\mu_{\Pi_n}(\pi,\sigma)=\prod_{i=1}^{k}(-1)^{m_i-1}(m_i-1)!.
\end{align*}
Thus the Möbius value of a partition interval factors block-by-block according to how many blocks of $\pi$ are merged inside each block of $\sigma$.
[/example]
The chapter's main lesson is that Möbius functions are local inverse coefficients for order summation. The zeta function records accumulation over intervals, the Möbius function reverses it, and the usable computations come from recognising interval structure: Boolean intervals give signs, divisor intervals give the arithmetic Möbius function, products multiply values, duality preserves them, and lattice joins create vanishing identities.
# 4. Incidence Algebras
Incidence algebras turn Möbius inversion from Chapter 3 from a formula attached to one poset into an algebraic operation. There the Möbius function appeared as the inverse of summation over intervals; here we make that inverse live inside a ring whose elements are functions on comparable pairs. The prerequisites are finite posets, intervals, graded posets and rank, the Mobius function of a poset, basic ring language, and ordinary matrix multiplication. This viewpoint also explains why chain enumeration, exponential generating functions, interval types, and later topological invariants of posets naturally share the same language.
## Incidence Algebra and Convolution
How can we encode all interval-by-interval summation operations on a finite poset in a single algebraic object? The answer is to use functions whose input is an interval endpoint pair, and to multiply them by summing over all intermediate points. Throughout this section let $P$ be a finite poset and let $R$ be a commutative ring with identity $1_R$.
[definition: Incidence Algebra]
The incidence algebra $I(P;R)$ is the set of functions
\begin{align*}
f:\{(x,y)\in P^2:x\le y\}\to R, \qquad (x,y)\mapsto f(x,y).
\end{align*}
Addition and scalar multiplication are defined pointwise.
The convolution product of $f,g \in I(P;R)$ is the function $f*g \in I(P;R)$ given by
\begin{align*}
(f*g)(x,y)=\sum_{x \le z \le y} f(x,z)g(z,y).
\end{align*}
[/definition]
The intermediate element $z$ plays the role of a cut point in the interval $[x,y]$. Thus convolution records all ways of decomposing an interval into a lower interval followed by an upper interval, which is exactly the kind of decomposition that appears in Mobius inversion.
[example: Incidence Algebra of a Chain]
Let $P=\{1<2<\cdots<n\}$ and let $R$ be a commutative ring with identity. A function $f\in I(P;R)$ assigns a value only to pairs $(i,j)$ with $i\le j$, so it determines an upper triangular matrix $A=(a_{ij})$ by setting $a_{ij}=f(i,j)$ when $i\le j$ and $a_{ij}=0_R$ when $i>j$.
If $g\in I(P;R)$ corresponds to the upper triangular matrix $B=(b_{ij})$, then the ordinary matrix product $AB$ has $(i,j)$-entry
\begin{align*}
(AB)_{ij}=\sum_{k=1}^{n} a_{ik}b_{kj}.
\end{align*}
For $k<i$, we have $a_{ik}=0_R$ because $A$ is upper triangular. For $k>j$, we have $b_{kj}=0_R$ because $B$ is upper triangular. Hence only indices with $i\le k\le j$ can contribute, and
\begin{align*}
(AB)_{ij}=\sum_{i\le k\le j} a_{ik}b_{kj}.
\end{align*}
Substituting $a_{ik}=f(i,k)$ and $b_{kj}=g(k,j)$ gives
\begin{align*}
(AB)_{ij}=\sum_{i\le k\le j} f(i,k)g(k,j)=(f*g)(i,j).
\end{align*}
Thus, on a chain, convolution is exactly upper triangular matrix multiplication; incidence algebras extend this multiplication rule from totally ordered finite posets to arbitrary finite posets.
[/example]
The chain example suggests that $I(P;R)$ should be a ring, but matrices no longer provide a proof once the poset is not totally ordered. The obstruction is associativity: cutting an interval first at $u$ and then at $v$ must give the same total contribution as making the cuts in the opposite grouping. The formal result checks this finite reindexing and identifies the zero-step kernel that acts as the identity, giving the algebraic foundation for inverses and powers in $I(P;R)$.
[quotetheorem:8100]
[citeproof:8100]
The identity element isolates intervals of length zero, so convolution now behaves like multiplication in a familiar algebra. The finiteness of $P$ is used in an essential way here: every convolution sum is finite, so associativity is finite reindexing rather than a question about convergence. For instance, in the usual order on $\mathbb Q$, the interval $[0,1]$ has infinitely many cut points, so the convolution value at $(0,1)$ is not an ordinary finite sum without a finiteness or support condition. The commutativity of $R$ is stronger than associativity strictly needs, but it ensures that $I(P;R)$ is naturally an $R$-algebra with the expected scalar behaviour. The theorem does not say that incidence algebras for different posets are the same; the multiplication still remembers exactly how intervals can be cut. To express summation over every element of an interval, we need the incidence algebra element that assigns weight one to every comparable pair.
[definition: Zeta Element]
The zeta element of $I(P;R)$ is the function
\begin{align*}
\zeta:\{(x,y)\in P^2:x\le y\}\to R
\end{align*}
defined by $\zeta(x,y)=1_R$ for every $x\le y$ in $P$.
[/definition]
Convolving with $\zeta$ performs summation over one side of an interval. To undo this operation, we use the incidence-algebra invertibility criterion: an incidence function $f$ is convolution-invertible exactly when every diagonal value $f(x,x)$ is a unit of $R$. This reduces invertibility to the zero-length intervals; once the diagonal entries are invertible, the remaining values of the inverse are determined recursively by increasing interval length.
For $\zeta$, every diagonal value is $1_R$, so $\zeta$ is a unit and interval summation has an inverse. Its necessity is already visible on singleton intervals: convolution at $(x,x)$ is just multiplication in $R$, so a nonunit diagonal entry could never become $1_R$ after multiplication by another incidence function. The result is also only a criterion for existence; the off-diagonal entries of the inverse still have to be computed recursively interval by interval. Over noncommutative coefficient rings, the same statement requires more care about left and right inverses and about how scalars act, which is why the course stays with commutative $R$. We need a name for the inverse of $\zeta$ because its entries will be used throughout the chapter as the Mobius values of the poset; this motivates the following definition.
[definition: Möbius Element]
The Mobius element of $P$ over $R$ is the function
\begin{align*}
\mu:\{(x,y)\in P^2:x\le y\}\to R
\end{align*}
defined by $\mu=\zeta^{-1}$ in $I(P;R)$.
[/definition]
The definition is compact, but computations still require a recursive description of the entries of $\mu$. Expanding the equations $\zeta*\mu=\delta$ and $\mu*\zeta=\delta$ recovers the familiar interval recurrence from the previous chapter.
[quotetheorem:8101]
[citeproof:8101]
The recursion is useful for calculation, while the inverse relation is better for conceptual proofs. The finiteness assumption ensures that induction on interval size reaches every value after finitely many steps; without local finiteness, the displayed sums might not even be finite. The recurrence also depends on the normalisation $\zeta(x,x)=1_R$, since the diagonal equation is what forces $\mu(x,x)=1_R$. What the theorem does not provide is a closed form for $\mu$ in an arbitrary poset; different interval shapes can produce very different values. The next theorem is needed to translate the algebraic identity $\mu=\zeta^{-1}$ back into a usable formula for functions on the poset.
[quotetheorem:7126]
[citeproof:7126/algebraic-combinatorics-ii-posets-and-lattices-2]
This theorem turns cumulative data on a poset into pointwise data by a single algebraic inversion. The finite-poset hypothesis is what makes the sums ordinary finite sums in $R$; for infinite posets, one needs local finiteness at minimum, and additional convergence or support assumptions may be needed depending on the coefficient setting. The theorem also assumes no order-theoretic symmetry: it works even when intervals have unrelated shapes, but then the values of $\mu(x,y)$ may have to be computed separately. It does not say that cumulative data is numerically stable or easy to evaluate, only that the inverse is algebraically determined. On a chain, the formula reduces to finite differences, so it is a good test case for the signs and vanishing pattern of $\mu$.
[example: Mobius Function of a Chain]
For the chain $1<2<\cdots<n$, we compute the Mobius element from *Möbius Recursion*. The diagonal equation gives
\begin{align*}
\mu(i,i)=1_R.
\end{align*}
For an adjacent interval $[i,i+1]$, the recursion gives
\begin{align*}
0_R=\sum_{i\le k\le i+1}\mu(i,k)=\mu(i,i)+\mu(i,i+1)=1_R+\mu(i,i+1).
\end{align*}
Adding $-1_R$ to both sides gives
\begin{align*}
\mu(i,i+1)=-1_R.
\end{align*}
It remains to show that $\mu(i,j)=0_R$ when $j-i\ge 2$. Fix $i$ and argue by induction on $d=j-i\ge 2$. The recursion on $[i,j]$ gives
\begin{align*}
0_R=\sum_{i\le k\le j}\mu(i,k)=\mu(i,i)+\mu(i,i+1)+\sum_{i+2\le k\le j-1}\mu(i,k)+\mu(i,j).
\end{align*}
The first two terms are $1_R$ and $-1_R$. If $d=2$, the middle sum is empty, so the displayed equation becomes $0_R=1_R-1_R+\mu(i,j)=\mu(i,j)$. If $d>2$, every term in the middle sum has distance between $2$ and $d-1$, so it is $0_R$ by the induction hypothesis; again $0_R=\mu(i,j)$. Therefore
\begin{align*}
\mu(i,j)=0_R \qquad \text{for } j-i\ge 2.
\end{align*}
Thus Mobius inversion on a chain keeps only the current and immediately preceding cumulative values, so it is exactly the finite-difference operation inverse to cumulative summation.
[/example]
## Chain-Counting Functions and Exponentials
What does convolution count when the factors are all zeta elements? Repeated convolution cuts an interval into several consecutive pieces, so powers of $\zeta$ enumerate multichains. This gives an internal version of exponential generating functions inside $I(P;R)$.
[definition: Chain-Counting Functions]
For $k\in\mathbb N$, define $C_k \in I(P;R)$ by
\begin{align*}
C_k(x,y)=|\{(x_0,\dots,x_k):x=x_0\le x_1\le\cdots\le x_k=y\}|\cdot 1_R.
\end{align*}
Set $C_0=\delta$.
[/definition]
The parameter $k$ counts the number of convolution steps, not the number of strict cover relations. Repetitions are allowed, so these are multichains with prescribed endpoints; the natural question is whether they are precisely the coefficients obtained from powers of $\zeta$.
[quotetheorem:8102]
[citeproof:8102]
Powers of $\zeta$ count multichains, including repeated entries. The theorem relies on finite convolution sums and on the convention that a $k$-fold product has $k$ interval pieces; changing either convention changes the indexing of the counted chains. It counts weak chains, not saturated chains, so in the three-element chain $0<a<1$, the term counted by $0<1$ as a single strict jump is not a cover relation. In a non-locally-finite infinite poset the same formula may involve infinitely many possible cut points, so the incidence algebra product would need a separate definition. To separate genuine rank movement from repetition, we need a new element that vanishes on singleton intervals and records only strict cuts.
[definition: Strict Chain Element]
The strict chain element is the function
\begin{align*}
\eta:\{(x,y)\in P^2:x\le y\}\to R
\end{align*}
defined by $\eta=\zeta-\delta$ in $I(P;R)$. For $k\ge 0$, the notation $\eta^k$ denotes the $k$-fold convolution power of $\eta$, with $\eta^0=\delta$.
[/definition]
The subtraction removes the zero-length contribution from each interval. Consequently, a nonzero term in $\eta^k(x,y)$ must choose only strict cuts, and the convolution expansion counts strict chains
\begin{align*}
x=x_0<x_1<\cdots<x_k=y
\end{align*}
with $k$ strict steps, interpreted in $R$.
Because $P$ is finite, $\eta$ is nilpotent: if $k>|P|$, then no strict chain has $k$ strict steps. This makes it possible to use finite power-series expressions in the incidence algebra, which packages strict chain counts by length.
[quotetheorem:8103]
[citeproof:8103]
The exponential formula is useful when intervals have a uniform combinatorial model. The assumption that $R$ contains $\mathbb Q$ is needed because the coefficients $1/k!$ must be available as scalars; over $\mathbb Z$, even the coefficient $1/2$ in the contribution from two-step chains is not an integer scalar. Finiteness of $P$ is also essential for nilpotence of $\eta$, which turns the formal power series into a finite sum. In an infinite locally finite poset such as $\mathbb N$ with its usual order, each individual interval is finite, but there is no global exponent $N$ with $\eta^N=0$ on all intervals, so the exponential is not a finite incidence-algebra expression without a separate completion or local interpretation. In the Boolean lattice, strict chains are the same thing as ordered decompositions of a set difference, so the formula recovers Stirling numbers inside the incidence algebra.
[example: Boolean Algebra Chain Counts]
Let $B_n$ be the Boolean lattice of subsets of $[n]=\{1,\dots,n\}$ ordered by inclusion. Fix $A\subseteq B$ and write $D=B\setminus A$, so $|D|=m$. A strict chain
\begin{align*}
A=A_0\subset A_1\subset \cdots \subset A_k=B
\end{align*}
determines subsets
\begin{align*}
E_r=A_r\setminus A_{r-1} \qquad \text{for } 1\le r\le k.
\end{align*}
Since $A_{r-1}\subset A_r$ is strict, each $E_r$ is nonempty. The sets $E_1,\dots,E_k$ are pairwise disjoint because an element is added at the first stage where it appears, and their union is $B\setminus A=D$ because the chain ends at $B$.
Conversely, if $(E_1,\dots,E_k)$ is an ordered list of nonempty pairwise disjoint subsets whose union is $D$, define
\begin{align*}
A_r=A\cup E_1\cup \cdots \cup E_r \qquad \text{for } 0\le r\le k.
\end{align*}
Then $A_0=A$, $A_k=B$, and $A_{r-1}\subset A_r$ is strict because $E_r$ is nonempty. These two constructions undo each other, so strict chains from $A$ to $B$ with $k$ strict steps are exactly ordered decompositions of $D$ into $k$ nonempty blocks.
The Stirling number $S(m,k)$ counts unordered partitions of an $m$-element set into $k$ nonempty blocks. Each such partition has $k!$ possible orderings of its blocks, so
\begin{align*}
c_k(A,B)=k!S(m,k).
\end{align*}
Therefore the exponential chain-counting value is
\begin{align*}
\exp(\eta)(A,B)=\sum_{k\ge 0}\frac{c_k(A,B)}{k!}=\sum_{k\ge 0}\frac{k!S(m,k)}{k!}=\sum_{k\ge 0}S(m,k).
\end{align*}
Since $S(m,k)=0$ for $k>m$, this sum depends only on the interval rank $m=|B\setminus A|$, not on the particular subsets $A$ and $B$.
[/example]
The Boolean example hints at a stronger reduction: some posets have intervals whose enumerative behaviour depends only on a small amount of type data. Reduced incidence algebras formalise this compression.
## Reduced Incidence Algebras and Hereditary Intervals
When many intervals in a poset are isomorphic, computing a separate value for every pair $x\le y$ wastes structure. The reduced incidence algebra keeps only functions that are constant on chosen interval types, while preserving convolution. The key point is that the chosen [equivalence relation](/page/Equivalence%20Relation) on intervals must be compatible with cutting intervals into subintervals.
[definition: Hereditary Family of Intervals]
A hereditary family of intervals consists of a collection $\mathcal F$ of finite intervals together with an equivalence relation $\sim$ on $\mathcal F$ such that every singleton interval belongs to $\mathcal F$; every subinterval of an interval in $\mathcal F$ also belongs to $\mathcal F$; and whenever $[x,y]\sim[x',y']$, the number of cut points $z\in[x,y]$ with prescribed lower interval type $[x,z]$ and prescribed upper interval type $[z,y]$ equals the corresponding number of cut points in $[x',y']$.
[/definition]
The final condition is the structural requirement behind closure under convolution. Since it provides type-level structure constants for interval cuts, it motivates defining an incidence algebra whose functions remember only interval type.
[definition: Reduced Incidence Algebra]
Given a hereditary family of intervals $\mathcal F/\sim$ over a commutative ring $R$, the reduced incidence algebra consists of functions
\begin{align*}
f:\mathcal F\to R
\end{align*}
that are constant on equivalence classes of $\sim$. Multiplication is induced by convolution:
\begin{align*}
(f*g)([x,y])=\sum_{x\le z\le y} f([x,z])g([z,y]).
\end{align*}
[/definition]
The definition would be useless if the displayed product depended on the chosen representative of an interval type. The hereditary condition prevents this problem by making the number of cuts of each lower-and-upper type pair a type invariant.
[quotetheorem:8104]
[citeproof:8104]
A particularly important case is when interval type is determined just by rank. The hereditary condition is necessary: if two intervals are declared equivalent but one has two middle cut points of a given lower-and-upper type while the other has three, then convolving two type-constant functions can produce different values on the supposedly equivalent intervals. Thus reduced incidence algebras are not obtained from arbitrary identifications of intervals; the equivalence relation must remember enough cutting data for multiplication to descend. The same mechanism is what later links Mobius values with Euler characteristics of order complexes: the algebraic inverse records alternating chain data, while topology packages the same data as a simplicial invariant. The closure theorem also does not classify all useful reductions, but it gives a checkable condition that protects the algebra structure. This leads to binomial posets, where the number of maximal chains in an interval behaves like a factorial sequence and controls all intermediate rank counts.
[definition: Binomial Poset]
A finite graded poset $P$ with rank function $\rho:P\to\mathbb N$ is a binomial poset if every interval $[x,y]$ of rank $\rho(y)-\rho(x)=n$ has the same number $B(n)$ of maximal chains, depending only on $n$.
[/definition]
The numbers $B(n)$ are the factorial function of the binomial poset. To compare different ranks inside the same interval, we divide the total number of maximal chains by the number forced below and above an intermediate element. This is also the incidence-algebra shadow of the exponential formula for labelled combinatorial species, where decompositions into ordered blocks produce factorial denominators and binomial-type coefficients.
[definition: Binomial Coefficients of a Binomial Poset]
Let $P$ be a binomial poset with factorial function $B(n)$. For $0\le k\le n$, define
\begin{align*}
\binom{n}{k}_P=\frac{B(n)}{B(k)B(n-k)}.
\end{align*}
[/definition]
The quotient in the definition is intended to count elements at a fixed rank inside any interval of rank $n$, but that interpretation is not automatic. A single intermediate element $z$ lies on $B(k)B(n-k)$ maximal chains through the lower and upper subintervals, so a uniform count requires every maximal chain in $[x,y]$ to be accounted for exactly once by its rank-$k$ element. The theorem verifies this double-counting argument and explains why binomial posets behave like Boolean lattices with a different factorial function.
[quotetheorem:8105]
[citeproof:8105]
The theorem explains why the notation resembles ordinary binomial coefficients: it counts intermediate objects in an interval. The graded hypothesis is needed so that every intermediate element has a well-defined relative rank, and the binomial-poset hypothesis is needed so that the number of maximal chains below and above $z$ depends only on those ranks. If intervals of the same rank had different numbers of maximal chains, the quotient $B(n)/(B(k)B(n-k))$ would not be a uniform count across all intervals. The identity also does not assert that the quotient is meaningful for an arbitrary graded poset; its integrality is a consequence of the chain-counting argument inside a binomial poset. The Boolean lattice is the model example, with maximal chains obtained by adding elements in a chosen order.
[example: Boolean Algebra as a Binomial Poset]
Let $B_n$ be the Boolean lattice of subsets of $[n]$, ordered by inclusion. Fix an interval $[A,B]$ and write $D=B\setminus A$, with $|D|=m$. The map
\begin{align*}
[A,B]\to B_m,\qquad C\mapsto C\setminus A
\end{align*}
is an order isomorphism from $[A,B]$ to the Boolean lattice of subsets of $D$: if $A\subseteq C\subseteq B$, then $C\setminus A\subseteq D$, and for every $E\subseteq D$ the inverse image is $A\cup E$.
A maximal chain from $A$ to $B$ has the form
\begin{align*}
A=C_0\subset C_1\subset \cdots \subset C_m=B,
\end{align*}
where each step adds exactly one element of $D$. Choosing such a chain is therefore the same as choosing an ordering $(d_1,\dots,d_m)$ of the elements of $D$, with
\begin{align*}
C_r=A\cup\{d_1,\dots,d_r\}\qquad \text{for }0\le r\le m.
\end{align*}
There are $m!$ orderings of the $m$ elements of $D$, so the factorial function of $B_n$ is $B(m)=m!$.
For $0\le k\le m$, the binomial-poset coefficient is therefore
\begin{align*}
\binom{m}{k}_{B_n}=\frac{B(m)}{B(k)B(m-k)}=\frac{m!}{k!(m-k)!}.
\end{align*}
This equals the number of intermediate subsets $C$ with $A\subseteq C\subseteq B$ and $|C\setminus A|=k$, because such a subset is obtained by choosing exactly $k$ elements from the $m$-element set $D=B\setminus A$. Thus the binomial coefficients of the Boolean lattice are the ordinary binomial coefficients.
[/example]
The same framework also covers the $q$-analogue that comes from linear algebra over a finite field. Here intervals are subspace lattices, and chains refine a quotient [vector space](/page/Vector%20Space) by dimensions.
[example: Subspace Lattice over a Finite Field]
Let $L_n(q)$ be the lattice of subspaces of $\mathbb F_q^n$ ordered by inclusion. Fix $U\subseteq W$ and suppose $\dim(W/U)=m$. The map $V\mapsto V/U$ sends each intermediate subspace $U\subseteq V\subseteq W$ to a subspace of $W/U$, and its inverse sends a subspace $\overline V\subseteq W/U$ to its preimage under the quotient map $W\to W/U$. Hence $[U,W]$ is order-isomorphic to the subspace lattice of the $m$-dimensional vector space $W/U$, and after choosing a vector-space isomorphism $W/U\cong \mathbb F_q^m$, it is enough to count $k$-dimensional subspaces of $\mathbb F_q^m$.
Count ordered linearly independent $k$-tuples $(v_1,\dots,v_k)$ in $\mathbb F_q^m$. There are $q^m-1$ choices for $v_1$. Once $v_1,\dots,v_r$ have been chosen and are linearly independent, their span has $q^r$ elements, so $v_{r+1}$ has $q^m-q^r$ choices. Therefore the number of ordered linearly independent $k$-tuples in $\mathbb F_q^m$ is
\begin{align*}
(q^m-1)(q^m-q)\cdots(q^m-q^{k-1}).
\end{align*}
For a fixed $k$-dimensional subspace, the same argument inside that subspace shows that its ordered bases are counted by
\begin{align*}
(q^k-1)(q^k-q)\cdots(q^k-q^{k-1}).
\end{align*}
Every ordered linearly independent $k$-tuple is an ordered basis of exactly one $k$-dimensional subspace, namely its span. Dividing by the number of ordered bases of each fixed $k$-subspace gives
\begin{align*}
\binom{m}{k}_q=\frac{(q^m-1)(q^m-q)\cdots(q^m-q^{k-1})}{(q^k-1)(q^k-q)\cdots(q^k-q^{k-1})}.
\end{align*}
Thus the number of intermediate subspaces $V$ with $\dim(V/U)=k$ depends only on $m=\dim(W/U)$ and $k$, so the reduced incidence algebra of $L_n(q)$ records interval data by dimension and replaces ordinary binomial coefficients with their $q$-analogues.
[/example]
Reduced incidence algebras explain why Mobius inversion often produces closed formulas in highly symmetric posets. When interval type is controlled by rank, dimension, or another hereditary invariant, convolution becomes multiplication in a smaller algebra of type functions. The rest of the course uses this compression repeatedly, especially when lattice operations and topological invariants make interval types visible.
# 5. Lattices and Closure Operators
After Chapters 3 and 4 treated intervals through Möbius functions and incidence algebras, this chapter upgrades partially ordered sets from incidence objects to algebraic objects. The guiding question is: when does order determine canonical operations, and when do those operations recover the order? We begin with meets and joins, then use closure operators to manufacture lattices from concrete data, and finally prove Birkhoff's representation theorem, which says that every finite distributive lattice is built from order ideals of a poset.
## Meets and Joins as Order-Theoretic Operations
A poset may have many incomparable upper bounds for two elements, or none at all. The first problem is to isolate the case where the best upper and lower bounds exist inside the poset and behave like algebraic operations.
[definition: Meet And Join]
Let $(P, \le)$ be a poset and let $x,y \in P$. A meet of $x$ and $y$ is an element $x \wedge y \in P$ such that $x \wedge y \le x$, $x \wedge y \le y$, and whenever $z \le x$ and $z \le y$, we have $z \le x \wedge y$. A join of $x$ and $y$ is an element $x \vee y \in P$ such that $x \le x \vee y$, $y \le x \vee y$, and whenever $x \le z$ and $y \le z$, we have $x \vee y \le z$.
[/definition]
The definition says that $x \wedge y$ is the greatest lower bound and $x \vee y$ is the least upper bound. If either object exists, it is unique by antisymmetry, so the symbols $\wedge$ and $\vee$ become operations rather than choices. We now single out posets where these two operations are always available, because those are the posets in which order can be manipulated algebraically.
[definition: Lattice]
A lattice is a poset $(L, \le)$ such that every pair $x,y \in L$ has a meet $x \wedge y$ and a join $x \vee y$.
[/definition]
The order and the two operations encode the same information. This is why lattices sit between order theory and algebra: order-theoretic statements can often be rewritten as identities involving $\wedge$ and $\vee$. The next theorem records the identities that make this translation possible and will be used throughout the rest of the chapter.
[quotetheorem:8106]
[citeproof:8106]
The theorem lets us recognize lattices either as ordered sets with bounds or as algebras with two binary operations satisfying standard identities. The lattice hypothesis is doing real work here: in a general poset, a pair may have several minimal upper bounds but no least upper bound, so there is no well-defined operation $x\vee y$ to which associativity or absorption could apply. Even when some meets and joins exist locally, the identities only make sense for the pairs and triples for which all the displayed expressions exist. The theorem therefore does not say that every order-theoretic construction is algebraic; it says that once binary meets and joins exist everywhere, the order can be recovered from them without extra data.
The next examples show the same formal laws in arithmetic, set-theoretic, and linear settings.
[example: Divisor Lattice]
Fix $n\ge 1$, and write
\begin{align*}
n=p_1^{\alpha_1}\cdots p_r^{\alpha_r}.
\end{align*}
Every positive divisor $d\mid n$ has a unique form
\begin{align*}
d=p_1^{\beta_1}\cdots p_r^{\beta_r}\quad\text{with}\quad 0\le \beta_i\le \alpha_i.
\end{align*}
For two divisors $a=p_1^{\beta_1}\cdots p_r^{\beta_r}$ and $b=p_1^{\gamma_1}\cdots p_r^{\gamma_r}$, divisibility is componentwise: $a\mid b$ exactly when $\beta_i\le \gamma_i$ for every $i$, because then
\begin{align*}
b/a=p_1^{\gamma_1-\beta_1}\cdots p_r^{\gamma_r-\beta_r}
\end{align*}
has nonnegative exponents.
Thus the map
\begin{align*}
p_1^{\beta_1}\cdots p_r^{\beta_r}\longmapsto (\beta_1,\dots,\beta_r)
\end{align*}
is an order isomorphism from the divisor poset of $n$ to $\{0,\dots,\alpha_1\}\times\cdots\times\{0,\dots,\alpha_r\}$ with componentwise order. Under this map,
\begin{align*}
\gcd(a,b)=p_1^{\min(\beta_1,\gamma_1)}\cdots p_r^{\min(\beta_r,\gamma_r)}
\end{align*}
and
\begin{align*}
\operatorname{lcm}(a,b)=p_1^{\max(\beta_1,\gamma_1)}\cdots p_r^{\max(\beta_r,\gamma_r)}.
\end{align*}
The first divisor is the meet because a divisor $c=p_1^{\delta_1}\cdots p_r^{\delta_r}$ satisfies $c\mid a$ and $c\mid b$ exactly when $\delta_i\le \beta_i$ and $\delta_i\le \gamma_i$ for every $i$, which is exactly $\delta_i\le \min(\beta_i,\gamma_i)$. The second divisor is the join because $a\mid c$ and $b\mid c$ exactly when $\beta_i\le \delta_i$ and $\gamma_i\le \delta_i$ for every $i$, which is exactly $\max(\beta_i,\gamma_i)\le \delta_i$. Hence the positive divisors of $n$ form a lattice, with meet given by $\gcd$ and join given by $\operatorname{lcm}$.
[/example]
The divisor example is distributive because minima and maxima in a product of chains distribute. Linear algebra gives a different lattice where the same meet-join language is useful, but distributivity can fail.
[example: Subspace Lattice]
Let $V$ be a finite-dimensional vector space over a field $k$, and order the set of linear subspaces of $V$ by inclusion. For subspaces $U,W\subset V$, the meet is $U\cap W$: it is a subspace, it satisfies $U\cap W\subset U$ and $U\cap W\subset W$, and if $X\subset U$ and $X\subset W$, then every $x\in X$ lies in both $U$ and $W$, so $X\subset U\cap W$.
The join is the sum
\begin{align*}
U+W=\{u+w:u\in U,\ w\in W\}.
\end{align*}
This set is a subspace because
\begin{align*}
(u_1+w_1)+(u_2+w_2)=(u_1+u_2)+(w_1+w_2)\in U+W
\end{align*}
and, for $\lambda\in k$,
\begin{align*}
\lambda(u+w)=(\lambda u)+(\lambda w)\in U+W.
\end{align*}
It contains $U$ since $u=u+0\in U+W$, and it contains $W$ since $w=0+w\in U+W$. If $Y$ is any subspace with $U\subset Y$ and $W\subset Y$, then for every $u\in U$ and $w\in W$ we have $u,w\in Y$, hence $u+w\in Y$; therefore $U+W\subset Y$. Thus $U+W$ is the least upper bound.
This lattice records linear dependence. In a two-dimensional vector space, let $A,B,C$ be three distinct one-dimensional subspaces. Since $B$ and $C$ are distinct lines through $0$, their sum has dimension $2$, so $B+C=V$. Hence
\begin{align*}
A\wedge(B\vee C)=A\cap(B+C)=A\cap V=A.
\end{align*}
On the other hand, distinct one-dimensional subspaces intersect only in $0$, so
\begin{align*}
A\wedge B=A\cap B=0
\end{align*}
and
\begin{align*}
A\wedge C=A\cap C=0.
\end{align*}
Therefore
\begin{align*}
(A\wedge B)\vee(A\wedge C)=0+0=0.
\end{align*}
Since $A\ne 0$, the two sides of the distributive identity are different, so the subspace lattice is not distributive in this case.
[/example]
Pairwise meets and joins are enough for many finite arguments, but infinite constructions require arbitrary intersections or generated objects. This leads to the complete version of the definition.
[definition: Complete Lattice]
A complete lattice is a poset $(L,\le)$ such that every subset $S \subset L$ has a meet $\bigwedge S$ and a join $\bigvee S$.
[/definition]
In a finite lattice, completeness follows from repeated binary meets and joins together with top and bottom elements. In infinite examples, completeness is a genuine condition; for instance, closure systems are naturally complete because intersections of closed objects stay closed. For finite lattices, another basic structural question is how the elements immediately above the bottom and immediately below the top control generation.
To understand how a finite lattice is assembled from its bottom or dismantled from its top, we name the elements that cover the bottom and the elements covered by the top. These elements are often the first candidates for generators, and in geometric examples they correspond to points and hyperplanes. We need the following definition to refer to those first and last layers without redrawing the Hasse diagram each time.
[definition: Atoms And Coatoms]
Let $L$ be a lattice with bottom element $\hat{0}$ and top element $\hat{1}$. An atom is an element $a \in L$ such that $\hat{0} < a$ and there is no $x \in L$ with $\hat{0} < x < a$. A coatom is an element $c \in L$ such that $c < \hat{1}$ and there is no $x \in L$ with $c < x < \hat{1}$.
[/definition]
Atoms measure how the lattice is generated upward from the bottom, while coatoms measure the dual question downward from the top. In the Boolean lattice $B_n$, atoms are singleton subsets and coatoms are complements of singleton subsets. After identifying these first layers, we need a way to compare how arbitrary elements combine across ranks.
Some of the most useful finite lattices carry a rank function. The modular and semimodular conditions describe whether the rank lost in passing to a meet is compensated by the rank gained in passing to a join. We need the following definition to turn that rank comparison into a reusable lattice condition.
[definition: Modular And Semimodular Lattice]
Let $L$ be a finite lattice with rank function $\rho:L\to \mathbb N \cup \{0\}$. The lattice $L$ is semimodular if for all $x,y \in L$,
\begin{align*}
\rho(x)+\rho(y) \ge \rho(x \wedge y)+\rho(x \vee y).
\end{align*}
The lattice $L$ is modular if for all $x,y \in L$,
\begin{align*}
\rho(x)+\rho(y) = \rho(x \wedge y)+\rho(x \vee y).
\end{align*}
[/definition]
The rank formulation is especially natural in examples from vector spaces and matroids. It packages a geometric dimension formula into a lattice condition. There is also an identity formulation of modularity, which does not require a rank function and will be used to locate the first obstruction to distributivity.
[quotetheorem:8107]
[citeproof:8107]
The subspace lattice is the model example: the dimension formula for subspaces is exactly the modular rank equality. The finiteness and rankedness assumptions in the theorem are not cosmetic, because the proof uses rank as a numerical invariant that strictly increases along proper containment and therefore detects equality of comparable elements. Without such a rank function, the displayed modular identity still makes sense, but it must be checked directly rather than inferred from a dimension formula. The rank equality is also stronger than semimodularity: semimodularity gives only an inequality, so it cannot force the two comparable lattice elements in the proof to have the same rank. Modularity is weaker than distributivity, and this distinction becomes visible in the five-element and diamond-shaped forbidden examples.
[example: The Nondistributive Lattices M3 And N5]
The lattice $M_3$ has elements $\hat{0},\hat{1},a,b,c$, where $a,b,c$ are incomparable atoms. Since $b$ and $c$ are distinct atoms, their only common lower bound is $\hat{0}$ and their least common upper bound is $\hat{1}$, so
\begin{align*}
b\vee c=\hat{1}.
\end{align*}
Therefore
\begin{align*}
a\wedge(b\vee c)=a\wedge\hat{1}=a.
\end{align*}
Also, since $a,b,c$ are distinct atoms, the only common lower bound of $a$ and $b$ is $\hat{0}$, and the only common lower bound of $a$ and $c$ is $\hat{0}$. Hence
\begin{align*}
a\wedge b=\hat{0}.
\end{align*}
and
\begin{align*}
a\wedge c=\hat{0}.
\end{align*}
Thus
\begin{align*}
(a\wedge b)\vee(a\wedge c)=\hat{0}\vee\hat{0}=\hat{0}.
\end{align*}
Since $a\ne\hat{0}$, the two sides of the distributive identity are unequal, so $M_3$ is not distributive.
The lattice $N_5$ has a chain $\hat{0}<u<v<\hat{1}$ and another element $w$ with $\hat{0}<w<\hat{1}$, where $w$ is incomparable with both $u$ and $v$. Because $w$ and $v$ have no common lower bound above $\hat{0}$, we have
\begin{align*}
w\wedge v=\hat{0}.
\end{align*}
Thus
\begin{align*}
u\vee(w\wedge v)=u\vee\hat{0}=u.
\end{align*}
On the other side, the only common upper bound of $u$ and $w$ is $\hat{1}$, because $v$ is not above $w$ and $w$ is not above $u$. Hence
\begin{align*}
u\vee w=\hat{1}.
\end{align*}
Therefore
\begin{align*}
(u\vee w)\wedge v=\hat{1}\wedge v=v.
\end{align*}
Since $u<v$, we have $u\ne v$, so
\begin{align*}
u\vee(w\wedge v)\ne (u\vee w)\wedge v.
\end{align*}
Thus $N_5$ fails the modular identity even with $u\le v$.
[/example]
The two examples are small, but they are structurally decisive: $M_3$ shows that modularity need not imply distributivity, while $N_5$ shows how modularity itself can fail.
## Closure Operators and Moore Families
Many lattices do not appear first as abstract posets. They arise from a process that takes a set of generators and closes it under a rule: span in linear algebra, subgroup generation in algebra, convex hull in geometry, or [transitive closure](/theorems/1493) in graph theory. The question is how much order structure is forced by such a closure process.
[definition: Closure Operator]
Let $E$ be a set. A closure operator on $E$ is a map $c:2^E \to 2^E$ such that for all $A,B \subset E$,
\begin{align*}
A \subset c(A).
\end{align*}
If $A \subset B$, then $c(A) \subset c(B)$. Also,
\begin{align*}
c(c(A)) = c(A).
\end{align*}
[/definition]
The three axioms are extensivity, monotonicity, and idempotence. They say that closing a set adds allowed consequences, respects inclusion of data, and stabilizes after the first closure. Once a closure process is present, the next question is which subsets have already absorbed all their consequences.
[definition: Closed Set]
Let $c:2^E \to 2^E$ be a closure operator. A subset $F \subset E$ is closed if $c(F)=F$.
[/definition]
Closed sets are the fixed points of the closure process. The important structural question is whether these fixed points remember the lattice operations that produced them. The next theorem answers this by showing that fixed points automatically form a complete lattice, with meet given by ordinary intersection and join given by closing an ordinary union.
[quotetheorem:8108]
[citeproof:8108]
This theorem explains why lattices of subspaces and lattices of algebraic subobjects are so common. Each closure axiom is used in a different place. Extensivity ensures that a proposed join really contains the sets being joined; without it, $c(F\cup G)$ might fail even to be an upper bound. Monotonicity is what makes intersections closed: from $A\subset F_i$ we need $c(A)\subset c(F_i)$, and without that implication the meet formula can fail. Idempotence ensures that $c(\bigcup_i F_i)$ is itself closed; without it, closing the union once might still leave an object outside the fixed-point family. Thus the theorem is not merely about any operation on subsets, but specifically about operations whose fixed points are stable under arbitrary intersections and generated unions.
In each case, the join is not the raw union; it is the closed object generated by the union.
[example: Span Closure]
Let $V$ be a vector space over a field $k$, and define $c(A)=\operatorname{span}(A)$ for each $A\subset V$. Since every $a\in A$ is the linear combination $1a$, we have $A\subset \operatorname{span}(A)$. If $A\subset B$, then every finite linear combination of elements of $A$ is also a finite linear combination of elements of $B$, so $\operatorname{span}(A)\subset \operatorname{span}(B)$. Finally, $\operatorname{span}(A)$ is already a subspace, so taking the span again adds no new vectors:
\begin{align*}
\operatorname{span}(\operatorname{span}(A))=\operatorname{span}(A).
\end{align*}
Thus span is a closure operator.
The closed sets for this closure operator are exactly the linear subspaces of $V$. If $F$ is a subspace, then every finite linear combination of vectors in $F$ lies in $F$, so $\operatorname{span}(F)\subset F$; the reverse inclusion follows from $f=1f$, hence $\operatorname{span}(F)=F$. Conversely, if $\operatorname{span}(F)=F$, then $0\in \operatorname{span}(F)=F$, and for $x,y\in F$ and $\lambda\in k$ we have
\begin{align*}
x+y=1x+1y\in \operatorname{span}(F)=F.
\end{align*}
Also,
\begin{align*}
\lambda x\in \operatorname{span}(F)=F.
\end{align*}
So $F$ is a subspace.
Therefore the lattice of closed sets is the lattice of subspaces. Its meet is ordinary intersection: if $X\subset U$ and $X\subset W$, then each $x\in X$ lies in both $U$ and $W$, so $X\subset U\cap W$. Its join is
\begin{align*}
\operatorname{span}(U\cup W).
\end{align*}
For subspaces $U,W\subset V$, this equals the usual sum $U+W=\{u+w:u\in U,\ w\in W\}$. Indeed, if $u\in U$ and $w\in W$, then $u,w\in U\cup W$, so
\begin{align*}
u+w=1u+1w\in \operatorname{span}(U\cup W).
\end{align*}
Thus $U+W\subset \operatorname{span}(U\cup W)$. Conversely, $U+W$ contains $U$ because $u=u+0$, contains $W$ because $w=0+w$, and is a subspace since
\begin{align*}
(u_1+w_1)+(u_2+w_2)=(u_1+u_2)+(w_1+w_2)\in U+W
\end{align*}
and
\begin{align*}
\lambda(u+w)=(\lambda u)+(\lambda w)\in U+W.
\end{align*}
Hence every linear combination of vectors from $U\cup W$ lies in $U+W$, so $\operatorname{span}(U\cup W)\subset U+W$. Therefore
\begin{align*}
\operatorname{span}(U\cup W)=U+W.
\end{align*}
The closure-operator join is exactly the linear span generated by the two subspaces, not their raw union.
[/example]
The span example starts with a closure operator and then extracts its closed sets. Often the situation is reversed: we know a class of subsets is stable under intersections, and we want to recover the closure operation that generated it. The following definition isolates the exact stability property needed for this reconstruction.
[definition: Moore Family]
Let $E$ be a set. A Moore family on $E$ is a collection $\mathcal M\subset 2^E$ such that $E\in \mathcal M$ and for every subcollection $\mathcal A\subset \mathcal M$, the intersection $\bigcap_{A\in\mathcal A} A$ belongs to $\mathcal M$.
[/definition]
The condition is exactly the intersection property used in the proof above. Hence a Moore family looks like the closed-set side of a closure theory. The next theorem proves that no information is missing: [closure operators and Moore families](/theorems/8109) are two languages for the same structure.
[quotetheorem:8109]
[citeproof:8109]
This equivalence is a recurring construction principle: define the closed sets by an intersection-stable property, then obtain a closure operator and a complete lattice without separately checking joins. The hypotheses on a Moore family are exactly what make the reconstruction formula meaningful. The requirement $E\in\mathcal M$ guarantees that every subset $A\subset E$ has at least one closed superset, so the intersection defining $c_{\mathcal M}(A)$ is not taken over an empty collection. Closure under arbitrary intersections, not just finite intersections, is needed because the family of all members of $\mathcal M$ containing $A$ may be infinite. If only finite intersections are available, the formula can produce a subset outside $\mathcal M$, and then idempotence of the reconstructed closure operator can fail. The equivalence also says nothing about additional structure such as finite generation, ranks, or distributivity; those require extra hypotheses beyond being a closure system.
## Distributive Lattices and Order Ideals
The final topic asks which finite lattices are controlled by an underlying poset. The answer is strongest for distributive lattices: their elements can be represented as order ideals of a poset built from the lattice itself.
[definition: Distributive Lattice]
A lattice $L$ is distributive if for all $x,y,z\in L$,
\begin{align*}
x\wedge (y\vee z)=(x\wedge y)\vee(x\wedge z).
\end{align*}
Also,
\begin{align*}
x\vee (y\wedge z)=(x\vee y)\wedge(x\vee z).
\end{align*}
[/definition]
The two displayed identities are dual, and in a lattice either one implies the other. Distributivity rules out the two minimal patterns $M_3$ and $N_5$ seen above.
[example: Order Ideals Form A Distributive Lattice]
Let $P$ be a finite poset, and let $J(P)$ be the set of order ideals of $P$, ordered by inclusion. If $I,J\in J(P)$, then $I\cap J$ is an order ideal: if $x\in I\cap J$ and $y\le x$, then $y\in I$ because $I$ is an ideal and $y\in J$ because $J$ is an ideal, so $y\in I\cap J$. Also $I\cup J$ is an order ideal: if $x\in I\cup J$ and $y\le x$, then either $x\in I$ or $x\in J$, and the ideal property gives either $y\in I$ or $y\in J$, hence $y\in I\cup J$.
The meet is $I\cap J$. Indeed, $I\cap J\subset I$ and $I\cap J\subset J$, and if $X\in J(P)$ satisfies $X\subset I$ and $X\subset J$, then every $x\in X$ lies in both $I$ and $J$, so $X\subset I\cap J$. The join is $I\cup J$. It contains both $I$ and $J$, and if $Y\in J(P)$ satisfies $I\subset Y$ and $J\subset Y$, then every element of $I\cup J$ lies in $Y$, so $I\cup J\subset Y$.
Now take $I,J,K\in J(P)$. Since meet is intersection and join is union, the distributive identity becomes the ordinary set identity
\begin{align*}
I\cap (J\cup K)=(I\cap J)\cup(I\cap K).
\end{align*}
To verify it elementwise, let $x\in P$. Then
\begin{align*}
x\in I\cap(J\cup K) \Longleftrightarrow x\in I\text{ and }(x\in J\text{ or }x\in K).
\end{align*}
Distributing the logical word “and” over “or” gives
\begin{align*}
x\in I\cap(J\cup K) \Longleftrightarrow (x\in I\text{ and }x\in J)\text{ or }(x\in I\text{ and }x\in K).
\end{align*}
This is exactly
\begin{align*}
x\in I\cap(J\cup K) \Longleftrightarrow x\in (I\cap J)\cup(I\cap K).
\end{align*}
Therefore $I\cap(J\cup K)=(I\cap J)\cup(I\cap K)$, so $J(P)$ is a distributive lattice. The lattice operations are not new operations on the elements of $P$; they are the usual intersection and union, restricted to downward-closed subsets.
[/example]
Order ideals are not just examples. They are the universal model for finite distributive lattices. To formulate the theorem, we need the elements that cannot be decomposed nontrivially as joins.
[definition: Join-Irreducible Element]
Let $L$ be a finite lattice. An element $j\in L$ is join-irreducible if $j\ne \hat{0}$ and whenever $j=x\vee y$ with $x,y\in L$, we have $j=x$ or $j=y$.
[/definition]
Join-irreducibles are the lattice-theoretic analogue of indecomposable generators.
For an order-ideal lattice $J(P)$, the reconstruction program needs a way to read the original points of $P$ from the lattice operations alone. A principal ideal $\downarrow p$ is the natural candidate for the lattice element representing $p$, because it is generated by one point rather than by a union of several smaller ideals. The obstruction is that a non-principal ideal might also appear indecomposable unless it can always be split as a join of proper subideals. The theorem rules out that obstruction and identifies exactly which ideals are join-irreducible, setting up the finite distributive representation theorem.
[quotetheorem:8110]
[citeproof:8110]
The theorem says that when a distributive lattice already has the form $J(P)$, the underlying poset can be recovered internally. Finiteness is used in two ways: every nonempty ideal has maximal elements, and every ideal is the union of the principal ideals generated by those maximal elements. Ideals, rather than arbitrary subsets, are essential because $\downarrow p$ records not only the generator $p$ but all order relations below it; arbitrary subsets would forget the downward-closure condition and would not form the same lattice under union and intersection. Outside ideal lattices the statement is false as written: a lattice may have join-irreducibles without every element being recovered as the join of the ones below it, and nondistributive interactions can make the reconstruction map lose information. It remains to prove that every finite distributive lattice has such a form. The only possible choice for the underlying poset is now forced: it must be the poset of join-irreducibles.
[quotetheorem:8145]
[citeproof:8145]
This representation converts questions about finite distributive lattices into questions about finite posets, but both adjectives matter. Distributivity is what forces a join-irreducible below a finite join to lie below one of the joined elements; this is exactly the step that fails in $M_3$, where an atom lies below the join of the other two atoms without lying below either one. The lattice $N_5$ shows a second obstruction: even the modular behaviour needed to control decompositions can fail. Finiteness is also essential for the proof given here, because repeated decomposition into smaller joins must terminate and because arbitrary joins of join-irreducibles are not available in a general infinite lattice. Infinite distributive lattices have representation theorems of a different kind, usually requiring prime ideals or topology, rather than this finite order-ideal construction. In the finite distributive case, enumeration, rank generating functions, and Möbius calculations can often be reduced to the combinatorics of order ideals.
[example: Reconstructing A Boolean Lattice]
In the Boolean lattice $B_n=2^{\{1,\dots,n\}}$ ordered by inclusion, the bottom element is $\varnothing$ and the join is union: for $A,B\subset \{1,\dots,n\}$, the least subset containing both $A$ and $B$ is $A\cup B$.
We first identify the join-irreducible elements. The empty set is not join-irreducible because it is the bottom element. If $S=\{i\}$ and
\begin{align*}
S=A\cup B,
\end{align*}
then $i\in A\cup B$, so $i\in A$ or $i\in B$. Since $A\subset S$ and $B\subset S$, the condition $i\in A$ gives $A=S$, and the condition $i\in B$ gives $B=S$. Hence every singleton $\{i\}$ is join-irreducible.
Conversely, suppose $S\subset \{1,\dots,n\}$ has at least two elements. Choose $i\in S$. Then $\{i\}\subsetneq S$ and $S\setminus\{i\}\subsetneq S$, while
\begin{align*}
S=\{i\}\cup (S\setminus\{i\}).
\end{align*}
Thus $S$ is a join of two proper elements of $B_n$, so $S$ is not join-irreducible. Therefore the join-irreducibles are exactly
\begin{align*}
\{\{1\},\{2\},\dots,\{n\}\}.
\end{align*}
They form an antichain because $\{i\}\subset \{j\}$ holds exactly when $i=j$.
Now let $P=J_{\mathrm{irr}}(B_n)$. By *Birkhoff Representation Theorem*, the representation map is
\begin{align*}
\Phi(S)=\{\{i\}\in P:\{i\}\subset S\}.
\end{align*}
Since $\{i\}\subset S$ exactly when $i\in S$, this is
\begin{align*}
\Phi(S)=\{\{i\}:i\in S\}.
\end{align*}
Because $P$ is an antichain, every subset of $P$ is an order ideal: if $Q\subset P$, $\{j\}\in Q$, and $\{i\}\le \{j\}$ in $P$, then $\{i\}=\{j\}$, so $\{i\}\in Q$. Hence $J(P)=2^P$.
Under the identification $\{i\}\leftrightarrow i$, the set $\Phi(S)$ corresponds exactly to $S$. Thus Birkhoff's representation sends the Boolean lattice $B_n$ to the order-ideal lattice of an $n$-element antichain, which is again a full power set ordered by inclusion.
[/example]
The Boolean example shows the representation in the easiest case: an antichain of generators produces a Boolean lattice. For a general finite distributive lattice, the order relations among join-irreducibles record all dependencies between generators. This motivates the promised characterization: distributivity is exactly the condition that makes the join-irreducible reconstruction map an isomorphism.
[quotetheorem:8111]
[citeproof:8111]
This characterization is theorem-specific rather than merely a reformulation of terminology. If $L$ is finite but not distributive, the same map to order ideals of $J_{\mathrm{irr}}(L)$ can still be written down, but it need not preserve or reflect the lattice structure; $M_3$ is the basic example, since its three atoms would suggest a Boolean lattice with eight ideals rather than a five-element diamond. If $L$ is distributive but infinite, the finite join-irreducible reconstruction may miss elements that are not finite joins of join-irreducibles, so a different representation theorem is needed. Thus the equivalence characterizes exactly the finite distributive situation, not arbitrary lattices with many generators.
The chapter's main lesson is that lattice operations are not extra decoration on a poset: they often reveal the hidden generators and closure rules of the object. Closure operators build complete lattices from fixed points, and Birkhoff's theorem shows that finite distributive lattices are precisely the lattices obtained from order ideals of finite posets.
# 6. Geometric Lattices and Matroids
This chapter connects the lattice-theoretic and closure-operator language of Chapter 5 with the matroidal language of independence, closure, and rank. The guiding question is: when does a finite lattice behave like the lattice of linear subspaces spanned by a finite vector configuration? The answer is that atomic semimodular lattices are exactly lattices of flats of matroids, and this bridge lets us import deletion-contraction, characteristic polynomials, and Whitney's coefficient theorem into the lattice setting.
## Atomic Semimodular Lattices as Flat Lattices
The first problem is to recognize which lattices arise from a closure operation. In vector spaces, flats are subspaces spanned by subsets of a given set of vectors; in graphs, flats remember edge sets closed under cycles. The lattice-theoretic shadow of this phenomenon is the combination of being built from atoms and satisfying a rank inequality.
[definition: Atom]
Let $L$ be a finite lattice with least element $\bot$. An element $a \in L$ is an atom if $\bot < a$ and there is no $x \in L$ with $\bot < x < a$.
[/definition]
Atoms are the indivisible points from which the lattice is assembled. This motivates the stronger question of when every element of the lattice is generated from such points by joins.
[definition: Atomic Lattice]
A finite lattice $L$ is atomic if every element $x \in L$ can be written as
\begin{align*}
x = \bigvee_{a \in A} a
\end{align*}
for some set $A$ of atoms of $L$.
[/definition]
Atomicity supplies generators, but it does not control how rank behaves under joins and meets. This motivates a lattice analogue of submodularity of matroid rank, which prevents joins from creating too much new dimension.
[definition: Semimodular Lattice]
A finite ranked lattice $L$ with rank function $r:L\to \mathbb N \cup \{0\}$ is semimodular if, for all $x,y\in L$,
\begin{align*}
r(x)+r(y) \ge r(x\wedge y)+r(x\vee y).
\end{align*}
[/definition]
The inequality says that joining two elements cannot create more new rank than their ranks allow after subtracting the overlap measured by the meet. This motivates naming the lattices where point-generation and rank submodularity hold simultaneously.
[definition: Geometric Lattice]
A finite lattice $L$ is geometric if it is atomic and semimodular.
[/definition]
The definition is short because all the content lies in the interaction between atoms and rank. To test the definition, we first examine the case with no dependence among atoms.
[example: Boolean Lattice as a Geometric Lattice]
Let $B_n$ be the lattice of all subsets of $\{1,\dots,n\}$ ordered by inclusion, with meet $A\wedge B=A\cap B$ and join $A\vee B=A\cup B$. The least element is $\varnothing$, and the atoms are exactly the singleton sets $\{i\}$: each satisfies $\varnothing\subsetneq \{i\}$, and there is no subset strictly between them. If $S\subseteq \{1,\dots,n\}$, then
\begin{align*}
S=\bigcup_{i\in S}\{i\},
\end{align*}
so every element is a join of atoms.
The rank of $S$ is $r(S)=|S|$. For $A,B\subseteq \{1,\dots,n\}$, the semimodular inequality is
\begin{align*}
r(A)+r(B)\ge r(A\cap B)+r(A\cup B).
\end{align*}
Substituting $r(S)=|S|$ gives
\begin{align*}
|A|+|B|\ge |A\cap B|+|A\cup B|.
\end{align*}
To verify this, decompose $A\cup B$ into the disjoint union
\begin{align*}
A\cup B=(A\setminus B)\cup(B\setminus A)\cup(A\cap B).
\end{align*}
Hence
\begin{align*}
|A\cup B|=|A\setminus B|+|B\setminus A|+|A\cap B|.
\end{align*}
Also,
\begin{align*}
|A|=|A\setminus B|+|A\cap B|
\end{align*}
and
\begin{align*}
|B|=|B\setminus A|+|A\cap B|.
\end{align*}
Adding the last two identities gives
\begin{align*}
|A|+|B|=|A\setminus B|+|B\setminus A|+2|A\cap B|.
\end{align*}
Comparing with the formula for $|A\cup B|$ yields
\begin{align*}
|A|+|B|=|A\cup B|+|A\cap B|.
\end{align*}
Thus semimodularity holds with equality. Therefore $B_n$ is atomic and semimodular, hence geometric; it is the flat lattice of the free matroid on $n$ elements, where every subset is already closed.
[/example]
The Boolean lattice shows the independent case. This motivates introducing matroids, whose rank axioms record exactly how independence may fail while retaining enough structure for a lattice of closed sets.
[definition: Matroid by Rank]
A matroid $M$ on a finite ground set $E$ is a function $r:2^E\to \mathbb N\cup\{0\}$ such that, for all $A,B\subseteq E$ and $e\in E$, $r(\varnothing)=0$, $r(A)\le r(A\cup \{e\})\le r(A)+1$, $A\subseteq B$ implies $r(A)\le r(B)$, and
\begin{align*}
r(A)+r(B)\ge r(A\cap B)+r(A\cup B).
\end{align*}
[/definition]
The rank axioms are chosen so that closure has the same behaviour as span. This motivates isolating the closed sets, since these are the objects that will form the lattice attached to a matroid.
[definition: Flat of a Matroid]
Let $M$ be a matroid on $E$ with rank function $r$. A subset $F\subseteq E$ is a flat if, for every $e\in E\setminus F$,
\begin{align*}
r(F\cup\{e\})>r(F).
\end{align*}
[/definition]
The flats are ordered by inclusion. Their meet is intersection, while their join is the closure of union; this motivates proving that the flat poset has precisely the geometric lattice structure defined above.
[quotetheorem:8112]
[citeproof:8112]
This theorem turns every finite matroid into a lattice, but the hypotheses are doing real work. Finiteness ensures that the flat poset is a finite ranked lattice; without it, rank and joins may require extra local finiteness assumptions before the same proof applies. Flats are essential rather than arbitrary subsets: for the graphic matroid of a triangle, the full Boolean lattice on the three edges has rank three and forgets that the three edges form a circuit, while the flat lattice has rank two and retains that dependence. The rank axioms are also essential: an arbitrary closure system on a finite set can have a closed-set lattice that is not semimodular, so the conclusion does not classify all closure systems.
There is also a more concrete converse question. Instead of starting with a matroid and taking its flats, suppose we are handed only a family of subsets of a finite ground set. The next criterion records exactly when that family can be recognized as the flat family of some matroid, using closure-style axioms rather than lattice terminology.
[quotetheorem:6607]
[citeproof:6607]
The criterion explains why flats can be treated as an independent way to specify a matroid. The intersection condition says that closed sets really behave like fixed points of a closure operation, while the covering condition controls how a new element forces the smallest larger flat containing it. Thus the theorem is not claiming that every finite lattice, or every arbitrary closure system, comes from a matroid; it identifies the additional local exchange behavior needed for a family of subsets to be matroidal. To make the translation concrete, we next identify the flats of a graphic matroid.
[example: Graphic Matroid Flats]
Let $G=(V,E)$ be a finite graph and let $M(G)$ be its graphic matroid, whose rank on $A\subseteq E$ is
\begin{align*}
r(A)=|V|-c(A),
\end{align*}
where $c(A)$ is the number of connected components of the spanning subgraph $(V,A)$. We compute exactly when an edge set $F\subseteq E$ is a flat.
Let $e\in E\setminus F$, and let its endpoints be $u$ and $v$. If $u$ and $v$ lie in the same connected component of $(V,F)$, then adding $e$ does not change the connected components:
\begin{align*}
c(F\cup\{e\})=c(F).
\end{align*}
Therefore
\begin{align*}
r(F\cup\{e\})=|V|-c(F\cup\{e\})=|V|-c(F)=r(F).
\end{align*}
So such an edge prevents $F$ from being a flat, because flats require $r(F\cup\{e\})>r(F)$ for every $e\notin F$.
Conversely, if $u$ and $v$ lie in different connected components of $(V,F)$, then adding $e$ merges exactly those two components and leaves all other components unchanged:
\begin{align*}
c(F\cup\{e\})=c(F)-1.
\end{align*}
Hence
\begin{align*}
r(F\cup\{e\})=|V|-c(F\cup\{e\})=|V|-(c(F)-1)=|V|-c(F)+1=r(F)+1.
\end{align*}
Thus $r(F\cup\{e\})>r(F)$ exactly for the missing edges whose endpoints lie in different connected components of $(V,F)$.
It follows that $F$ is a flat precisely when every edge of $G$ whose endpoints lie in the same connected component of $(V,F)$ already belongs to $F$. Equivalently, a flat is obtained by taking the connected components of $(V,F)$ as blocks and including all edges of $G$ internal to each block. The flat lattice therefore records how these connected blocks merge as new rank-increasing edges are added.
[/example]
## Characteristic Polynomials and the Möbius Function
Once a geometric lattice has been identified, the next problem is to extract numerical invariants from its interval structure. Chapters 3 and 4 introduced the Möbius function as the inverse of the zeta function in the incidence algebra; here it packages the rank distribution of a geometric lattice into a polynomial.
[definition: Characteristic Polynomial of a Ranked Lattice]
Let $L$ be a finite ranked lattice with least element $\bot$, rank function $r$, and top rank $n=r(\top)$. The characteristic polynomial construction assigns to $L$ the polynomial $\chi_L \in \mathbb Z[t]$ defined by
\begin{align*}
\chi_L(t)=\sum_{x\in L}\mu(\bot,x)t^{n-r(x)},
\end{align*}
where $\mu$ is the Möbius function of $L$.
[/definition]
The exponent measures codimension, while the Möbius coefficient records inclusion-exclusion over the interval below $x$. This motivates computing the polynomial first in the free case.
[example: Characteristic Polynomial of the Boolean Lattice]
For $B_n$, fix $A\subseteq \{1,\dots,n\}$. The interval $[\varnothing,A]$ consists exactly of the subsets of $A$, so it is a Boolean lattice of rank $|A|$. Its Möbius value is $\mu(\varnothing,A)=(-1)^{|A|}$: for $A=\varnothing$ this is $\mu(\varnothing,\varnothing)=1$, and for $|A|=m>0$ the Möbius recursion gives
\begin{align*}
0=\sum_{X\subseteq A}\mu(\varnothing,X)=\mu(\varnothing,A)+\sum_{X\subsetneq A}(-1)^{|X|}.
\end{align*}
Grouping the proper subsets by size,
\begin{align*}
\sum_{X\subsetneq A}(-1)^{|X|}=\sum_{k=0}^{m-1}\binom{m}{k}(-1)^k.
\end{align*}
Since
\begin{align*}
0=(1-1)^m=\sum_{k=0}^{m}\binom{m}{k}(-1)^k,
\end{align*}
we have
\begin{align*}
\sum_{k=0}^{m-1}\binom{m}{k}(-1)^k=-(-1)^m.
\end{align*}
Substituting into the recursion gives
\begin{align*}
\mu(\varnothing,A)=(-1)^m=(-1)^{|A|}.
\end{align*}
The top rank of $B_n$ is $n$, and the rank of $A$ is $|A|$, so the characteristic polynomial is
\begin{align*}
\chi_{B_n}(t)=\sum_{A\subseteq \{1,\dots,n\}}(-1)^{|A|}t^{n-|A|}.
\end{align*}
There are $\binom{n}{k}$ subsets of size $k$, hence
\begin{align*}
\chi_{B_n}(t)=\sum_{k=0}^{n}\binom{n}{k}(-1)^k t^{n-k}.
\end{align*}
Expanding $(t-1)^n$ gives
\begin{align*}
(t-1)^n=\sum_{k=0}^{n}\binom{n}{k}t^{n-k}(-1)^k.
\end{align*}
Therefore
\begin{align*}
\chi_{B_n}(t)=(t-1)^n.
\end{align*}
This matches the free matroid interpretation: each of the $n$ independent elements contributes one identical factor $t-1$.
[/example]
The Boolean computation is the baseline where all intervals are Boolean. This motivates the partition lattice as the first non-Boolean example whose polynomial still factors completely.
[example: Partition Lattice]
Let $\Pi_n$ be the lattice of set partitions of $\{1,\dots,n\}$ ordered by refinement, with the discrete partition at the bottom. For the complete graph $K_n$, an edge set $F$ determines the partition of $\{1,\dots,n\}$ into the connected components of the spanning subgraph $(\{1,\dots,n\},F)$. Since $K_n$ contains every edge between two vertices, the flat condition says exactly that $F$ contains all edges whose endpoints lie in the same component. Thus each flat is determined by a set partition, and the flat lattice of the graphic matroid of $K_n$ is $\Pi_n$.
We compute the characteristic polynomial through the graph interpretation. For a connected graph $G$, the graphic-matroid characteristic polynomial satisfies $\chi_{M(G)}(t)=P_G(t)/t$, where $P_G(t)$ is the chromatic polynomial. In $K_n$, a proper colouring assigns distinct colours to all $n$ vertices. There are $t$ choices for the first vertex, then $t-1$ choices for the second vertex, then $t-2$ choices for the third vertex, and in general $t-(i-1)$ choices for the $i$th vertex, because it must avoid the colours already used on the previous $i-1$ adjacent vertices. Therefore
\begin{align*}
P_{K_n}(t)=t(t-1)(t-2)\cdots(t-n+1).
\end{align*}
Dividing by the initial factor $t$ gives
\begin{align*}
\chi_{\Pi_n}(t)=\chi_{M(K_n)}(t)=(t-1)(t-2)\cdots(t-n+1).
\end{align*}
So the factor $t-i$ records the choice forced at the moment a new vertex must avoid the $i$ colours already assigned to earlier vertices.
[/example]
The coefficients of $\chi_L(t)$ are not arbitrary signed integers. This motivates a cancellation-free description of them using circuits with a chosen ordering of the atoms.
[definition: Broken Circuit]
Let $M$ be a matroid on a linearly ordered ground set $E$. A circuit is a minimal dependent subset of $E$. A broken circuit is a set of the form $C\setminus\{\min C\}$, where $C$ is a circuit.
[/definition]
Broken circuits encode the cancellations that occur in Möbius inversion by marking the dependent subsets that can be paired away. This motivates selecting the subsets that avoid every such cancellation pattern, since those are the subsets expected to remain in the final coefficient count.
[definition: No-Broken-Circuit Set]
Let $M$ be a matroid on a linearly ordered ground set $E$. A subset $A\subseteq E$ is a no-broken-circuit set if it contains no broken circuit.
[/definition]
No-broken-circuit sets are defined using an order, but they count order-independent [coefficients of the characteristic polynomial](/theorems/3306). This motivates Whitney's theorem, which turns the recursive Möbius coefficients into an explicit enumeration of these surviving subsets in each cardinality.
[quotetheorem:8113]
[citeproof:8113]
Whitney's theorem is useful because it replaces Möbius values, which are recursive, by a direct enumeration of special subsets. The loopless hypothesis avoids a degenerate obstruction: if $e$ is a loop, then $\{e\}$ is a circuit, its broken circuit is $\varnothing$, and no subset can avoid all broken circuits while the characteristic polynomial is $0$. The linear order affects which subsets are called no-broken-circuit sets, since changing the least element of a circuit changes the broken circuit, but the number of such sets in each size is forced by the order-independent coefficients of $\chi_{L(M)}(t)$. The theorem counts coefficients rather than producing the polynomial without work: one still has to enumerate NBC sets or use another method to find them. To see the cancellation in a small case, we examine a single circuit.
[example: Broken Circuits in a Triangle Graph]
Let $G$ be a triangle with edges ordered $e_1<e_2<e_3$, and let $M(G)$ be its graphic matroid. In a graphic matroid, circuits are the edge sets of simple cycles, so the only circuit is
\begin{align*}
\{e_1,e_2,e_3\}.
\end{align*}
Since $\min\{e_1,e_2,e_3\}=e_1$, the corresponding broken circuit is
\begin{align*}
\{e_1,e_2,e_3\}\setminus\{e_1\}=\{e_2,e_3\}.
\end{align*}
Now list the subsets by size and remove exactly those containing $\{e_2,e_3\}$. In size $0$, the only subset is $\varnothing$, and it contains no broken circuit. In size $1$, the subsets are
\begin{align*}
\{e_1\},\{e_2\},\{e_3\},
\end{align*}
and none contains $\{e_2,e_3\}$. In size $2$, the subsets are
\begin{align*}
\{e_1,e_2\},\{e_1,e_3\},\{e_2,e_3\},
\end{align*}
so exactly the first two are no-broken-circuit sets. In size $3$, the only subset is $\{e_1,e_2,e_3\}$, and it contains $\{e_2,e_3\}$, so it is not a no-broken-circuit set.
Thus the numbers of no-broken-circuit subsets in sizes $0,1,2$ are
\begin{align*}
w_0=1,\quad w_1=3,\quad w_2=2.
\end{align*}
The triangle graph is connected on three vertices, so the rank of its graphic matroid is
\begin{align*}
r(M(G))=3-1=2.
\end{align*}
By the *[Whitney Broken Circuit Theorem](/theorems/8113)*, the characteristic polynomial has the form
\begin{align*}
\chi(t)=w_0t^2-w_1t+w_2.
\end{align*}
Substituting the counts gives
\begin{align*}
\chi(t)=t^2-3t+2.
\end{align*}
Finally,
\begin{align*}
(t-1)(t-2)=t^2-2t-t+2=t^2-3t+2,
\end{align*}
so
\begin{align*}
\chi(t)=(t-1)(t-2).
\end{align*}
The missing two subsets are exactly the cancellation pattern: $\{e_2,e_3\}$ is the broken circuit itself, and the full circuit contains it.
[/example]
## Deletion, Contraction, and Crapo's Beta Invariant
The final problem is recursive computation. In graph theory, chromatic polynomials satisfy deletion-contraction; the same mechanism works for matroids and therefore for geometric lattices arising from them.
[definition: Deletion and Contraction of a Matroid]
Let $M$ be a matroid on a finite ground set $E$, and let $e\in E$. The deletion $M\setminus e$ is the matroid on $E\setminus\{e\}$ with rank function $r_{M\setminus e}:2^{E\setminus\{e\}}\to \mathbb N\cup\{0\}$ defined by
\begin{align*}
r_{M\setminus e}(A)=r_M(A).
\end{align*}
If $e$ is not a loop, the contraction $M/e$ is the matroid on $E\setminus\{e\}$ with rank function $r_{M/e}:2^{E\setminus\{e\}}\to \mathbb N\cup\{0\}$ defined by
\begin{align*}
r_{M/e}(A)=r_M(A\cup\{e\})-r_M(\{e\}).
\end{align*}
[/definition]
Deletion asks what remains after ignoring an element; contraction asks what remains after forcing that element to be present. This motivates splitting characteristic-polynomial computations according to whether a subset contains the chosen element.
[quotetheorem:8114]
[citeproof:8114]
The recursive formula is the workhorse for computations, and its exceptional cases explain the non-loop/non-coloop hypothesis in the main recurrence. A loop gives no rank increase, so every subset $A$ pairs with $A\cup\{e\}$ and cancels; for instance, a one-loop matroid has characteristic polynomial $1-1=0$, not a deletion-contraction difference of two smaller nonzero polynomials. A coloop behaves like a free factor, so it contributes a factor $t-1$ rather than a subtraction; the one-element coloop has characteristic polynomial $t-1$. In practice, one first removes loops and factors off coloops, then applies deletion-contraction to an element that is neither. This motivates comparing it with the graph-theoretic recurrence already familiar from chromatic polynomials.
[example: Graphic Deletion Contraction]
Let $G=(V,E)$ be a graph, and let $e\in E$ be neither a loop nor a bridge. The graphic matroid satisfies the deletion-contraction identity
\begin{align*}
\chi_{M(G)}(t)=\chi_{M(G\setminus e)}(t)-\chi_{M(G/e)}(t).
\end{align*}
For any graph $H$, its chromatic polynomial and graphic-matroid characteristic polynomial are related by
\begin{align*}
P_H(t)=t^{c(H)}\chi_{M(H)}(t),
\end{align*}
where $c(H)$ is the number of connected components of $H$. Since $e$ is not a bridge, deleting $e$ does not change the number of connected components:
\begin{align*}
c(G\setminus e)=c(G).
\end{align*}
Since $e$ is not a loop, contracting $e$ also preserves the number of connected components:
\begin{align*}
c(G/e)=c(G).
\end{align*}
Multiplying the matroid identity by $t^{c(G)}$ therefore gives
\begin{align*}
t^{c(G)}\chi_{M(G)}(t)=t^{c(G)}\chi_{M(G\setminus e)}(t)-t^{c(G)}\chi_{M(G/e)}(t).
\end{align*}
Using the three equal component counts, this becomes
\begin{align*}
P_G(t)=P_{G\setminus e}(t)-P_{G/e}(t).
\end{align*}
Thus the graphic-matroid recurrence is exactly the usual chromatic deletion-contraction recurrence for a non-loop, non-bridge edge.
For a cycle graph $C_n$ with $n\ge 3$, choose an edge $e$. Deleting $e$ leaves a tree on $n$ vertices, so
\begin{align*}
P_{C_n\setminus e}(t)=t(t-1)^{n-1}.
\end{align*}
Contracting $e$ gives $C_{n-1}$, with the base case $P_{C_2}(t)=t(t-1)$ for the two-parallel-edge cycle. Hence
\begin{align*}
P_{C_n}(t)=t(t-1)^{n-1}-P_{C_{n-1}}(t).
\end{align*}
Iterating this recurrence yields
\begin{align*}
P_{C_n}(t)=(t-1)^n+(-1)^n(t-1).
\end{align*}
Since $C_n$ is connected, $P_{C_n}(t)=t\chi_{M(C_n)}(t)$, and therefore
\begin{align*}
\chi_{M(C_n)}(t)=\frac{(t-1)^n+(-1)^n(t-1)}{t}.
\end{align*}
This shows concretely how deletion-contraction turns one cyclic dependence into the reduced characteristic polynomial of the cycle matroid.
[/example]
Deletion-contraction also suggests looking for invariants governed by small recurrences. This motivates Crapo's beta invariant, which extracts the first residual coefficient after the universal root at $t=1$.
[definition: Crapo Beta Invariant]
Let $\mathcal M_{\mathrm{lf}}^{+}$ be the class of nonempty finite loopless matroids. Crapo's beta invariant is the function $\beta:\mathcal M_{\mathrm{lf}}^{+}\to \mathbb Z$ defined as follows: for $M\in \mathcal M_{\mathrm{lf}}^{+}$ of rank $r$ with characteristic polynomial $\chi_M(t)$,
\begin{align*}
\beta(M)=(-1)^{r-1}\chi_M'(1).
\end{align*}
[/definition]
The derivative at $1$ extracts the residual information after the root at $t=1$. The first structural theorem records the two basic ways this residual information vanishes: a nontrivial direct-sum decomposition and a coloop factor.
These vanishing mechanisms are useful because they separate product-like matroid structure from genuinely connected cyclic dependence. The theorem below makes that separation precise before the cycle example shows a positive case.
We quote the result as the bridge from the definition of $\beta$ to computations with characteristic polynomials: it tells us exactly what must be checked before interpreting a beta value as evidence of connected matroid structure. In the next example, this lets the cycle matroid calculation focus on the polynomial derivative rather than on excluding direct-sum or coloop degeneracies.
[quotetheorem:8115]
[citeproof:8115]
The beta invariant is not just another coefficient; it sees whether the characteristic polynomial has more than the universal first-order vanishing at $t=1$. The theorem gives the main zero mechanisms needed here. The full positivity criterion says that, for loopless finite matroids with at least two elements, these are exactly the obstructions to positive beta. We will use the vanishing direction explicitly and then test the invariant on the simplest connected family with one cyclic dependence.
[example: Beta Invariant of a Cycle]
Let $M(C_n)$ be the graphic matroid of a cycle graph with $n\ge 3$ edges. The graph has $n$ vertices and one connected component, so the rank of its graphic matroid is
\begin{align*}
r(M(C_n))=|V|-c(C_n)=n-1.
\end{align*}
From the cycle characteristic-polynomial computation,
\begin{align*}
\chi_{M(C_n)}(t)=\frac{(t-1)^n+(-1)^n(t-1)}{t}.
\end{align*}
Set
\begin{align*}
f(t)=(t-1)^n+(-1)^n(t-1).
\end{align*}
Then $\chi_{M(C_n)}(t)=f(t)/t$, and
\begin{align*}
f(1)=(1-1)^n+(-1)^n(1-1)=0+0=0.
\end{align*}
Differentiating term by term gives
\begin{align*}
f'(t)=n(t-1)^{n-1}+(-1)^n.
\end{align*}
Since $n\ge 3$, we have $n-1\ge 2$, so
\begin{align*}
f'(1)=n(1-1)^{n-1}+(-1)^n=0+(-1)^n=(-1)^n.
\end{align*}
Using the quotient rule for $\chi(t)=f(t)/t$,
\begin{align*}
\chi_{M(C_n)}'(t)=\frac{t f'(t)-f(t)}{t^2}.
\end{align*}
Evaluating at $t=1$ gives
\begin{align*}
\chi_{M(C_n)}'(1)=\frac{1\cdot f'(1)-f(1)}{1^2}=(-1)^n-0=(-1)^n.
\end{align*}
By the definition of Crapo's beta invariant,
\begin{align*}
\beta(M(C_n))=(-1)^{r(M(C_n))-1}\chi_{M(C_n)}'(1).
\end{align*}
Substituting $r(M(C_n))=n-1$ and $\chi_{M(C_n)}'(1)=(-1)^n$ yields
\begin{align*}
\beta(M(C_n))=(-1)^{n-2}(-1)^n=(-1)^{2n-2}=1.
\end{align*}
Thus every cycle matroid has beta invariant $1$, matching the fact that a cycle contains exactly one essential cyclic dependence.
[/example]
## Hyperplane Arrangements and Intersection Lattices
The last viewpoint asks why geometric lattices appear in geometry. A finite collection of hyperplanes decomposes affine or projective space into incidence data, and the intersections form a lattice after reversing inclusion.
[definition: Intersection Lattice of a Central Arrangement]
Let $\mathcal A$ be a finite central hyperplane arrangement in a vector space $V$ over a field $k$. The intersection lattice $L(\mathcal A)$ is the set of all subspaces of the form
\begin{align*}
\bigcap_{H\in \mathcal B}H
\end{align*}
for $\mathcal B\subseteq \mathcal A$, ordered by reverse inclusion.
[/definition]
Reverse inclusion makes the whole space the bottom element and the deepest intersections higher in the lattice. This motivates relating the arrangement to a representable matroid through the normal vectors of its hyperplanes.
[quotetheorem:8116]
[citeproof:8116]
This theorem closes the circle from linear algebra back to lattices, and centrality is the condition that keeps the intersection poset in the same flat-lattice form. Because all hyperplanes pass through the origin, every intersection is a subspace, reverse inclusion gives a ranked lattice with bottom element $V$, and annihilators translate intersections into spans of defining linear forms. For a noncentral affine arrangement, intersections are affine subspaces and some expected intersections may be empty; the intersection poset can still be studied, but it needs an affine or coned arrangement construction before the geometric-lattice statement applies in this form. The theorem is also not a representation theorem for all geometric lattices: only representable matroids over the chosen field arise from central arrangements by this construction. To anchor the correspondence, we compute the arrangement whose intersection lattice has no dependencies.
[example: Coordinate Hyperplane Arrangement]
Let $\mathcal A$ be the coordinate arrangement in $k^n$, with $H_i=\{x\in k^n:x_i=0\}$ for $1\le i\le n$. For each subset $S\subseteq \{1,\dots,n\}$, write
\begin{align*}
X_S=\bigcap_{i\in S}H_i=\{x\in k^n:x_i=0\text{ for every }i\in S\}.
\end{align*}
If $S\subseteq T$, then every coordinate forced to vanish by $S$ is also forced to vanish by $T$, so
\begin{align*}
X_T\subseteq X_S.
\end{align*}
Because $L(\mathcal A)$ is ordered by reverse inclusion, this means
\begin{align*}
X_S\le X_T\text{ in }L(\mathcal A)\quad\Longleftrightarrow\quad S\subseteq T.
\end{align*}
Thus $S\mapsto X_S$ is order-preserving from the Boolean lattice $B_n$ to $L(\mathcal A)$. It is surjective by the definition of $L(\mathcal A)$. It is injective because if $S\ne T$ and $i\in S\setminus T$, then the standard basis vector $e_i$ lies in $X_T$ but not in $X_S$. Hence $L(\mathcal A)$ is isomorphic to $B_n$.
Under this identification, the bottom element is $X_\varnothing=k^n$, and $X_S$ has rank $|S|$. For $m=|S|$, the interval $[X_\varnothing,X_S]$ corresponds to all subsets of $S$. Therefore $\mu(X_\varnothing,X_\varnothing)=1$, and for $m>0$ the Möbius recursion gives
\begin{align*}
0=\sum_{T\subseteq S}\mu(X_\varnothing,X_T)=\mu(X_\varnothing,X_S)+\sum_{T\subsetneq S}(-1)^{|T|}.
\end{align*}
Grouping proper subsets by size gives
\begin{align*}
\sum_{T\subsetneq S}(-1)^{|T|}=\sum_{j=0}^{m-1}\binom{m}{j}(-1)^j.
\end{align*}
Since
\begin{align*}
0=(1-1)^m=\sum_{j=0}^{m}\binom{m}{j}(-1)^j,
\end{align*}
we get
\begin{align*}
\sum_{j=0}^{m-1}\binom{m}{j}(-1)^j=-(-1)^m.
\end{align*}
Substituting this into the recursion gives
\begin{align*}
\mu(X_\varnothing,X_S)=(-1)^m=(-1)^{|S|}.
\end{align*}
The top rank is $n$, so
\begin{align*}
\chi_{L(\mathcal A)}(t)=\sum_{S\subseteq\{1,\dots,n\}}(-1)^{|S|}t^{n-|S|}.
\end{align*}
There are $\binom{n}{j}$ subsets of size $j$, hence
\begin{align*}
\chi_{L(\mathcal A)}(t)=\sum_{j=0}^{n}\binom{n}{j}(-1)^j t^{n-j}.
\end{align*}
The binomial expansion of $(t-1)^n$ is
\begin{align*}
(t-1)^n=\sum_{j=0}^{n}\binom{n}{j}t^{n-j}(-1)^j.
\end{align*}
Therefore
\begin{align*}
\chi_{L(\mathcal A)}(t)=(t-1)^n.
\end{align*}
The coordinate arrangement has no linear dependence among its defining coordinate functionals, so its intersection lattice is the arrangement version of the free matroid example.
[/example]
The chapter's main lesson is that geometric lattices are the exact meeting point of finite lattice theory and matroid theory. Atomicity supplies points, semimodularity supplies rank submodularity, the Möbius function supplies characteristic polynomials, and deletion-contraction supplies recursive computation.
# 7. Eulerian Posets and Flag Enumeration
This chapter turns from the lattice and matroid structure of ranked posets to their refined enumerative invariants. For Eulerian posets, the ordinary rank numbers do not remember enough: the useful data count chains according to exactly which ranks they visit. The main question is how much of this flag enumeration is forced by the Eulerian condition, and how the remaining information can be packaged efficiently by the $cd$-index.
## Flag Enumeration in Graded Posets
The first problem is to count chains in a graded poset without discarding the ranks at which the chains pass. Ordinary rank enumeration records how many elements lie at each rank, but a chain is sensitive to incidence between ranks. Before counting such chains, we need a way to attach rank data to a chain itself.
[definition: Rank Set of a Chain]
Let $P$ be a finite graded poset of rank $n+1$ with rank function $\rho: P \to \{0,1,\dots,n+1\}$. For a chain $C$ in $P \setminus \{\hat{0},\hat{1}\}$, its rank set is
\begin{align*}
\rho(C) := \{\rho(x) : x \in C\} \subseteq \{1,\dots,n\}.
\end{align*}
[/definition]
The deletion of $\hat{0}$ and $\hat{1}$ keeps the bookkeeping focused on the internal ranks. Once a chain has a rank set, the next invariant should count all chains whose visited ranks are a fixed subset of internal ranks. This motivates the flag $f$-vector, a refinement of the usual rank numbers.
[definition: Flag F-Vector]
Let $P$ be a finite graded poset of rank $n+1$ with minimum element $\hat{0}$ and maximum element $\hat{1}$. For $S \subseteq \{1,\dots,n\}$, the flag $f$-number $f_S(P)$ is the number of chains $C$ in $P \setminus \{\hat{0},\hat{1}\}$ such that $\rho(C)=S$. The collection $(f_S(P))_{S \subseteq \{1,\dots,n\}}$ is the flag $f$-vector of $P$.
[/definition]
The entry $f_\varnothing(P)$ is $1$, coming from the empty chain. If $S=\{i\}$, then $f_S(P)$ is the number of elements of rank $i$, so the usual rank numbers sit inside the flag $f$-vector.
[example: Boolean Flag Counts]
Let $B_m$ be the Boolean lattice of subsets of $\{1,\dots,m\}$, ranked by cardinality, with $\hat{0}=\varnothing$ and $\hat{1}=\{1,\dots,m\}$. For $S=\{s_1<\dots<s_k\}\subseteq \{1,\dots,m-1\}$, a chain with rank set $S$ is a sequence
\begin{align*}
A_1 \subset A_2 \subset \dots \subset A_k, \qquad |A_j|=s_j.
\end{align*}
Set $A_0=\varnothing$ and $A_{k+1}=\{1,\dots,m\}$. Then the successive blocks
\begin{align*}
D_j=A_j\setminus A_{j-1} \quad \text{for } 1\leq j\leq k+1
\end{align*}
are disjoint and their union is $\{1,\dots,m\}$. Their sizes are
\begin{align*}
|D_1|=s_1,\quad |D_2|=s_2-s_1,\quad \dots,\quad |D_k|=s_k-s_{k-1},\quad |D_{k+1}|=m-s_k.
\end{align*}
Conversely, any ordered partition of $\{1,\dots,m\}$ into blocks of these sizes determines the chain by
\begin{align*}
A_j=D_1\cup D_2\cup \dots \cup D_j.
\end{align*}
Thus the number of such chains is the number of ordered choices of these blocks:
\begin{align*}
f_S(B_m)=\binom{m}{s_1}\binom{m-s_1}{s_2-s_1}\cdots \binom{m-s_{k-1}}{s_k-s_{k-1}}.
\end{align*}
Expanding the binomial coefficients gives
\begin{align*}
f_S(B_m)=\frac{m!}{s_1!(s_2-s_1)!\cdots(s_k-s_{k-1})!(m-s_k)!}.
\end{align*}
Equivalently,
\begin{align*}
f_S(B_m)=\binom{m}{s_1,s_2-s_1,\dots,s_k-s_{k-1},m-s_k}.
\end{align*}
This example shows that flag enumeration records the full ordered incidence pattern between ranks, not only the individual binomial rank numbers.
[/example]
The Boolean computation is naturally additive under refinements of rank sets, which makes inclusion-exclusion unavoidable. Already for a three-rank Eulerian poset, the raw relation
\begin{align*}
f_{\{1\}}-f_{\{2\}}+f_{\{3\}}=2
\end{align*}
mixes entries from different singleton ranks, while the inclusion-exclusion basis separates the alternating cancellations by descent pattern. Thus the flag $f$-vector is convenient for direct counting, but linear identities become cleaner after Möbius inversion on subsets of ranks. This motivates the flag $h$-vector.
[definition: Flag H-Vector]
Let $P$ be a finite graded poset of rank $n+1$. For $S \subseteq \{1,\dots,n\}$, define
\begin{align*}
h_S(P) := \sum_{T \subseteq S} (-1)^{|S|-|T|} f_T(P).
\end{align*}
The collection $(h_S(P))_{S \subseteq \{1,\dots,n\}}$ is the flag $h$-vector of $P$.
[/definition]
Equivalently, for fixed $n$, the assignment $(f_S)_{S\subseteq \{1,\dots,n\}}\mapsto (h_S)_{S\subseteq \{1,\dots,n\}}$ is a linear automorphism of the vector space of functions $\mathcal{P}(\{1,\dots,n\})\to \mathbb Q$.
By Möbius inversion on the subset lattice, the two vectors contain the same information:
\begin{align*}
f_S(P)=\sum_{T\subseteq S} h_T(P).
\end{align*}
Thus every relation among flag $f$-numbers can be translated into an equivalent relation among flag $h$-numbers.
[example: Flag H-Vector of a Boolean Lattice]
For a permutation $\pi=\pi_1\pi_2\cdots \pi_m$ of $\{1,\dots,m\}$, define its descent set by
\begin{align*}
\operatorname{Des}(\pi)=\{i\in \{1,\dots,m-1\}:\pi_i>\pi_{i+1}\}.
\end{align*}
We show that $h_S(B_m)$ counts permutations with descent set exactly $S$. First write $S=\{s_1<\cdots<s_k\}$, and set $s_0=0$ and $s_{k+1}=m$. A permutation with $\operatorname{Des}(\pi)\subseteq S$ is increasing on each consecutive block of positions $\{s_{j-1}+1,\dots,s_j\}$. Once the entries in each block are chosen, their order inside that block is forced to be increasing, so the number of such permutations is
\begin{align*}
\binom{m}{s_1}\binom{m-s_1}{s_2-s_1}\cdots \binom{m-s_{k-1}}{s_k-s_{k-1}}.
\end{align*}
By the preceding Boolean flag-count computation, this number is $f_S(B_m)$.
Now apply the defining inclusion-exclusion formula for the flag $h$-vector:
\begin{align*}
h_S(B_m)=\sum_{T\subseteq S}(-1)^{|S|-|T|}f_T(B_m).
\end{align*}
Substituting the interpretation of $f_T(B_m)$ gives
\begin{align*}
h_S(B_m)=\sum_{T\subseteq S}(-1)^{|S|-|T|}\#\{\pi\in \mathfrak S_m:\operatorname{Des}(\pi)\subseteq T\}.
\end{align*}
Fix a permutation $\pi$ and write $D=\operatorname{Des}(\pi)$. Its coefficient in this sum is
\begin{align*}
\sum_{\substack{T\subseteq S\,:\,D\subseteq T}}(-1)^{|S|-|T|}.
\end{align*}
If $D\not\subseteq S$, there is no subset $T\subseteq S$ with $D\subseteq T$, so the coefficient is $0$. If $D\subseteq S$, write $T=D\cup U$ with $U\subseteq S\setminus D$. Then the coefficient becomes
\begin{align*}
\sum_{U\subseteq S\setminus D}(-1)^{|S|-|D|-|U|}.
\end{align*}
Since $\sum_{r=0}^{q}\binom{q}{r}(-1)^{q-r}=(1-1)^q$, this coefficient is
\begin{align*}
(1-1)^{|S|-|D|}.
\end{align*}
It is therefore $1$ when $D=S$ and $0$ when $D\subsetneq S$. Hence
\begin{align*}
h_S(B_m)=\#\{\pi\in \mathfrak S_m:\operatorname{Des}(\pi)=S\}.
\end{align*}
Thus the flag $h$-vector of $B_m$ refines the Eulerian distribution by recording the exact descent set, not just the number of descents.
[/example]
This descent interpretation gives a useful model, but in an arbitrary poset we often want to isolate the ranks themselves as a smaller ordered structure. Rank selection is needed because it turns a prescribed subset of ranks into an induced subposet whose maximal chains recover the corresponding flag entries. This motivates the next definition.
[definition: Rank-Selected Subposet]
Let $P$ be a finite graded poset with rank function $\rho$, and let $S$ be a subset of the rank set of $P$. The rank-selected subposet $P_S$ is the induced subposet
\begin{align*}
P_S := \{x \in P : \rho(x) \in S\}.
\end{align*}
[/definition]
When $P$ has $\hat{0}$ and $\hat{1}$, it is common to include or exclude them according to the enumerative question. For flag enumeration in this chapter, $S$ usually lies in the internal rank set $\{1,\dots,n\}$.
[example: Rank-Selected Boolean Lattices]
In $B_5$, take $S=\{2,4\}$. The rank-selected subposet consists of the elements whose ranks are $2$ or $4$, namely the $2$-subsets and $4$-subsets of $\{1,2,3,4,5\}$, ordered by inclusion. A maximal chain in this rank-selected subposet has the form $A\subset B$ with $|A|=2$ and $|B|=4$.
There are
\begin{align*}
\binom{5}{2}=\frac{5!}{2!3!}=\frac{5\cdot 4}{2\cdot 1}=10
\end{align*}
choices for $A$. Once $A$ is fixed, the complement $\{1,2,3,4,5\}\setminus A$ has $3$ elements, and $B$ is obtained by adding exactly $2$ of those $3$ elements to $A$. Thus there are
\begin{align*}
\binom{3}{2}=\frac{3!}{2!1!}=3
\end{align*}
choices for $B$ above a fixed $A$. Multiplying the independent choices gives
\begin{align*}
\binom{5}{2}\binom{3}{2}=10\cdot 3=30.
\end{align*}
By the definition of the flag $f$-number, these chains are exactly the chains in $B_5\setminus\{\hat{0},\hat{1}\}$ with rank set $\{2,4\}$, so $f_{\{2,4\}}(B_5)=30$.
[/example]
## Eulerian Posets and Alternating Intervals
The next question is which graded posets behave like face lattices of convex polytopes. The basic test is that every nontrivial interval should have the same alternating cancellation as the boundary of a polytope. This condition is captured by the word Eulerian.
[definition: Eulerian Poset]
A finite graded poset $P$ with minimum element $\hat{0}$ and maximum element $\hat{1}$ is Eulerian if every interval $[x,y]\subseteq P$ with $x<y$ has equally many elements of even and odd rank relative to $x$, except for the one-element interval.
[/definition]
Equivalently, every nontrivial interval has Möbius function $\mu(x,y)=(-1)^{\rho(y)-\rho(x)}$. This equivalence connects the present chapter with Möbius inversion from Chapters 3 and 4. The main supply of Eulerian posets should come from convex polytopes, so we next verify that face lattices satisfy this interval cancellation. This theorem supplies the geometric model for everything that follows.
[quotetheorem:8117]
[citeproof:8117]
The bounded face lattice hypothesis is essential: if the empty face or the top face is removed, the whole poset no longer has the endpoint intervals needed for the Eulerian definition. Convexity is also doing work, since an arbitrary finite regular cell complex may have intervals whose links are not spheres, and then the same alternating cancellation can fail. The theorem does not say that every Eulerian poset is a polytope face lattice; it only extracts the interval Euler characteristic property that polytopes always satisfy. That local property is exactly what will force the flag-vector relations below.
[example: Polygon Face Lattice]
Let $Q$ be a polygon with $v$ vertices and $v$ edges. Its face lattice has ranks $0,1,2,3$: the empty face has rank $0$, the vertices have rank $1$, the edges have rank $2$, and $Q$ has rank $3$. For the whole interval $[\varnothing,Q]$, the alternating rank sum is
\begin{align*}
\#\{\text{rank }0\}-\#\{\text{rank }1\}+\#\{\text{rank }2\}-\#\{\text{rank }3\}=1-v+v-1.
\end{align*}
Since $1-v+v-1=(1-1)+(-v+v)=0$, the whole interval has the required cancellation.
Now fix an edge $E$ of the polygon. The interval $[\varnothing,E]$ contains exactly four faces: $\varnothing$, the two endpoints of $E$, and $E$ itself. Measured relative to $\varnothing$, their ranks are $0,1,1,2$, so the alternating sum on this interval is
\begin{align*}
1-2+1=0.
\end{align*}
The same local check holds for every edge interval, and the vertex intervals $[\varnothing,x]$ and $[x,Q]$ have alternating sum $1-1=0$ whenever they have two elements. Thus the Eulerian property is an interval-by-interval condition, not only a statement about the total face numbers of the polygon.
[/example]
The Boolean lattice is the face lattice of a simplex, so it is the basic Eulerian example. Removing its top and bottom changes the presence of extrema, but preserves the internal incidence pattern used by rank-selected enumeration.
[example: Boolean Lattice with Top and Bottom Removed]
For $m\geq 2$, the proper part $\overline{B_m}=B_m\setminus\{\varnothing,\{1,\dots,m\}\}$ is not Eulerian as a bounded poset, because a bounded poset must have a minimum and a maximum element, while $\overline{B_m}$ has neither. For example, the singleton subsets $\{1\}$ and $\{2\}$ are both minimal elements of $\overline{B_m}$ and are incomparable, so no element of $\overline{B_m}$ lies below every element. Dually, the subsets $\{1,\dots,m\}\setminus\{1\}$ and $\{1,\dots,m\}\setminus\{2\}$ are incomparable maximal elements, so no element lies above every element.
Nevertheless, every interval inside $\overline{B_m}$ is Boolean. If $A,B\in \overline{B_m}$ and $A\subseteq B$, then
\begin{align*}
[A,B]_{\overline{B_m}}=\{X\subseteq \{1,\dots,m\}: A\subseteq X\subseteq B\}.
\end{align*}
The map
\begin{align*}
X\longmapsto X\setminus A
\end{align*}
sends this interval to the Boolean lattice of subsets of $B\setminus A$. Its inverse sends $Y\subseteq B\setminus A$ to $A\cup Y$, because
\begin{align*}
(A\cup Y)\setminus A=Y
\end{align*}
and, when $A\subseteq X\subseteq B$,
\begin{align*}
A\cup (X\setminus A)=X.
\end{align*}
Thus the local intervals retain the Boolean incidence structure even though the whole proper part is not bounded. This is why $\overline{B_m}$ remains useful for order complexes and rank-selected flag counts inside the Eulerian lattice $B_m$.
[/example]
## Bayer-Billera Relations
Flag vectors of arbitrary graded posets have $2^n$ entries in internal rank set size $n$. The Eulerian condition imposes many linear relations, and the next problem is to describe all of them. The [Bayer-Billera relations](/theorems/8118) are the complete [generalized Dehn-Sommerville relations](/theorems/8119) for Eulerian posets.
[quotetheorem:8118]
[citeproof:8118]
The condition $\{i,k\}\subseteq S\cup\{0,n+1\}$ is necessary for the proof and for the statement in this form: without it, a chain of rank set $S$ need not contain elements at the two boundary ranks, so there is no canonical interval in which to insert the missing rank. The Eulerian hypothesis is also necessary; for the three-element chain $\hat{0}<x<\hat{1}$ of rank $2$, taking $S=\varnothing$, $i=0$, and $k=2$ would give $f_{\{1\}}=2$, but in fact $f_{\{1\}}=1$. The theorem does not compute the flag vector by itself; it gives the universal linear constraints that any Eulerian flag vector must satisfy. The next result says these constraints are the only universal ones.
[example: A Three-Rank Relation]
Let $P$ be Eulerian of rank $4$, so its internal ranks are $\{1,2,3\}$. In the *Bayer-Billera Relations*, choose $S=\varnothing$, $i=0$, and $k=4$. The hypotheses hold because $\{0,4\}\subseteq \varnothing\cup\{0,4\}$ and $\varnothing\cap\{1,2,3\}=\varnothing$. The left-hand side is
\begin{align*}
\sum_{j=1}^{3}(-1)^{j-1}f_{\{j\}}(P)=f_{\{1\}}(P)-f_{\{2\}}(P)+f_{\{3\}}(P).
\end{align*}
The right-hand side is
\begin{align*}
\left(1-(-1)^{3}\right)f_{\varnothing}(P)=\left(1+1\right)f_{\varnothing}(P)=2f_{\varnothing}(P).
\end{align*}
Since $f_{\varnothing}(P)=1$, counting the empty chain, the relation becomes
\begin{align*}
f_{\{1\}}(P)-f_{\{2\}}(P)+f_{\{3\}}(P)=2.
\end{align*}
For the face lattice of a $3$-polytope, the rank-$1$ elements are vertices, the rank-$2$ elements are edges, and the rank-$3$ elements are facets. Hence $f_{\{1\}}=V$, $f_{\{2\}}=E$, and $f_{\{3\}}=F$, so the displayed identity is exactly
\begin{align*}
V-E+F=2.
\end{align*}
[/example]
It is useful to express the same constraints in the flag $h$-basis. There the relations have a symmetry form, and that form is what leads to the noncommutative $cd$-index.
[quotetheorem:8119]
[citeproof:8119]
The Eulerian hypothesis is indispensable: non-Eulerian graded posets can violate even the lowest Dehn-Sommerville equation, as the three-element chain shows. The theorem is a statement about universal linear relations only; it does not impose inequalities, integrality refinements, or positivity of coefficients. Its forward role is to justify replacing the full flag vector by a basis indexed by sparse subsets, which is exactly the indexing that the $cd$-index packages.
## The $ab$-Index and the $cd$-Index
The next problem is practical: a flag vector has many entries and the relations among them are cumbersome. For instance, the rank-three relation above is only one of a family of alternating sums, and writing all such sums directly in the symbols $f_S$ hides the descent-pattern structure. Noncommutative generating functions encode the same data in a way that remembers the order of ranks. For Eulerian posets, the $ab$-index collapses to an expression in two special combinations $c$ and $d$.
[definition: Ab-Index]
Let $P$ be a finite graded poset of rank $n+1$. For $S\subseteq \{1,\dots,n\}$, define the word $u_S=u_1u_2\cdots u_n$ in noncommuting variables $a,b$ by setting $u_i=b$ when $i\in S$ and $u_i=a$ when $i\notin S$. The $ab$-index of $P$ is
\begin{align*}
\Psi_P(a,b):=\sum_{S\subseteq \{1,\dots,n\}} h_S(P)u_S.
\end{align*}
[/definition]
For fixed $n$, this is the [linear map](/page/Linear%20Map) from functions $h:\mathcal{P}(\{1,\dots,n\})\to \mathbb Q$ to homogeneous degree-$n$ elements of the noncommutative vector space $\mathbb Q\langle a,b\rangle_n$, sending the basis function at $S$ to the word $u_S$. The notation $\mathbb Q\langle a,b\rangle$ means polynomials in the noncommuting letters $a$ and $b$ with rational coefficients, so the words $ab$ and $ba$ are distinct basis elements.
The word $u_S$ records a descent pattern in the same way that flag $h$-numbers do for Boolean lattices. Noncommutativity matters because a rank pattern is positional data, not just a count of selected ranks. For Eulerian posets, the Dehn-Sommerville relations say that many $ab$-polynomials represent the same constrained data, so the next definition introduces the smaller alphabet that captures exactly the allowed part.
[definition: Cd-Index]
Let $c=a+b$ and $d=ab+ba$, where $a$ and $b$ do not commute. If $P$ is an Eulerian poset of rank $n+1$, the $cd$-index of $P$ is the polynomial $\Phi_P(c,d)$ satisfying
\begin{align*}
\Psi_P(a,b)=\Phi_P(a+b,ab+ba).
\end{align*}
[/definition]
The definition presupposes that such a polynomial exists and is unique. This is a strong constraint: replacing the noncommuting letters $a,b$ by $c=a+b$ and $d=ab+ba$ only reaches a special subspace of all $ab$-polynomials, so a general flag enumerator need not have a $cd$-form. The Eulerian condition supplies exactly the Dehn-Sommerville relations needed to land in that subspace, and the theorem makes this existence and uniqueness statement precise.
[quotetheorem:8120]
[citeproof:8120]
The Eulerian hypothesis cannot be dropped: for example, the non-Eulerian three-element chain of rank $2$ has $ab$-index $a+b$, while the required degree-one $cd$-space is spanned by $c=a+b$, so low rank still works accidentally, but in degree $2$ a generic word such as $ab$ cannot be expressed as a polynomial in $c$ and $d$ alone because $d=ab+ba$ forces the coefficients of $ab$ and $ba$ to match after subtracting multiples of $c^2$. The theorem does not say that the $cd$-coefficients are nonnegative; positivity is a deeper result for important classes such as convex polytopes. What it gives here is the exact algebraic mechanism for computing with Eulerian flag data.
[example: Low-Rank Cd-Indices]
For an Eulerian poset $P$ of rank $3$, the $cd$-index has degree $2$. The degree-$2$ $cd$-monomials are exactly $c^2$ and $d$, so
\begin{align*}
\Phi_P(c,d)=\beta c^2+\alpha d.
\end{align*}
Since $c=a+b$ and $d=ab+ba$, expansion gives
\begin{align*}
\beta c^2+\alpha d=\beta(a+b)(a+b)+\alpha(ab+ba).
\end{align*}
Multiplying out the noncommutative product,
\begin{align*}
\beta(a+b)(a+b)=\beta aa+\beta ab+\beta ba+\beta bb.
\end{align*}
Hence
\begin{align*}
\beta c^2+\alpha d=\beta aa+(\beta+\alpha)ab+(\beta+\alpha)ba+\beta bb.
\end{align*}
The coefficient of $aa$ is $h_{\varnothing}(P)=f_{\varnothing}(P)=1$, so $\beta=1$. Thus every rank-$3$ Eulerian poset has
\begin{align*}
\Phi_P(c,d)=c^2+\alpha d.
\end{align*}
Now let $P=L(Q)$ be the face lattice of a polygon with $v$ vertices and $v$ edges. Its internal ranks are vertices and edges, so
\begin{align*}
f_{\{1\}}(P)=v,\qquad f_{\{2\}}(P)=v.
\end{align*}
A chain of rank set $\{1,2\}$ is an incident pair vertex $\subset$ edge. Each of the $v$ edges has exactly $2$ endpoints, so
\begin{align*}
f_{\{1,2\}}(P)=2v.
\end{align*}
Using the definition of the flag $h$-vector,
\begin{align*}
h_{\{1\}}(P)=f_{\{1\}}(P)-f_{\varnothing}(P)=v-1
\end{align*}
and
\begin{align*}
h_{\{2\}}(P)=f_{\{2\}}(P)-f_{\varnothing}(P)=v-1.
\end{align*}
Also,
\begin{align*}
h_{\{1,2\}}(P)=f_{\{1,2\}}(P)-f_{\{1\}}(P)-f_{\{2\}}(P)+f_{\varnothing}(P)=2v-v-v+1=1.
\end{align*}
Therefore the $ab$-index is
\begin{align*}
\Psi_P(a,b)=aa+(v-1)ba+(v-1)ab+bb.
\end{align*}
On the other hand,
\begin{align*}
c^2+(v-2)d=aa+ab+ba+bb+(v-2)(ab+ba).
\end{align*}
Combining coefficients gives
\begin{align*}
c^2+(v-2)d=aa+(v-1)ab+(v-1)ba+bb.
\end{align*}
This matches $\Psi_P(a,b)$, so
\begin{align*}
\Phi_P(c,d)=c^2+(v-2)d.
\end{align*}
For $v=3$ this gives the triangle index $c^2+d$, and for $v=4$ it gives the square index $c^2+2d$.
[/example]
The Boolean lattice gives the simplest possible case. Since it is the face lattice of a simplex, its flag enumeration is as constrained as possible among Eulerian lattices.
[example: Boolean Cd-Index]
The first Boolean case with two internal ranks already shows how the letter $d$ enters. For $B_3$, the internal ranks are $\{1,2\}$. There are
\begin{align*}
f_{\varnothing}(B_3)=1,\qquad f_{\{1\}}(B_3)=3,\qquad f_{\{2\}}(B_3)=3,\qquad f_{\{1,2\}}(B_3)=6.
\end{align*}
Using the definition of the flag $h$-vector,
\begin{align*}
h_{\varnothing}(B_3)=f_{\varnothing}(B_3)=1.
\end{align*}
Also,
\begin{align*}
h_{\{1\}}(B_3)=f_{\{1\}}(B_3)-f_{\varnothing}(B_3)=3-1=2.
\end{align*}
Similarly,
\begin{align*}
h_{\{2\}}(B_3)=f_{\{2\}}(B_3)-f_{\varnothing}(B_3)=3-1=2.
\end{align*}
Finally,
\begin{align*}
h_{\{1,2\}}(B_3)=f_{\{1,2\}}(B_3)-f_{\{1\}}(B_3)-f_{\{2\}}(B_3)+f_{\varnothing}(B_3)=6-3-3+1=1.
\end{align*}
Therefore
\begin{align*}
\Psi_{B_3}(a,b)=aa+2ba+2ab+bb.
\end{align*}
Since
\begin{align*}
c^2=(a+b)(a+b)=aa+ab+ba+bb
\end{align*}
and
\begin{align*}
d=ab+ba,
\end{align*}
we get
\begin{align*}
c^2+d=aa+2ab+2ba+bb.
\end{align*}
Thus
\begin{align*}
\Phi_{B_3}(c,d)=c^2+d,
\end{align*}
not $c^2$. The Boolean lattice is highly constrained because it is Eulerian, but its flag enumeration still records nontrivial incidence data beyond the single monomial $c^{m-1}$.
[/example]
## What the $cd$-Index Measures
The last question is how to read the $cd$-index as an invariant rather than as a formal compression. Its coefficients measure the independent flag-enumerative degrees of freedom that remain after the Eulerian cancellations are imposed. This makes it a refined analogue of the $h$-vector for simplicial polytopes.
[remark: Degree Bookkeeping]
The degree convention $\deg c=1$ and $\deg d=2$ matches the internal rank number $n$ of a rank $n+1$ poset. Thus the monomial $cdc$ has degree $4$ and can occur for Eulerian posets of rank $5$. The placement of the $d$ records an adjacent two-rank feature in the flag data.
[/remark]
The degree bookkeeping raises a natural counting question: how many independent slots can a degree $n$ $cd$-index have? The answer is governed by a two-step recursion, because a monomial begins either with a degree-one letter $c$ or with a degree-two letter $d$. This count explains the scale of the compression compared with the original $2^n$ flag entries.
[quotetheorem:8121]
[citeproof:8121]
The grading hypotheses are necessary for this count: if $c$ and $d$ were both assigned degree $1$, there would be $2^n$ words of degree $n$, while if the variables commuted, several noncommutative words such as $cd$ and $dc$ would collapse. The theorem counts possible coefficient slots only; it does not say which coefficient vectors are realised by face lattices, nor whether the realised coefficients are positive. Compared with the $2^n$ entries of the flag vector, $F_{n+1}$ is a substantial compression, and the $cd$-index theorem says that for Eulerian posets this smaller number is the exact number of independent flag parameters.
[example: Rank Four Cd-Index]
For an Eulerian poset $P$ of rank $5$, the internal rank set is $\{1,2,3,4\}$, so the $ab$-index has degree $4$. Since $\deg c=1$ and $\deg d=2$, the degree-$4$ $cd$-monomials are obtained by writing $4$ as an ordered sum whose parts are equal to $1$ or $2$:
\begin{align*}
1+1+1+1,\quad 1+1+2,\quad 1+2+1,\quad 2+1+1,\quad 2+2.
\end{align*}
Thus
\begin{align*}
\Phi_P(c,d)=\alpha_1c^4+\alpha_2c^2d+\alpha_3cdc+\alpha_4dc^2+\alpha_5d^2.
\end{align*}
Substituting $c=a+b$ and $d=ab+ba$ turns this into an $ab$-polynomial of degree $4$. For example,
\begin{align*}
c^2d=(a+b)(a+b)(ab+ba).
\end{align*}
First,
\begin{align*}
(a+b)(a+b)=aa+ab+ba+bb.
\end{align*}
Multiplying each term on the right by $ab+ba$ gives
\begin{align*}
c^2d=aaab+aaba+abab+abba+baab+baba+bbab+bbba.
\end{align*}
Similarly,
\begin{align*}
cdc=(a+b)(ab+ba)(a+b).
\end{align*}
Since
\begin{align*}
(a+b)(ab+ba)=aab+aba+bab+bba,
\end{align*}
we get
\begin{align*}
cdc=aaba+aabb+abaa+abab+baba+babb+bbaa+bbab.
\end{align*}
Also,
\begin{align*}
dc^2=(ab+ba)(a+b)(a+b).
\end{align*}
Using $(a+b)(a+b)=aa+ab+ba+bb$ gives
\begin{align*}
dc^2=abaa+abab+abba+abbb+baaa+baab+baba+babb.
\end{align*}
Finally,
\begin{align*}
d^2=(ab+ba)(ab+ba)=abab+abba+baab+baba.
\end{align*}
The remaining monomial is
\begin{align*}
c^4=(a+b)(a+b)(a+b)(a+b),
\end{align*}
which expands to the sum of all $16$ length-$4$ words in $a$ and $b$. By *Existence of the Cd-Index*, the five coefficients $\alpha_1,\dots,\alpha_5$ uniquely determine $\Psi_P(a,b)$. Since
\begin{align*}
\Psi_P(a,b)=\sum_{S\subseteq\{1,2,3,4\}}h_S(P)u_S,
\end{align*}
the coefficient of each length-$4$ word $u_S$ gives the corresponding flag $h$-number $h_S(P)$. Then Möbius inversion gives
\begin{align*}
f_S(P)=\sum_{T\subseteq S}h_T(P)
\end{align*}
for every $S\subseteq\{1,2,3,4\}$. Thus these five $cd$-coefficients determine all $16$ flag $h$-numbers and all $16$ flag $f$-numbers.
[/example]
The chapter's main lesson is that Eulerian posets are rigid enough for strong linear identities, but still rich enough to require more than ordinary rank enumeration. Face lattices of polytopes motivate the Eulerian condition; rank-selected subposets and flag vectors expose the refined data; the Bayer-Billera relations describe all forced cancellations; and the $cd$-index records exactly the remaining information.
# 8. Order Complexes and Poset Topology
This chapter turns the enumerative invariants of finite posets into topology. Chapters 3 and 4 used intervals, chains, and the Möbius function as algebraic data inside the incidence algebra. Here the same chains become simplices in a topological space, so that Möbius inversion is tied to Euler characteristic, and lattice-theoretic structure can be studied using homotopy type.
The guiding question is: what does the space of chains in a poset remember about the poset? For many important lattices, the answer is surprisingly rigid: Boolean intervals give spheres, partition lattices give wedges of spheres, and subspace lattices give the spherical buildings of linear algebraic groups.
## Chains as Simplices
Given a finite poset $P$, the first problem is to convert order-theoretic data into a space without losing the incidence information. A naive graph with edges between comparable elements loses higher chains: it cannot distinguish a three-element chain from a triangle of pairwise incidences unless we remember which comparable sets occur simultaneously. Chains are the natural candidates for faces, because every subchain of a chain is again a chain.
[definition: Order Complex]
Let $P$ be a finite poset. The order complex $\Delta(P)$ is the abstract simplicial complex whose vertices are the elements of $P$ and whose faces are the finite chains $x_0 < x_1 < \cdots < x_k$ in $P$.
[/definition]
Thus a $k$-simplex in $\Delta(P)$ is a strict chain of length $k+1$ in $P$. The empty chain is the empty face, as in every abstract simplicial complex.
[example: Three Element Chain]
Let $P=\{a<b<c\}$. The chains with one element are $\{a\}$, $\{b\}$, and $\{c\}$, so these are the vertices of $\Delta(P)$. The chains with two elements are
\begin{align*}
a<b,\qquad a<c,\qquad b<c,
\end{align*}
so $\Delta(P)$ has the three edges $ab$, $ac$, and $bc$. The chain
\begin{align*}
a<b<c
\end{align*}
has three elements, so it gives the $2$-simplex $abc$.
Thus every nonempty subset of $\{a,b,c\}$ is a chain in $P$, and $\Delta(P)$ is the full $2$-simplex on the vertices $a,b,c$. Geometrically this is a filled triangle, which is contractible by straight-line contraction to any chosen point in the triangle. The example shows why keeping both endpoints of a bounded poset can collapse the order complex to a cone-like, topologically uninteresting space.
[/example]
The filled triangle in the example has a vertex comparable with everything else, so all chains can be expanded toward an endpoint. That phenomenon occurs in every bounded poset: if the minimum or maximum is kept, it acts as a cone point. The following definition isolates the endpoint-free part whose chains can carry the internal topology of the bounded poset.
[definition: Proper Part]
Let $P$ be a finite poset with minimum element $\hat{0}$ and maximum element $\hat{1}$. The proper part of $P$ is
\begin{align*}
\overline{P} := P \setminus \{\hat{0},\hat{1}\}.
\end{align*}
[/definition]
The order complex $\Delta(\overline{P})$ is the main topological object attached to a bounded poset. In a ranked lattice, its faces are precisely the flags of non-extreme elements.
[example: Boolean Interval]
Let $B_n$ be the Boolean lattice of all subsets of $\{1,\dots,n\}$, ordered by inclusion, and assume $n\ge 2$. Its proper part is
\begin{align*}
\overline{B_n}=\{A\subseteq \{1,\dots,n\}: A\ne \varnothing \text{ and } A\ne \{1,\dots,n\}\}.
\end{align*}
If $\sigma$ is the $(n-1)$-simplex with vertex set $\{1,\dots,n\}$, then the nonempty proper subsets of $\{1,\dots,n\}$ are exactly the nonempty faces of the boundary $\partial\sigma$. Thus $\overline{B_n}$ is the face poset of $\partial\sigma$, and a chain
\begin{align*}
A_0\subsetneq A_1\subsetneq \cdots \subsetneq A_k
\end{align*}
is precisely a simplex in the barycentric subdivision of $\partial\sigma$. Therefore $\Delta(\overline{B_n})$ is the barycentric subdivision of $\partial\sigma$, so it is homeomorphic to $\partial\sigma$ and has the homotopy type of $S^{n-2}$.
A maximal chain in $\overline{B_n}$ has one subset of each size $1,2,\dots,n-1$. Choosing an ordering $(i_1,\dots,i_n)$ of $\{1,\dots,n\}$ gives the chain
\begin{align*}
\{i_1\} \subset \{i_1,i_2\} \subset \cdots \subset \{i_1,\dots,i_{n-1}\}.
\end{align*}
Conversely, from any maximal chain $A_1\subset A_2\subset \cdots \subset A_{n-1}$ with $|A_j|=j$, the element $i_1$ is the unique element of $A_1$, and for $2\le j\le n-1$ the element $i_j$ is the unique element of $A_j\setminus A_{j-1}$; the remaining element of $\{1,\dots,n\}\setminus A_{n-1}$ is $i_n$. Hence the facets of $\Delta(\overline{B_n})$ are indexed by the permutations of $\{1,\dots,n\}$.
[/example]
The Boolean example is the model case for many later results: an interval behaves topologically like a sphere when its chains assemble into a pure, well-connected complex.
## Barycentric Subdivision, Links, and Intervals
The next question is how standard operations on simplicial complexes appear in the language of posets. The answer is that barycentric subdivision and links are order complexes of naturally associated posets and intervals.
[definition: Face Poset]
Let $K$ be a finite simplicial complex. The face poset $F(K)$ is the poset of nonempty faces of $K$, ordered by inclusion.
[/definition]
The face poset records incidence but forgets the original geometric realisation of $K$. To justify replacing a complex by this poset, we must know that returning to chains gives the same topology after subdivision. The next theorem identifies the exact simplicial complex produced by this round trip.
[quotetheorem:8122]
[citeproof:8122]
This theorem explains why order complexes are not an artificial construction: every finite simplicial complex appears, up to subdivision, as the order complex of a poset. The hypothesis that the face poset uses nonempty faces matters: if the empty face were included as a least element, the order complex would be a cone and hence contractible even when $K$ is a circle or sphere. Finiteness is also part of the statement because $F(K)$ is then a finite poset and the subdivision is a finite simplicial complex. For an infinite complex, the same combinatorial description of chains can be written down, but extra local finiteness and realisation choices are needed before it gives the same compact topological object. The theorem does not say that $K$ and $F(K)$ are the same object; it says that the incidence data in $F(K)$ reconstructs the same topology after subdivision. This is the dictionary that lets us translate local questions about simplicial complexes into interval questions in posets.
[example: Face Poset of a Triangle]
Let $K$ be the $2$-simplex with vertex set $\{1,2,3\}$. Its nonempty faces are the three vertices
\begin{align*}\{1\},\{2\},\{3\},\end{align*}
the three edges
\begin{align*}\{1,2\},\{1,3\},\{2,3\},\end{align*}
and the top face
\begin{align*}\{1,2,3\}.\end{align*}
Thus $F(K)$ has seven elements, ordered by inclusion.
A maximal chain in $F(K)$ must start with a singleton, then pass through a two-element face containing it, and end at $\{1,2,3\}$. For example,
\begin{align*}\{1\}\subset \{1,2\}\subset \{1,2,3\}\end{align*}
is a chain with three elements, so it is a $2$-simplex in $\Delta(F(K))$. The maximal chains are exactly
\begin{align*}\{1\}\subset \{1,2\}\subset \{1,2,3\},\quad \{2\}\subset \{1,2\}\subset \{1,2,3\},\quad \{1\}\subset \{1,3\}\subset \{1,2,3\},\quad \{3\}\subset \{1,3\}\subset \{1,2,3\},\quad \{2\}\subset \{2,3\}\subset \{1,2,3\},\quad \{3\}\subset \{2,3\}\subset \{1,2,3\}.\end{align*}
There are $3$ choices for the middle edge and $2$ choices for the singleton inside that edge, giving $3\cdot 2=6$ maximal chains. These six chains are the six small triangles in the barycentric subdivision of the original triangle.
[/example]
Local topology in an order complex is governed by intervals. To isolate the local contribution of a vertex $x$, we look at chains below and above $x$ separately.
[definition: Link in a Simplicial Complex]
Let $K$ be a simplicial complex and let $\sigma \in K$. The link of $\sigma$ in $K$ is
\begin{align*}
\operatorname{lk}_K(\sigma) := \{\tau \in K : \tau \cap \sigma = \varnothing \text{ and } \tau \cup \sigma \in K\}.
\end{align*}
[/definition]
The link records all ways to enlarge a chosen face without reusing its vertices. In an arbitrary simplicial complex this is a local construction, while in an order complex comparability forces the vertices around $x$ to split into a lower part and an upper part. The following theorem makes that split precise and explains why interval topology controls local topology.
[quotetheorem:8123]
[citeproof:8123]
The hypotheses in the link formula are doing real work. The vertex must be viewed inside an order complex: in a general simplicial complex, the vertices adjacent to $x$ need not split into a lower and an upper part, so there is no interval decomposition. Finiteness keeps all links as finite simplicial complexes, but the formula itself is combinatorial and would extend to locally finite settings with care. The theorem does not claim that the link is usually a sphere; it only decomposes the link into simpler interval pieces, which is exactly the local input needed for Cohen-Macaulay conditions.
The same principle applies to larger faces. A chain $x_0<\cdots<x_k$ cuts the poset into the intervals below $x_0$, between consecutive elements, and above $x_k$.
[remark: Links of Higher Dimensional Faces]
If $\sigma=(x_0<\cdots<x_k)$ is a face of $\Delta(P)$, then its link is the join of the order complexes of the open intervals
\begin{align*}
P_{<x_0},\quad (x_0,x_1),\quad \dots,\quad (x_{k-1},x_k),\quad P_{>x_k}.
\end{align*}
This reduces many local questions about $\Delta(P)$ to questions about open intervals.
[/remark]
## Reduced Euler Characteristic and the Möbius Function
The algebraic invariant from the incidence algebra becomes topological after we count chains with alternating signs. The problem is to identify the Möbius function $\mu(x,y)$ as an Euler characteristic of the space of chains strictly between $x$ and $y$.
[definition: Reduced Euler Characteristic]
The reduced Euler characteristic is the function
\begin{align*}
\widetilde{\chi}:\{\text{finite simplicial complexes}\}\to\mathbb Z
\end{align*}
defined as follows. For a finite simplicial complex $K$, let $f_i$ be the number of $i$-dimensional faces of $K$. Then
\begin{align*}
\widetilde{\chi}(K) := -1 + \sum_{i \ge 0} (-1)^i f_i.
\end{align*}
[/definition]
The initial $-1$ accounts for the empty face. With this convention, a contractible nonempty complex has reduced Euler characteristic $0$, while $S^d$ has reduced Euler characteristic $(-1)^d$.
[example: Sphere from a Boolean Interval]
Let $[n]=\{1,\dots,n\}$ with $n\ge 2$. The open interval $(\varnothing,[n])$ consists of the nonempty proper subsets of $[n]$, so its order complex is the barycentric subdivision of the boundary of the $(n-1)$-simplex. Hence it is homeomorphic to $S^{n-2}$.
Since $S^{d}$ has reduced homology of rank $1$ in degree $d$ and zero in all other degrees, its reduced Euler characteristic is $(-1)^d$. Taking $d=n-2$ gives
\begin{align*}
\widetilde{\chi}\bigl(\Delta((\varnothing,[n]))\bigr)=\widetilde{\chi}(S^{n-2})=(-1)^{n-2}.
\end{align*}
The Boolean Möbius value is $\mu_{B_n}(\varnothing,[n])=(-1)^n$. The signs agree because
\begin{align*}
(-1)^{n-2}=(-1)^n(-1)^{-2}=(-1)^n\cdot 1=(-1)^n.
\end{align*}
Thus the reduced Euler characteristic of the Boolean interval’s order complex gives exactly the Möbius value of the interval.
[/example]
The Boolean lattice calculation shows that the same alternating signs appear in two places: in the reduced Euler characteristic of an order complex and in the recursive inverse of the zeta function. The useful theorem is that these are not merely analogous. It gives a topological interpretation of every interval value of the Möbius function, so it is the main bridge between the first half of the course and poset topology.
[quotetheorem:8124]
[citeproof:8124]
This theorem turns topological information into enumerative information. The openness of $(x,y)$ is essential: including $x$ or $y$ would introduce a cone point, making the complex contractible and destroying the Möbius value in examples such as Boolean intervals. The theorem does not determine the homotopy type of the interval; it only reads off one invariant, the reduced Euler characteristic. If $\Delta((x,y))$ is contractible, then $\mu_P(x,y)=0$; if it is a sphere of dimension $d$, then $\mu_P(x,y)=(-1)^d$, so the next question is how to detect contractibility or spherical behavior.
[example: Möbius Function of a Boolean Lattice]
Let $A\subseteq B$ in $B_n$, and put $R=B\setminus A$, so $|R|=r$. The interval $[A,B]$ is isomorphic to $B_r$ by the map $C\mapsto C\setminus A$: indeed $A\subseteq C\subseteq B$ implies $C\setminus A\subseteq R$, and the inverse sends $D\subseteq R$ to $A\cup D$.
If $r=0$, then $A=B$, so the diagonal defining value of the Möbius function gives $\mu(A,A)=1$. Now assume $r>0$. Under the interval isomorphism, the open interval $(A,B)$ corresponds to the nonempty proper subsets of $R$. Therefore $\Delta((A,B))$ is the barycentric subdivision of the boundary of the $(r-1)$-simplex on vertex set $R$, hence has the same reduced Euler characteristic as $S^{r-2}$, with $S^{-1}$ interpreted as the empty complex when $r=1$.
By *Philip Hall Theorem*,
\begin{align*}
\mu(A,B)=\widetilde{\chi}\bigl(\Delta((A,B))\bigr).
\end{align*}
For $r>0$, the preceding identification gives
\begin{align*}
\widetilde{\chi}\bigl(\Delta((A,B))\bigr)=\widetilde{\chi}(S^{r-2})=(-1)^{r-2}.
\end{align*}
Since $(-1)^{r-2}=(-1)^r(-1)^{-2}=(-1)^r$, we obtain
\begin{align*}
\mu(A,B)=(-1)^r.
\end{align*}
Thus the Boolean lattice formula is $\mu(A,B)=1$ when $A=B$, and $\mu(A,B)=(-1)^{|B\setminus A|}$ when $A\subset B$.
[/example]
The theorem also provides a practical test for vanishing. Contractible open intervals contribute zero to the Möbius function, so many cancellations in incidence algebra are explained by a cone point in the corresponding order complex.
## Contractibility and Homotopy Equivalences
The next problem is to decide when an order complex can be simplified without changing its homotopy type. Poset maps often induce simplicial maps, and fibers of those maps control whether the topology is preserved.
[definition: Poset Map]
Let $P$ and $Q$ be posets. A poset map is an order-preserving function $f:P\to Q$, meaning that $x\le y$ in $P$ implies $f(x)\le f(y)$ in $Q$.
[/definition]
Every poset map sends chains to chains, so it induces a simplicial map $\Delta(P)\to\Delta(Q)$. Merely having a surjective poset map is not enough to preserve topology: collapsing a two-point antichain onto a one-point poset changes the order complex from two disconnected vertices to one vertex. The missing condition is that the fibers, measured in the order-theoretic lower sense, carry no topology of their own. The following result is a standard tool for replacing a complicated poset by a simpler one.
[quotetheorem:8125]
[citeproof:8125]
This result belongs to the general theory of homotopy fibers of poset maps and is usually treated in a topology course. The lower-fiber hypothesis is essential: ordinary point fibers $f^{-1}(q)$ can be contractible while the induced map still fails to capture how chains approach $q$ from below. For example, let $Q=\{q_0<q_1\}$ and let $P=\{p_0,p_1\}$ be a two-element antichain, with $f(p_i)=q_i$. Each point fiber is a singleton, hence contractible, but $\Delta(P)$ consists of two disconnected vertices while $\Delta(Q)$ is an edge. The lower fiber over $q_1$ is all of $P$, and its non-contractibility is exactly what the point fibers miss. The lemma also does not say that the posets $P$ and $Q$ are isomorphic, or even that their order complexes are homeomorphic; it gives a homotopy equivalence. In this course it is used as a black-box criterion: check contractibility of lower fibers, then replace $P$ by $Q$ without changing homotopy type, especially when closure operations remove redundant elements.
[example: A Closure Operator Gives a Homotopy Equivalence]
Let $c:P\to P$ be order-preserving, extensive, and idempotent, meaning that $x\le c(x)$ and $c(c(x))=c(x)$ for every $x\in P$. Put $Q=c(P)$ with the induced order. If $q\in Q$, then $q=c(a)$ for some $a\in P$, so
\begin{align*}
c(q)=c(c(a))=c(a)=q.
\end{align*}
Thus every element of $Q$ is fixed by $c$, and the composite $Q\hookrightarrow P\xrightarrow{c}Q$ is the identity on $Q$.
Now fix $q\in Q$. The lower fiber of $c:P\to Q$ over $Q_{\le q}$ is
\begin{align*}
c^{-1}(Q_{\le q})=\{p\in P:c(p)\le q\}.
\end{align*}
If $p\in c^{-1}(Q_{\le q})$, then extensiveness gives $p\le c(p)$, and the definition of the fiber gives $c(p)\le q$. Hence
\begin{align*}
p\le c(p)\le q,
\end{align*}
so $p\le q$. Also $q$ itself lies in the fiber, because $c(q)=q\le q$. Therefore $q$ is a maximum element of $c^{-1}(Q_{\le q})$.
A finite poset with a maximum element has contractible order complex: every chain can be enlarged by adjoining the maximum element, so the order complex is a cone with that maximum as apex. Hence every lower fiber required in the *[Quillen Fiber Lemma](/theorems/8125)* has contractible order complex, and the induced map
\begin{align*}
\Delta(P)\to \Delta(Q)
\end{align*}
is a homotopy equivalence. Since $c$ restricts to the identity on $Q$, the inclusion $Q\hookrightarrow P$ gives the inverse direction. Thus the closure operator removes the non-closed elements of $P$ without changing the homotopy type of the order complex.
[/example]
Contractibility can also be detected directly from the poset. The most common reason is the presence of a cone point.
[remark: Cone Points]
If a finite poset $P$ has an element $x$ comparable with every element of $P$, then $\Delta(P)$ is a cone with apex $x$, provided adjoining $x$ to any chain still gives a chain. In particular, if $P$ has a minimum or maximum element, then $\Delta(P)$ is contractible. This is why the proper part is used for bounded posets.
[/remark]
## Crosscuts in Lattices
For lattices, one would like to compute the topology of the whole proper part using only a small set of atoms or coatoms. The [crosscut theorem](/theorems/8126) makes this possible under a spanning condition.
[definition: Crosscut]
Let $L$ be a finite lattice. A crosscut is an antichain $C\subset L\setminus\{\hat{0},\hat{1}\}$ such that every maximal chain in $L$ meets $C$.
[/definition]
A crosscut is useful because it samples every maximal chain, but the antichain alone does not yet remember which sampled elements can occur together inside a proper part of the lattice. The meet and join of a selected subset say whether that subset still lies below some non-top element or above some non-bottom element. The following definition records this compatibility as a simplicial complex on the crosscut vertices.
[definition: Crosscut Complex]
Let $L$ be a finite lattice and let $C$ be a crosscut. The crosscut complex $\Gamma(L,C)$ is the simplicial complex on vertex set $C$ whose faces are the subsets $A\subset C$ such that $A$ does not span the whole lattice, meaning
\begin{align*}
\bigwedge A > \hat{0} \quad \text{or} \quad \bigvee A < \hat{1},
\end{align*}
with the empty meet and empty join interpreted as $\hat{1}$ and $\hat{0}$ respectively.
[/definition]
The inclusive ``or'' is part of the convention: a subset is excluded exactly when its meet is $\hat{0}$ and its join is $\hat{1}$ at the same time. For an atomic crosscut, all nonempty subsets usually have meet $\hat{0}$, so the condition reduces to the familiar rule $\bigvee A<\hat{1}$. For a coatomic crosscut, it reduces dually to $\bigwedge A>\hat{0}$. Thus the definition keeps precisely the crosscut subsets that sit inside some proper lower or upper interval of the lattice.
The remaining question is whether this smaller compatibility complex has lost any topology. The crosscut condition says that the chosen antichain meets every maximal chain, so the proper part can be covered by pieces indexed by elements of $C$. The theorem below is needed because it identifies the nerve of that cover with $\Gamma(L,C)$, turning a large order complex of chains into a smaller complex of selected crosscut subsets.
[quotetheorem:8126]
[citeproof:8126]
This theorem is valuable because $\Gamma(L,C)$ is often much smaller than $\Delta(\overline{L})$. The crosscut hypothesis is necessary in a concrete way. In the diamond lattice $B_2$ with proper elements $a$ and $b$, take the antichain $C=\{a\}$. The maximal chain $\hat{0}<b<\hat{1}$ misses $C$, so $C$ is not a crosscut. The proper-part order complex $\Delta(\overline{B_2})$ consists of two disconnected vertices $a$ and $b$, whereas $\Gamma(B_2,C)$ is the single vertex $a$, a contractible complex. Thus the conclusion fails because the missed maximal chain contributes an entire component that the nerve cannot see. The theorem does not preserve the combinatorial triangulation; it preserves homotopy type, replacing all chains in the lattice by selected subsets of atoms or coatoms. This makes Boolean, partition, and subspace lattice computations feasible.
[example: Atomic Crosscut in the Boolean Lattice]
Let $L=B_n$ with $n\ge 2$, and let $C=\{\{1\},\dots,\{n\}\}$ be the set of atoms. Every maximal chain in $B_n$ has the form
\begin{align*}
\varnothing \subset \{i_1\} \subset \{i_1,i_2\} \subset \cdots \subset \{i_1,\dots,i_n\},
\end{align*}
so it meets $C$ at the singleton $\{i_1\}$. Thus $C$ is a crosscut.
For a subset $A\subseteq C$, write
\begin{align*}
S_A=\{i\in [n]:\{i\}\in A\}.
\end{align*}
In the Boolean lattice, joins are unions and meets are intersections, so
\begin{align*}
\bigvee A=\bigcup_{\{i\}\in A}\{i\}=S_A.
\end{align*}
Hence $\bigvee A<\hat{1}=[n]$ exactly when $S_A\ne [n]$, which is exactly when $A\ne C$. Therefore every proper subset $A\subsetneq C$ is a face of $\Gamma(B_n,C)$.
For the full subset $A=C$, the join is
\begin{align*}
\bigvee C=\{1\}\cup \{2\}\cup \cdots \cup \{n\}=[n]=\hat{1}.
\end{align*}
Since $n\ge 2$, the meet is
\begin{align*}
\bigwedge C=\{1\}\cap \{2\}\cap \cdots \cap \{n\}=\varnothing=\hat{0}.
\end{align*}
Thus $C$ itself is not a face of $\Gamma(B_n,C)$. The faces are exactly the proper subsets of the $n$-element vertex set $C$, so $\Gamma(B_n,C)$ is the boundary of an $(n-1)$-simplex. By *Crosscut Theorem*, $\Delta(\overline{B_n})$ is homotopy equivalent to this boundary, hence has the homotopy type of $S^{n-2}$.
[/example]
## Wedges of Spheres and Cohen-Macaulay Posets
The strongest structural results in this part of the course say that certain order complexes are not merely computable up to Euler characteristic: they are built from spheres. The problem is to recognize when all intervals have this controlled homotopy type.
[definition: Pure Simplicial Complex]
A finite simplicial complex is pure of dimension $d$ if every maximal face has dimension $d$.
[/definition]
Ranked posets naturally produce pure order complexes when all maximal chains have the same length. Purity is the first prerequisite for Cohen-Macaulay behavior, but it is not by itself a connectivity condition: two disjoint triangles form a pure $2$-dimensional complex with the wrong $0$-dimensional homology. Conversely, checking only whole intervals can miss failures in links of nonempty faces, where local topology may split incorrectly. The stronger condition asks for the complex and all of its links to have homotopy concentrated in the top possible dimension.
[definition: Homotopy Cohen-Macaulay Poset]
Let $P$ be a finite ranked poset. The poset $P$ is homotopy Cohen-Macaulay if $\Delta(P)$ is pure and, for every face $\sigma$ of $\Delta(P)$ including the empty face, the link $\operatorname{lk}_{\Delta(P)}(\sigma)$ is either empty or homotopy equivalent to a wedge of spheres of dimension $\dim \operatorname{lk}_{\Delta(P)}(\sigma)$.
[/definition]
For a bounded ranked poset $L$, this is commonly applied to the proper part $\overline{L}$. Using the link decomposition for chains, the condition says that $\Delta(\overline{L})$ has top-dimensional wedge homotopy type and that every open interval in $L$ has the corresponding top-dimensional wedge behavior. When the dimension is $-1$, the relevant open interval is empty; this occurs for a cover relation, namely rank difference $1$.
[remark: Meaning of the Dimension Shift]
If $x<y$ and the interval $[x,y]$ has rank difference $r$, then a maximal chain in $(x,y)$ has $r-1$ elements, so it gives a simplex of dimension $r-2$. Thus the sphere dimension in the homotopy Cohen-Macaulay condition is the top possible dimension of the interval order complex.
[/remark]
For homotopy Cohen-Macaulay posets, the Möbius function does more than detect an alternating Euler characteristic: it counts top-dimensional spheres with a predictable sign. This turns the topological condition into a concrete incidence-algebra formula. The next theorem records the exact conversion and will be used to read Möbius values from shellability and building computations.
[quotetheorem:8127]
[citeproof:8127]
This result explains why shellability and Cohen-Macaulayness are powerful in incidence algebra: they turn an alternating chain count into the rank of a single homology group. The ranked hypothesis fixes the dimension in which the spheres may occur; without it, different maximal chains can force different simplex dimensions and the displayed sign has no single rank difference to use. The interval wedge hypothesis is also indispensable. A concrete failure is obtained by taking $K$ to be the disjoint union of one edge and one isolated vertex, taking the face poset $F(K)$, and adjoining a new bottom $\hat{0}$ and top $\hat{1}$. Then $(\hat{0},\hat{1})$ has order complex $\Delta(F(K))$, the barycentric subdivision of $K$. This complex is $1$-dimensional but its reduced homology is in degree $0$, since it has two connected components. Philip Hall's theorem still gives $\mu(\hat{0},\hat{1})=\widetilde{\chi}(\Delta(F(K)))=1$, but this value is not the signed number of top-dimensional spheres. The theorem does not assert that intervals are Cohen-Macaulay; it only converts that topological input into a Möbius formula once the input has been proved, for instance by shellability or by a building theorem. The partition and subspace lattices now illustrate both sources of input.
## Partition and Subspace Lattices
The final question in the chapter is how the general theory behaves on the two most important non-Boolean families: partition lattices and subspace lattices. Both have rich topology, and both are central examples in algebraic combinatorics.
[example: Proper Part of the Partition Lattice]
Let $\Pi_n$ be the lattice of set partitions of $\{1,\dots,n\}$, ordered by refinement, with $\hat{0}$ the discrete partition and $\hat{1}$ the one-block partition. The proper part $\overline{\Pi_n}$ consists of all partitions that are neither discrete nor one-block. Its order complex $\Delta(\overline{\Pi_n})$ is homotopy equivalent to a wedge of $(n-1)!$ spheres of dimension $n-3$, with the usual convention that $S^{-1}$ is the empty complex when $n=2$.
The reduced Euler characteristic of one sphere $S^d$ is $(-1)^d$, because its reduced homology has rank $1$ in degree $d$ and is zero in all other degrees. A wedge of $m$ such spheres has reduced homology of rank $m$ in degree $d$, so its reduced Euler characteristic is $m(-1)^d$. Taking $m=(n-1)!$ and $d=n-3$ gives
\begin{align*}
\widetilde{\chi}\bigl(\Delta(\overline{\Pi_n})\bigr)=(-1)^{n-3}(n-1)!.
\end{align*}
By *Philip Hall Theorem*, applied to the open interval $(\hat{0},\hat{1})=\overline{\Pi_n}$, this reduced Euler characteristic is the Möbius value:
\begin{align*}
\mu_{\Pi_n}(\hat{0},\hat{1})=\widetilde{\chi}\bigl(\Delta(\overline{\Pi_n})\bigr)=(-1)^{n-3}(n-1)!.
\end{align*}
Since $n-1=(n-3)+2$ and $(-1)^2=1$, we have
\begin{align*}
(-1)^{n-1}=(-1)^{n-3}(-1)^2=(-1)^{n-3}.
\end{align*}
Therefore
\begin{align*}
\mu_{\Pi_n}(\hat{0},\hat{1})=(-1)^{n-1}(n-1)!.
\end{align*}
Thus the shellable wedge-of-spheres topology of the proper part recovers exactly the classical Möbius value of the partition lattice.
[/example]
The partition lattice shows that the topology of a proper part may be far larger than a single sphere. The next example is the finite-field analogue of projective geometry and leads to buildings.
[example: Lattice of Nontrivial Subspaces]
Let $V$ be an $n$-dimensional vector space over $\mathbb F_q$ with $n\ge 2$, and let $L(V)$ be the lattice of subspaces ordered by inclusion. The proper part of $L(V)$ is the set of subspaces $W$ satisfying
\begin{align*}
0<W<V.
\end{align*}
Thus a simplex of $\Delta(\overline{L(V)})$ is a strict flag of nonzero proper subspaces.
A maximal chain in $L(V)$ has one subspace in each dimension $0,1,\dots,n$, so a maximal simplex in the proper part has the form
\begin{align*}
0<V_1<V_2<\cdots<V_{n-1}<V
\end{align*}
with $\dim V_i=i$. The vertices of this simplex are $V_1,\dots,V_{n-1}$, so it has $n-1$ vertices and therefore dimension
\begin{align*}
(n-1)-1=n-2.
\end{align*}
The Solomon-Tits theorem identifies $\Delta(\overline{L(V)})$ as the spherical building of type $A_{n-1}$ over $\mathbb F_q$ and gives it the homotopy type of a wedge of spheres of dimension $n-2$. Write the number of spheres as $m$. A wedge of $m$ spheres of dimension $n-2$ has reduced Euler characteristic
\begin{align*}
m(-1)^{n-2},
\end{align*}
because its reduced homology has rank $m$ in degree $n-2$ and vanishes in all other degrees.
By *Philip Hall Theorem*, applied to the open interval $(0,V)=\overline{L(V)}$,
\begin{align*}
\mu_{L(V)}(0,V)=\widetilde{\chi}\bigl(\Delta(\overline{L(V)})\bigr)=m(-1)^{n-2}.
\end{align*}
The classical Möbius value for the subspace lattice is
\begin{align*}
\mu_{L(V)}(0,V)=(-1)^n q^{\binom{n}{2}}.
\end{align*}
Since
\begin{align*}
(-1)^n=(-1)^{n-2}(-1)^2=(-1)^{n-2},
\end{align*}
the two formulas give
\begin{align*}
m(-1)^{n-2}=q^{\binom{n}{2}}(-1)^{n-2}.
\end{align*}
Multiplying both sides by $(-1)^{n-2}$ gives
\begin{align*}
m=q^{\binom{n}{2}}.
\end{align*}
Therefore $\Delta(\overline{L(V)})$ has the homotopy type of a wedge of $q^{\binom{n}{2}}$ spheres of dimension $n-2$, and
\begin{align*}
\mu_{L(V)}(0,V)=(-1)^n q^{\binom{n}{2}}.
\end{align*}
This is the finite-field analogue of the Boolean sphere computation, with complete flags replacing permutations.
[/example]
This subspace example is the bridge from poset topology to representation theory. The top homology of the building carries the Steinberg representation of $GL(V)$, so the order complex is not only a homotopy invariant but also a module with substantial algebraic structure.
[remark: What This Chapter Adds to Möbius Inversion]
Möbius inversion began as an algebraic inverse to the zeta function in the incidence algebra. Poset topology adds a second interpretation: $\mu(x,y)$ is the reduced Euler characteristic of the open interval $(x,y)$. When the interval is Cohen-Macaulay, this Euler characteristic is the signed rank of top homology, so cancellations in alternating chain sums are controlled by the geometry of the order complex.
[/remark]
# 9. Shellability
Shellability is the bridge between the combinatorics of a finite poset and the topology of its order complex. Chapter 8 attached simplicial complexes to posets and used the Möbius function to measure alternating chain counts; this chapter explains when those chain complexes can be built one facet at a time in a controlled way. The guiding question is: what kind of ordering data on the maximal chains of a poset guarantees predictable homology and computable Mobius invariants?
## Shelling Orders for Pure Simplicial Complexes
A finite simplicial complex may have many facets, and its topology can be hard to read from the list of faces. Shellability asks for an ordering of the facets in which each new facet attaches along a controlled part of its boundary, so we first isolate the complexes where all facets have the same size.
[definition: Pure Simplicial Complex]
A finite simplicial complex $\Delta$ is pure of dimension $d-1$ if every facet of $\Delta$ has cardinality $d$.
[/definition]
Purity is the condition that all maximal simplices have the same dimension. Without it, the phrase "attach a new facet along codimension one boundary" becomes ambiguous: a triangle and a loose edge in the same complex have different boundary dimensions, so no single top-dimensional attachment process controls the whole complex. In poset language purity is the topological counterpart of gradedness, and the next example explains why order complexes of graded posets are the main source of pure complexes in this chapter.
[example: A Non-Pure Complex]
Let $\Delta$ have facets $F_1=\{1,2,3\}$ and $F_2=\{4,5\}$. The cardinalities are
\begin{align*}
|F_1|=|\{1,2,3\}|=3
\end{align*}
and
\begin{align*}
|F_2|=|\{4,5\}|=2.
\end{align*}
Since the facets do not all have the same cardinality, $\Delta$ is not pure. Equivalently, $F_1$ is a $2$-simplex and $F_2$ is a $1$-simplex, so there is no single value of $d$ for which every facet has cardinality $d$.
This shows why the purity hypothesis is part of the shelling definition used here. The face interval inside $F_1$ is the Boolean lattice of subsets of a $3$-element set, while the face interval inside $F_2$ is the Boolean lattice of subsets of a $2$-element set. Thus any restriction-face bookkeeping would have to mix intervals with different top ranks, rather than tracking one uniform dimension of facet attachment.
[/example]
The non-pure example marks the first boundary of the theory: shellability here is designed for a single-dimensional attachment process. The graded-poset examples below naturally land inside that setting.
[example: Order Complex of a Graded Poset]
Let $P$ be a finite graded poset of rank $n$ with unique minimum $\hat{0}$ and unique maximum $\hat{1}$, and let $\overline{P}=P\setminus\{\hat{0},\hat{1}\}$. A facet of the order complex $\Delta(\overline{P})$ is a maximal chain in $\overline{P}$, because faces of an order complex are chains and facets are maximal faces.
Since $P$ is graded of rank $n$, every maximal chain from $\hat{0}$ to $\hat{1}$ has the form
\begin{align*}
\hat{0}<x_1<x_2<\cdots <x_{n-1}<\hat{1}.
\end{align*}
Removing the two endpoints leaves the chain
\begin{align*}
x_1<x_2<\cdots <x_{n-1}
\end{align*}
in $\overline{P}$, so the corresponding facet of $\Delta(\overline{P})$ is
\begin{align*}
\{x_1,\dots,x_{n-1}\}.
\end{align*}
Its cardinality is
\begin{align*}
|\{x_1,\dots,x_{n-1}\}|=n-1,
\end{align*}
because the elements $x_1,\dots,x_{n-1}$ are distinct elements of a strict chain. Conversely, every maximal chain in $\overline{P}$ becomes a maximal chain in $P$ after adjoining $\hat{0}$ and $\hat{1}$, so every facet of $\Delta(\overline{P})$ arises in this way and has cardinality $n-1$.
Therefore all facets of $\Delta(\overline{P})$ have the same cardinality $n-1$. By the definition of a pure simplicial complex, $\Delta(\overline{P})$ is pure with $d=n-1$, hence has dimension $d-1=n-2$. This is why graded bounded posets naturally produce pure order complexes.
[/example]
The example reduces a graded-poset question to a pure-complex question. We now need an ordering condition on facets that says exactly how each new facet is allowed to meet the part already constructed.
[definition: Shelling Order]
Let $\Delta$ be a pure finite simplicial complex with facets $F_1,\dots,F_t$, each of cardinality $d$. The ordering $F_1,\dots,F_t$ is a shelling order if for every $1 \le i < k \le t$ there exists $j < k$ such that
\begin{align*}
F_i \cap F_k \subseteq F_j \cap F_k
\end{align*}
and $|F_j \cap F_k|=d-1$.
[/definition]
This formulation says that whenever an old facet meets the new facet $F_k$, that intersection is controlled by a codimension-one intersection with some earlier facet. The codimension-one requirement is essential: if two triangles meet only in a vertex, then the second triangle is attached too thinly to be a shelling step, even though the complex is pure.
[example: Pure Complex That Is Not Shellable]
Let $\Delta$ have facets $F_1=\{1,2,3\}$ and $F_2=\{1,4,5\}$, together with all their subfaces. The two facet cardinalities are
\begin{align*}
|F_1|=|\{1,2,3\}|=3
\end{align*}
and
\begin{align*}
|F_2|=|\{1,4,5\}|=3.
\end{align*}
Thus every facet has cardinality $3$, so $\Delta$ is pure with $d=3$ and dimension $d-1=2$.
There are only two possible facet orders. In the order $F_1,F_2$, the shelling condition for $i=1$ and $k=2$ would require some $j<2$, so necessarily $j=1$, such that
\begin{align*}
F_1\cap F_2\subseteq F_j\cap F_2
\end{align*}
and
\begin{align*}
|F_j\cap F_2|=d-1=2.
\end{align*}
But
\begin{align*}
F_1\cap F_2=\{1,2,3\}\cap\{1,4,5\}=\{1\},
\end{align*}
so with $j=1$ we get
\begin{align*}
|F_j\cap F_2|=|F_1\cap F_2|=|\{1\}|=1\ne 2.
\end{align*}
Hence $F_1,F_2$ is not a shelling order.
In the reverse order $F_2,F_1$, the same calculation gives
\begin{align*}
F_2\cap F_1=\{1,4,5\}\cap\{1,2,3\}=\{1\},
\end{align*}
and therefore
\begin{align*}
|F_2\cap F_1|=|\{1\}|=1\ne 2.
\end{align*}
So the reverse order is not a shelling order either. Therefore $\Delta$ is pure but not shellable: the second triangle always attaches only at one vertex, while a two-dimensional shelling step must attach along a codimension-one face, namely an edge.
[/example]
This example separates purity from shellability: purity gives a common dimension, while shellability adds a controlled boundary attachment. To turn this geometric attachment rule into enumeration, we need to identify which faces of $F_k$ are new at the moment $F_k$ is added.
[definition: Restriction Face]
Let $F_1,\dots,F_t$ be a shelling order for a pure simplicial complex $\Delta$. For $k \ge 2$, the restriction face $R(F_k)$ is the unique minimal face of $F_k$ that is not contained in the subcomplex generated by $F_1,\dots,F_{k-1}$. By convention, $R(F_1)=\varnothing$.
[/definition]
The restriction face measures the genuinely new part of $F_k$, but it is useful only if it controls all new faces at that shelling step. The question is whether every face first appearing with $F_k$ is exactly a face between $R(F_k)$ and $F_k$, and whether different facets give disjoint contributions. That interval decomposition is the combinatorial form of the shelling rule.
[quotetheorem:8128]
[citeproof:8128]
The purity hypothesis is needed here because the restriction-face construction depends on each new facet meeting the earlier complex along a controlled part of the boundary of one simplex of fixed dimension. In a nonpure attachment, a smaller facet may already lie inside a larger old facet, or a larger facet may attach along faces of several incompatible dimensions; then there need not be a unique minimal new face whose supersets inside the new facet are exactly the new faces. For instance, if an edge is attached to a triangle at a single vertex, the new faces in the edge are the other vertex and the edge, which form an interval; but if a triangle is then attached along two disjoint vertices, the new faces inside that triangle are not all supersets of one face. The shelling hypothesis rules out exactly this kind of non-principal complement inside the Boolean lattice of a facet. The theorem does not say that every interval partition comes from a shelling, only that a pure shelling produces this particular partition; the next examples use the partition to read off restriction faces geometrically before labels enter the story.
This interval partition is the enumerative engine behind shellability: it lets us count faces by recording only the sizes of the restriction faces. Before moving to posets, it is useful to see how a shelling can create a cycle in the smallest noncontractible pure complex.
[example: Boundary of a Triangle]
Let $\Delta$ be the boundary of a triangle with facets $F_1=\{1,2\}$, $F_2=\{2,3\}$, and $F_3=\{1,3\}$. Each facet has cardinality $2$, so this is a pure one-dimensional complex with $d=2$.
We verify the shelling condition for the order $F_1,F_2,F_3$. For $k=2$, the only earlier facet is $F_1$, and
\begin{align*}
F_1\cap F_2=\{1,2\}\cap\{2,3\}=\{2\}.
\end{align*}
Thus
\begin{align*}
|F_1\cap F_2|=|\{2\}|=1=d-1.
\end{align*}
For $k=3$ and $i=1$, choose $j=1$; then
\begin{align*}
F_1\cap F_3=\{1,2\}\cap\{1,3\}=\{1\},
\end{align*}
so $F_1\cap F_3\subseteq F_1\cap F_3$ and
\begin{align*}
|F_1\cap F_3|=|\{1\}|=1=d-1.
\end{align*}
For $k=3$ and $i=2$, choose $j=2$; then
\begin{align*}
F_2\cap F_3=\{2,3\}\cap\{1,3\}=\{3\},
\end{align*}
so $F_2\cap F_3\subseteq F_2\cap F_3$ and
\begin{align*}
|F_2\cap F_3|=|\{3\}|=1=d-1.
\end{align*}
Therefore $F_1,F_2,F_3$ is a shelling order.
Now compute the restriction faces. By convention,
\begin{align*}
R(F_1)=\varnothing.
\end{align*}
Before $F_2$ is added, the subcomplex generated by $F_1$ contains $\varnothing$, $\{1\}$, $\{2\}$, and $\{1,2\}$. The faces of $F_2$ are $\varnothing$, $\{2\}$, $\{3\}$, and $\{2,3\}$, so the faces of $F_2$ that are not already present are $\{3\}$ and $\{2,3\}$. The unique minimal one is $\{3\}$, hence
\begin{align*}
R(F_2)=\{3\}.
\end{align*}
Before $F_3$ is added, the subcomplex generated by $F_1$ and $F_2$ contains $\varnothing$, $\{1\}$, $\{2\}$, $\{3\}$, $\{1,2\}$, and $\{2,3\}$. The faces of $F_3$ are $\varnothing$, $\{1\}$, $\{3\}$, and $\{1,3\}$, so the only new face is $\{1,3\}$. Therefore
\begin{align*}
R(F_3)=\{1,3\}=F_3.
\end{align*}
The last restriction face is the whole final edge, so the last shelling step closes the triangle boundary into a one-dimensional cycle rather than merely adding an edge with one new endpoint.
[/example]
## Lexicographic Shellability of Posets
For order complexes, the facets are maximal chains. Rather than trying to order these chains directly, we put labels on cover relations and use lexicographic order. The main question is: what local condition on labels forces a global shelling order of the order complex?
[definition: Edge Labeling]
Let $P$ be a finite poset. An edge labeling of $P$ is a map $\lambda$ from the cover relations $x \lessdot y$ of $P$ to a totally ordered set $\Lambda$.
[/definition]
An edge labeling assigns a word to each maximal chain by reading labels from bottom to top. To recognize which word should come first in each interval, we need a vocabulary for chains whose labels increase or decrease.
[definition: Rising and Falling Chains]
Let $P$ be a finite graded poset with an edge labeling $\lambda$. A maximal chain
\begin{align*}
x_0 \lessdot x_1 \lessdot \cdots \lessdot x_r
\end{align*}
in an interval $[x_0,x_r]$ is rising if
\begin{align*}
\lambda(x_0,x_1) \le \lambda(x_1,x_2) \le \cdots \le \lambda(x_{r-1},x_r),
\end{align*}
and falling if
\begin{align*}
\lambda(x_0,x_1) \ge \lambda(x_1,x_2) \ge \cdots \ge \lambda(x_{r-1},x_r).
\end{align*}
[/definition]
Rising chains are intended to be the lexicographically first chains in intervals, while falling chains will later detect the Mobius function. To use labels as a certificate, existence of a rising chain is not enough: an interval with two rising chains would give no canonical first attachment in a shelling order. The required local axiom is uniqueness together with lexicographic minimality on every positive-rank interval.
[definition: EL-Labeling]
Let $P$ be a finite graded poset. An edge labeling $\lambda$ of $P$ is an EL-labeling if every positive-rank interval $[x,y]$ has a unique rising maximal chain, and this chain is lexicographically first among all maximal chains in $[x,y]$.
[/definition]
The uniqueness clause cannot be omitted. For example, label every cover relation in the Boolean lattice $B_2$ by the same label. Then both maximal chains from $\varnothing$ to $\{1,2\}$ are rising and both have the same lexicographic word, so the labeling is not an EL-labeling even though the underlying poset is shellable. Thus EL-shellability is a certificate, not merely a property of the unlabeled poset.
The condition is local over intervals, but the desired conclusion is global: maximal chains of the whole poset must be ordered so that each new simplex attaches along a pure codimension-one boundary piece. The problem is to turn the unique rising chain inside every interval into the replacement step required by shellability. Lexicographic order supplies the candidate shelling, and the theorem verifies that the local uniqueness axiom controls every attachment.
[quotetheorem:6507]
[citeproof:6507]
Each hypothesis has a specific role. Gradedness and the existence of $\hat{0},\hat{1}$ make the order complex of the proper part pure; without gradedness, maximal chains can have different lengths and the shelling criterion above is not even the right pure-complex criterion. The unique rising chain condition is not cosmetic: the constant labeling on $B_2$ gives multiple rising chains in the top interval and therefore does not determine the local replacement used in the proof. The theorem does not say that every shellable poset admits an EL-labeling; later CL-labelings and recursive atom orderings are introduced precisely because some shellings require labels that depend on the previously chosen root.
This theorem turns a topological problem into a labeling problem. The Boolean lattice gives the model example because its cover labels remember which element was inserted.
[example: Boolean Lattice]
Let $B_n$ be the Boolean lattice of subsets of $\{1,\dots,n\}$ ordered by inclusion. Label each cover relation
\begin{align*}
S\lessdot S\cup\{i\}
\end{align*}
by $i$. Fix an interval $[A,B]$, and write
\begin{align*}
B\setminus A=\{b_1,\dots,b_r\}
\end{align*}
with
\begin{align*}
b_1<b_2<\cdots<b_r.
\end{align*}
A maximal chain in $[A,B]$ adds the elements of $B\setminus A$ one at a time, so it has the form
\begin{align*}
A\subset A\cup\{c_1\}\subset A\cup\{c_1,c_2\}\subset\cdots\subset A\cup\{c_1,\dots,c_r\}=B,
\end{align*}
where $(c_1,\dots,c_r)$ is a permutation of $(b_1,\dots,b_r)$. Its label word is exactly
\begin{align*}
c_1c_2\cdots c_r.
\end{align*}
This chain is rising precisely when
\begin{align*}
c_1\le c_2\le\cdots\le c_r.
\end{align*}
Since the $c_i$ are distinct, this is equivalent to
\begin{align*}
c_1<c_2<\cdots<c_r,
\end{align*}
so the only rising maximal chain is
\begin{align*}
A\subset A\cup\{b_1\}\subset A\cup\{b_1,b_2\}\subset\cdots\subset A\cup\{b_1,\dots,b_r\}=B.
\end{align*}
Its label word is
\begin{align*}
b_1b_2\cdots b_r.
\end{align*}
To see that this word is lexicographically first, take any other label word $c_1\cdots c_r$ and let $q$ be the first index with $c_q\ne b_q$. Then $c_1=b_1,\dots,c_{q-1}=b_{q-1}$, so among the unused elements the smallest is $b_q$. Since $c_q$ is unused and is not $b_q$, we have
\begin{align*}
b_q<c_q.
\end{align*}
Thus
\begin{align*}
b_1\cdots b_{q-1}b_q
\end{align*}
is lexicographically smaller than
\begin{align*}
c_1\cdots c_{q-1}c_q.
\end{align*}
Therefore every interval has a unique rising maximal chain, and that chain is lexicographically first. Hence this labeling is an EL-labeling of $B_n$. By the *[EL-Shellability Theorem](/theorems/6507)*, the lexicographic order of maximal chains shells the order complex of the proper part of $B_n$.
[/example]
EL-labelings are powerful but sometimes too rigid because they label individual edges. To handle posets where the right label depends on the path already taken, we let edge labels remember the rooted chain below the edge.
[definition: CL-Labeling]
Let $P$ be a finite graded poset with unique minimum $\hat{0}$, and let $\Lambda$ be a totally ordered set. A rooted interval is a pair $([x,y],\rho)$ where $x<y$ in $P$ and
\begin{align*}
\rho:\hat{0}=u_0\lessdot u_1\lessdot\cdots\lessdot u_s=x
\end{align*}
is a saturated chain ending at $x$.
A rooted cover relation is a triple $(\rho,x\lessdot z)$, where $([x,y],\rho)$ is a rooted interval for some $y\ge z$ and $x\lessdot z$ is a cover relation in $P$. A chain-edge labeling is a map
\begin{align*}
\lambda:\{(\rho,x\lessdot z):(\rho,x\lessdot z)\text{ is a rooted cover relation}\}\to\Lambda.
\end{align*}
It is a CL-labeling if, for every rooted interval $([x,y],\rho)$, the maximal chains from $x$ to $y$, labeled by extending the root $\rho$ as the chain is traversed, have a unique rising chain, and that rising chain is lexicographically first among all maximal chains from $x$ to $y$ with root $\rho$.
[/definition]
A CL-labeling gives the same shellability conclusion as an EL-labeling, with rooted intervals replacing ordinary intervals. This extension is needed because the shelling proof only requires a lexicographically first rising chain after the lower part of the maximal chain has been fixed.
[quotetheorem:6510]
[citeproof:6510]
The bounded graded hypotheses again ensure that maximal chains give facets of one dimension in the proper-part order complex; without them the statement would not produce a pure shelling. The rooted uniqueness hypothesis is the CL analogue of the EL uniqueness condition: if two rooted rising chains are tied, the local replacement step no longer chooses a controlled earlier chain. The theorem does not claim that the labels ignore history; its point is exactly that root-dependence still gives a shelling, and this prepares the recursive atom-ordering certificate below.
CL-labelings still require labels, and in lattices it is often more natural to specify the order in which atoms should appear. This leads to a recursive certificate for CL-shellability.
[definition: Recursive Atom Ordering]
Let $P$ be a finite bounded graded poset. A recursive atom ordering of $P$ is an ordering $a_1,\dots,a_m$ of the atoms of $P$ satisfying the following recursive conditions.
If $P$ has rank $1$, every ordering of its atoms is a recursive atom ordering. If $P$ has rank greater than $1$, then for each $i$, the interval $[a_i,\hat{1}]$ has a recursive atom ordering in which all atoms that lie above some earlier atom $a_j$ with $j<i$ occur before the remaining atoms of $[a_i,\hat{1}]$.
For every $i<j$ and every $y\in P$ with $a_i<y$ and $a_j<y$, there exist $k<j$ and an atom $z$ of the interval $[a_j,\hat{1}]$ such that $a_k<z\le y$.
[/definition]
The recursive condition says that whenever an element $y$ lies above both an earlier atom and a later atom, the interval above the later atom contains an immediate step toward $y$ that is already witnessed by an earlier atom. This prevents later atoms from introducing lower-rank obstructions that were invisible from the earlier part of the ordering. The theorem below explains why this atom-ordering data is enough to build the rooted lexicographic labels required above.
[quotetheorem:8129]
[citeproof:8129]
The base case in the definition is necessary so the recursion terminates on rank-one intervals; without it the phrase "recursive atom ordering" would not define anything. The second condition prevents a later atom $a_j$ from being the first route into an element $y$ already reachable from an earlier atom: it forces a cover above $a_j$ that points back to an earlier witness. That is the control needed for lexicographic firstness in the induced CL-labeling. The theorem does not say that recursive atom orderings are necessary for CL-shellability, only that they are a convenient inductive certificate, so the examples below should be read as verification tools rather than classifications.
The three languages form a practical hierarchy: EL-labelings are the most concrete, CL-labelings allow dependence on history, and recursive atom orderings are often the most convenient inductive certificate. We next compare examples where each viewpoint gives a usable shellability proof.
[example: Partition Lattice]
In the partition lattice $\Pi_n$, ordered by refinement, a cover relation merges exactly two blocks. If the two blocks being merged have least elements $i<j$, label that cover by $j$. We show that this labeling is an EL-labeling by checking each interval $[\sigma,\tau]$.
Fix a block $T$ of $\tau$. The blocks of $\sigma$ contained in $T$ have distinct least elements; write them as
\begin{align*}
m_1(T)<m_2(T)<\cdots <m_{s(T)}(T).
\end{align*}
To move from $\sigma$ to $\tau$, all blocks of $\sigma$ inside the same $T$ must be merged into one block, and blocks lying in different blocks of $\tau$ are never merged with each other. Thus every maximal chain in $[\sigma,\tau]$ has exactly
\begin{align*}
\sum_{T\in\tau}(s(T)-1)
\end{align*}
cover steps.
For each block $T$ of $\tau$, the least element $m_1(T)$ remains the least element of the final merged block over $T$. Each other minimum $m_a(T)$ with $a\ge 2$ disappears at the unique step where its current block is merged into a block with smaller least element, and that step has label $m_a(T)$. Therefore every maximal chain in $[\sigma,\tau]$ has the same set of labels,
\begin{align*}
\{m_a(T):T\in\tau,\ 2\le a\le s(T)\}.
\end{align*}
Construct a chain by reading this set of labels in increasing order. When the next label is $m_a(T)$, merge the current block with least element $m_a(T)$ into the current block containing $m_1(T)$. This is always a legal cover relation: before this step, the block with least element $m_a(T)$ has not yet disappeared, while the block containing $m_1(T)$ lies in the same target block $T$ of $\tau$. The label of this merge is $m_a(T)$, because
\begin{align*}
m_1(T)<m_a(T).
\end{align*}
Hence the resulting label word is the increasing list of the fixed label set, so the chain is rising.
Now let any rising maximal chain in $[\sigma,\tau]$ be given. Its labels are the same fixed distinct labels listed above. Since the chain is rising, its label word must be those labels in increasing order. At the step labeled $m_a(T)$, the block with least element $m_a(T)$ must merge with a block whose least element is smaller. If it merged with a block not containing $m_1(T)$, then the new block would still have least element greater than $m_1(T)$, and it would later have to merge into a block with smaller least element; that later merge would again have label equal to the least element of the larger block, contradicting the fact that the label $m_a(T)$ occurs only once. Thus the step labeled $m_a(T)$ is forced to merge into the component containing $m_1(T)$. So the rising chain is unique.
Finally, any other maximal chain has the same set of distinct labels but not in increasing order. Let $q$ be the first position where its label word differs from the increasing label word. At that position, the increasing word uses the smallest unused label, while the other chain uses a larger unused label. Therefore the unique rising chain is lexicographically first in $[\sigma,\tau]$.
Thus this block-minimum labeling is an EL-labeling of $\Pi_n$. By the *EL-Shellability Theorem*, the proper part of $\Pi_n$ is shellable, and the later Möbius computation can count falling chains by counting decreasing sequences of these block-minimum labels.
[/example]
The partition lattice shows that EL-labelings can handle non-Boolean lattices with many chains. A small subgroup lattice shows the opposite extreme, where the recursive atom ordering is almost immediate and shellability can also be seen directly.
[example: Small Subgroup Lattice]
View $G=C_p\times C_p$ as the two-dimensional vector space $\mathbb F_p^2$ under addition. Its subgroups are its $\mathbb F_p$-subspaces: the zero subspace $\{e\}$, the one-dimensional subspaces, and the whole space $G$.
The atoms of the subgroup lattice are exactly the one-dimensional subspaces. There are $p^2-1$ nonzero vectors in $\mathbb F_p^2$, and each one-dimensional subspace contains $p-1$ nonzero vectors. Since distinct one-dimensional subspaces meet only in $\{0\}$, the number of atoms is
\begin{align*}
\frac{p^2-1}{p-1}=\frac{(p-1)(p+1)}{p-1}=p+1.
\end{align*}
Every maximal chain has the form
\begin{align*}
\{e\}<L<G
\end{align*}
where $L$ is a one-dimensional subspace. Hence every maximal chain has length $2$. Now order the atoms arbitrarily as
\begin{align*}
L_1,L_2,\dots,L_{p+1}.
\end{align*}
For each atom $L_i$, the interval $[L_i,G]$ has rank $1$, so its recursive atom-ordering condition is automatic. If $i<j$ and an element $y$ satisfies $L_i<y$ and $L_j<y$, then $L_i$ and $L_j$ are distinct one-dimensional subspaces of a two-dimensional space, so the only possibility is
\begin{align*}
y=G.
\end{align*}
Taking $k=i$ and $z=G$, we have $k<j$, the element $z$ is the unique atom of the interval $[L_j,G]$, and
\begin{align*}
L_k=L_i< G=z\le y.
\end{align*}
Thus every ordering of the $p+1$ atoms is a recursive atom ordering.
The proper part of the lattice is just the set
\begin{align*}
\{L_1,\dots,L_{p+1}\}.
\end{align*}
Its order complex has one vertex for each $L_i$ and no higher-dimensional faces, because no two distinct atoms are comparable. Therefore the order complex is a discrete set of $p+1$ vertices. Its facets are the singleton sets $\{L_i\}$, all of cardinality $1$, and any ordering of these singleton facets is a shelling order: for distinct facets $\{L_i\}$ and $\{L_k\}$, their intersection is $\varnothing$, whose cardinality is $0=d-1$ when $d=1$. So shellability is visible directly, and the recursive atom ordering gives the same conclusion through the general CL-shellability mechanism.
[/example]
These elementary examples are meant to make the mechanism calculable. The same mechanism appears in deeper algebraic combinatorics, where the labels often come from structure that is not part of this course.
[remark: Bruhat Order]
Bruhat order on a Coxeter group is a central motivating example: its intervals admit EL-labelings through reflection orders. The reflection order arranges the reflections so that each Bruhat interval has a unique rising saturated chain, and the consequence fits the present framework: Bruhat intervals have shellable order complexes and therefore tightly controlled topology.
[/remark]
## Homological and Enumerative Consequences
Shellability matters because it converts topology into bookkeeping. Once facets are attached in shelling order, each restriction face records whether a new top-dimensional homology class appears and how the $h$-vector changes. The first payoff is a strong homological vanishing property.
[definition: Cohen-Macaulay Simplicial Complex]
Let $k$ be a field. A pure simplicial complex $\Delta$ of dimension $d-1$ is Cohen-Macaulay over $k$ if for every face $F \in \Delta$, including $F=\varnothing$, the reduced homology of the link satisfies
\begin{align*}
\widetilde{H}_i(\operatorname{link}_{\Delta}(F);k)=0
\end{align*}
for all $i < d-1-|F|$.
[/definition]
This definition says that every local link has homology concentrated in top dimension. Shellability is expected to force that vanishing because its facet order controls how cells are attached, but the implication must also survive passage to links of arbitrary faces. The needed result is the homological payoff of shelling: controlled attachments in the complex and in its links rule out lower-dimensional reduced homology.
[quotetheorem:6511]
[citeproof:6511]
Purity is essential because Cohen-Macaulayness imposes local top-dimensional homology conditions on links of a fixed dimension; a complex with facets of different dimensions can fail those local conditions at faces lying in smaller facets. Shellability is also essential: the two triangles meeting only at a vertex are pure but not shellable, and the link of the shared vertex is disconnected in the wrong dimension. The theorem is one-way: Cohen-Macaulay complexes need not come with a shelling order, so the result gives a powerful sufficient condition rather than a characterization.
For posets, this theorem is usually applied to the order complex of the proper part. Combining it with EL-shellability raises the local question that matters for intervals: if every interval has the lexicographic shelling certificate, then every interval order complex should inherit Cohen-Macaulay vanishing.
The already quoted [citetheorem:6507] supplies shellings from labels, but Cohen-Macaulayness is a homological condition rather than a labeling condition. To finish the bridge, one uses [citetheorem:6511] interval by interval: shellability gives the required local homology vanishing, and the lexicographic certificate becomes the Cohen-Macaulay conclusion.
The interval formulation matters: Cohen-Macaulayness for a poset is not just a statement about the single proper-part order complex, but about all local open intervals. The EL hypothesis is stronger than the conclusion; a poset may have Cohen-Macaulay intervals without admitting the particular lexicographic certificate used here. This corollary is the bridge from labels to homology, and the next theorem adds the signed enumeration that recovers Mobius functions.
Homological vanishing is qualitative; Mobius functions ask for an exact signed count. Shellability supplies that count by identifying which facets create top-dimensional homology. In an EL-shelling, those special facets are detected by a label pattern rather than by computing homology directly, so the next theorem turns a topological invariant into a chain-counting rule.
[quotetheorem:8130]
[citeproof:8130]
The rank hypothesis fixes the dimension and therefore the sign: for an interval of rank $r>0$, the open interval has order complex of dimension $r-2$, whose reduced Euler characteristic contributes the displayed $(-1)^r$ convention for $\mu(x,y)$. The rank-zero case is separate because the open interval is empty as an interval but the Mobius value is $\mu(x,x)=1$, matching the single empty falling chain. The EL-labeling hypothesis cannot be replaced by an arbitrary labeling, since a labeling with repeated labels on $B_2$ has too many rising and falling chains and gives no valid Mobius count. The theorem does not count all maximal chains; it isolates exactly those shelling facets whose restriction face is the whole facet, which is why the examples below reduce Mobius computations to falling-chain enumeration.
This formula is useful because falling chains are often much easier to count than all chains with signs. The Boolean lattice illustrates how the shelling viewpoint recovers a familiar Mobius function from a single descending chain.
[example: Mobius Function of the Boolean Lattice]
For $B_n$ with the standard EL-labeling, fix an interval $[A,B]$ and write
\begin{align*}
B\setminus A=\{b_1,\dots,b_r\}
\end{align*}
with $b_1<\cdots<b_r$. The rank of $[A,B]$ is $r=|B\setminus A|$, since a maximal chain from $A$ to $B$ adds exactly one element of $B\setminus A$ at each cover step.
Every maximal chain in $[A,B]$ has the form
\begin{align*}
A\subset A\cup\{c_1\}\subset A\cup\{c_1,c_2\}\subset\cdots\subset A\cup\{c_1,\dots,c_r\}=B,
\end{align*}
where $(c_1,\dots,c_r)$ is a permutation of $(b_1,\dots,b_r)$. Under the standard labeling, the label word of this chain is
\begin{align*}
c_1c_2\cdots c_r.
\end{align*}
The chain is falling precisely when
\begin{align*}
c_1\ge c_2\ge\cdots\ge c_r.
\end{align*}
Because the elements $c_1,\dots,c_r$ are distinct, this weakly decreasing condition is equivalent to
\begin{align*}
c_1>c_2>\cdots>c_r.
\end{align*}
The only permutation of $\{b_1,\dots,b_r\}$ in decreasing order is
\begin{align*}
(b_r,b_{r-1},\dots,b_1).
\end{align*}
Thus there is exactly one falling maximal chain in $[A,B]$, namely the chain that adds the elements of $B\setminus A$ from largest to smallest.
Therefore $f(A,B)=1$. By *[Falling-Chain Formula for the Möbius Function](/theorems/8130)*,
\begin{align*}
\mu(A,B)=(-1)^r f(A,B)=(-1)^r\cdot 1=(-1)^r.
\end{align*}
Since $r=|B\setminus A|$, this gives
\begin{align*}
\mu(A,B)=(-1)^{|B\setminus A|}.
\end{align*}
So the Boolean-lattice Möbius function is recovered by counting the single falling chain in each interval.
[/example]
The partition lattice shows that the same formula can encode a factorial enumeration with real content. Here the falling chains are no longer unique, but they still have a clean combinatorial model.
[example: Partition Lattice Mobius Number]
For the partition lattice $\Pi_n$, use the standard EL-labeling in which a merge of two blocks with least elements $i<j$ is labeled by $j$. A maximal chain from the discrete partition $\hat{0}$ to the one-block partition $\hat{1}$ has exactly $n-1$ cover steps, because each cover merge reduces the number of blocks by $1$, starting from $n$ blocks and ending with $1$ block.
From such a maximal chain, build a graph on $\{1,\dots,n\}$ as follows. At a merge step, suppose the two blocks being merged have least elements $i<j$. Record the edge $i\to j$. Each step connects two previously separate components, so no recorded edge creates a cycle. After $n-1$ merge steps the partition has one block, so the graph is connected. A connected acyclic graph on $n$ vertices with $n-1$ edges is a tree, and every edge is oriented away from the component with smaller least element. Since the least element of the final block is $1$, this is a rooted tree with root $1$.
Now suppose the chain is falling. Its labels are weakly decreasing, so if an edge with child endpoint $j$ is recorded before an edge with child endpoint $\ell$, then
\begin{align*}
j\ge \ell.
\end{align*}
Consider an oriented path away from the root,
\begin{align*}
1=v_0\to v_1\to\cdots\to v_m.
\end{align*}
The edge into $v_{a+1}$ must be recorded before the edge into $v_a$, because the component rooted at $v_a$ must already contain $v_{a+1}$ before that whole component can be attached to its parent. Since the labels are weakly decreasing along the chain, this gives
\begin{align*}
v_{a+1}\ge v_a.
\end{align*}
The vertices on the path are distinct, so
\begin{align*}
v_{a+1}>v_a.
\end{align*}
Thus labels strictly increase along every path away from the root. Equivalently, every non-root vertex $j$ has a parent in $\{1,\dots,j-1\}$.
Conversely, start with a rooted increasing tree on $\{1,\dots,n\}$, meaning that every non-root vertex $j$ has a parent $p(j)$ with
\begin{align*}
p(j)<j.
\end{align*}
Process the edges in decreasing order of the child label $j$, and for each edge $p(j)\to j$, merge the current block containing $p(j)$ with the current block containing $j$. This is a legal cover merge: all descendants of $j$ have labels larger than $j$, so they have already been merged into the block containing $j$, while the edge from $p(j)$ to $j$ has not yet been processed, so the two blocks are still distinct. The least element of the child block is $j$, because every descendant of $j$ has label larger than $j$; the parent block has least element at most $p(j)<j$. Hence the merge label is exactly $j$. Since the edges are processed in decreasing order of $j$, the resulting maximal chain is falling.
These two constructions are inverse: a falling chain records the same parent edge by which each component first attaches to a smaller-minimum component, and processing the resulting increasing tree in decreasing child labels recreates those merges. Therefore falling maximal chains are counted by rooted increasing trees. Such a tree is specified by choosing, independently for each $j=2,\dots,n$, its parent
\begin{align*}
p(j)\in\{1,\dots,j-1\}.
\end{align*}
There are $j-1$ choices for $p(j)$, so the number of rooted increasing trees is
\begin{align*}
\prod_{j=2}^{n}(j-1)=1\cdot 2\cdot 3\cdots(n-1)=(n-1)!.
\end{align*}
Thus the number of falling maximal chains in $[\hat{0},\hat{1}]$ is
\begin{align*}
f(\hat{0},\hat{1})=(n-1)!.
\end{align*}
The rank of $[\hat{0},\hat{1}]$ is $n-1$, since a maximal chain has $n-1$ cover merges. By *Falling-Chain Formula for the Möbius Function*,
\begin{align*}
\mu_{\Pi_n}(\hat{0},\hat{1})=(-1)^{n-1}f(\hat{0},\hat{1})=(-1)^{n-1}(n-1)!.
\end{align*}
The sign comes from the rank, and the factorial comes from the independent parent choices in the increasing tree.
[/example]
Mobius functions use the top-dimensional part of the shelling, but the same restriction faces also record all face numbers. To state this compactly, we repackage the $f$-vector into the $h$-vector.
[definition: H-Vector]
Let $\Delta$ be a pure simplicial complex of dimension $d-1$, and let $f_{i-1}$ be the number of faces of cardinality $i$, with $f_{-1}=1$. The $h$-vector $(h_0,\dots,h_d)$ is defined by
\begin{align*}
\sum_{i=0}^{d} f_{i-1}(q-1)^{d-i}=\sum_{i=0}^{d} h_i q^{d-i}.
\end{align*}
[/definition]
The $h$-vector repackages the face numbers so that shellings become direct counts. Since each shelling interval is determined by the size of its restriction face, the natural next question is whether $h_i$ counts facets whose restriction face has size $i$.
[quotetheorem:6504]
[citeproof:6504]
Purity is needed because the displayed $h$-vector relation uses one common value of $d$ for all facets. Shellability is needed because the partition into Boolean intervals is what turns the algebraic change of variables from the $f$-vector into an actual count of facets. The pure complex made from two triangles meeting only in one vertex gives a concrete failure: it has $f_{-1}=1$, $f_0=5$, $f_1=6$, and $f_2=2$, so
\begin{align*}
(q-1)^3+5(q-1)^2+6(q-1)+2=q^3+2q^2-q.
\end{align*}
Its $h$-vector is $(1,2,-1,0)$, which cannot count restriction faces of a shelling. The theorem does not say that every nonnegative $h$-vector comes from a shellable complex, but it explains why shellable examples automatically have nonnegative $h_i$ and sets up the Eulerian-number computation for the Boolean lattice.
This theorem also shows why shellable complexes have nonnegative $h$-vectors. For the Boolean lattice, the relevant restriction-face sizes are classical descent numbers.
[example: H-Vector of the Boolean Order Complex]
Let $B_n$ be ordered by inclusion and let the standard EL-labeling label the cover
\begin{align*}
S\lessdot S\cup\{a\}
\end{align*}
by $a$. A maximal chain from $\hat{0}=\varnothing$ to $\hat{1}=\{1,\dots,n\}$ is determined by the order in which the elements are added. Thus a permutation $\pi=\pi_1\pi_2\cdots\pi_n\in S_n$ gives the chain
\begin{align*}
\varnothing=S_0\subset S_1\subset S_2\subset\cdots\subset S_n=\{1,\dots,n\},
\end{align*}
where
\begin{align*}
S_k=\{\pi_1,\dots,\pi_k\}.
\end{align*}
The corresponding facet of the proper-part order complex is
\begin{align*}
F_\pi=\{S_1,S_2,\dots,S_{n-1}\}.
\end{align*}
We show that the restriction face of $F_\pi$ consists exactly of those vertices $S_k$ for which $\pi_k>\pi_{k+1}$. If $\pi_k>\pi_{k+1}$, replace $S_k$ by
\begin{align*}
S'_k=S_{k-1}\cup\{\pi_{k+1}\}.
\end{align*}
Then
\begin{align*}
S_{k-1}\subset S'_k\subset S_{k+1},
\end{align*}
because $S_{k+1}=S_{k-1}\cup\{\pi_k,\pi_{k+1}\}$. This replacement swaps the adjacent labels $\pi_k,\pi_{k+1}$, so the new label word begins the same as $\pi$ up to position $k-1$ and has $\pi_{k+1}$ instead of $\pi_k$ at position $k$. Since $\pi_{k+1}<\pi_k$, the new chain is lexicographically earlier. It contains every vertex of $F_\pi$ except $S_k$, so $F_\pi\setminus\{S_k\}$ is already present when $F_\pi$ is added.
Conversely, suppose $\pi_k<\pi_{k+1}$. Any maximal chain containing all vertices of $F_\pi$ except possibly $S_k$ must pass from $S_{k-1}$ to $S_{k+1}$ through one of the two intermediate sets
\begin{align*}
S_{k-1}\cup\{\pi_k\}=S_k
\end{align*}
or
\begin{align*}
S_{k-1}\cup\{\pi_{k+1}\}.
\end{align*}
The second choice swaps the adjacent labels to put $\pi_{k+1}$ before $\pi_k$, which is lexicographically later because $\pi_k<\pi_{k+1}$. Hence no earlier facet contains $F_\pi\setminus\{S_k\}$. Therefore
\begin{align*}
R(F_\pi)=\{S_k:1\le k\le n-1\text{ and }\pi_k>\pi_{k+1}\}.
\end{align*}
So
\begin{align*}
|R(F_\pi)|=|\{k:1\le k\le n-1\text{ and }\pi_k>\pi_{k+1}\}|.
\end{align*}
By *Shelling Interpretation of the $h$-Vector*, $h_i$ counts the facets whose restriction face has cardinality $i$. Therefore
\begin{align*}
h_i=|\{\pi\in S_n:\pi\text{ has exactly }i\text{ descents}\}|.
\end{align*}
Thus the $h$-vector of the Boolean order complex is the Eulerian distribution, with each shelling restriction face recording the descent set of the corresponding permutation.
[/example]
Shellability therefore supplies a common explanation for three phenomena that appeared separately earlier in the course: Mobius cancellation, Cohen-Macaulay vanishing, and nonnegative $h$-vectors. The rest of the course uses this bridge repeatedly: once a poset has an EL-labeling, its order complex becomes topologically tame and its enumerative invariants become accessible through labeled chains.
# 10. Cohen-Macaulay Posets
Cohen-Macaulayness is the point where the enumerative theory of ranked posets meets the homological theory of simplicial complexes. Chapters 8 and 9 used order complexes and shellings to extract Möbius functions, Euler characteristics, and chain enumeration. This chapter asks when those complexes behave homologically like wedges of top-dimensional spheres, and how that condition survives the standard operations on posets.
The main route is through simplicial complexes, continuing the order-complex and shellability viewpoint of Chapters 8 and 9. We first state the Cohen-Macaulay condition over a field and use Reisner's criterion to make it local. Then we transfer the definition to posets by applying it to open intervals and order complexes. The preservation theorems explain why the class is large enough to include the geometric and partition lattices that occur throughout algebraic combinatorics.
## Cohen-Macaulay Simplicial Complexes and Reisner's Criterion
The guiding problem is to recognise when a simplicial complex has no homology below its top dimension, not only globally but also near every face. For order complexes this matters because local homology of links records the topology of intervals, and interval topology controls Mobius functions.
[definition: Simplicial Complex]
A simplicial complex $\Delta$ on a finite vertex set $V$ is a collection of subsets of $V$ such that if $F \in \Delta$ and $G \subset F$, then $G \in \Delta$. A face of $\Delta$ is an element $F \in \Delta$. A facet of $\Delta$ is a face maximal under inclusion. The dimension of a face $F$ is $|F|-1$, and the dimension of $\Delta$ is $\max\{|F|-1 : F \in \Delta\}$.
[/definition]
This gives the ambient object whose homology will be tested. Since lower-dimensional facets can create artificial local defects, the next issue is whether all maximal faces have the same dimension.
[definition: Pure Simplicial Complex]
A simplicial complex $\Delta$ is pure of dimension $d$ if every facet of $\Delta$ has dimension $d$.
[/definition]
Purity is a combinatorial first test, but it does not detect holes in smaller dimensions. To test homology near a chosen face rather than only in the whole complex, we need the complex of all faces compatible with that chosen face.
[definition: Link Of A Face]
Let $\Delta$ be a simplicial complex and let $F \in \Delta$. The link of $F$ in $\Delta$ is
\begin{align*}
\operatorname{lk}_{\Delta}(F)=\{G \in \Delta : G \cap F = \varnothing \text{ and } G \cup F \in \Delta\}.
\end{align*}
[/definition]
The link records all ways of extending a fixed face. A global homology condition would miss singular behaviour inside such links, so the local vanishing condition below is imposed on every face at once.
[definition: Cohen-Macaulay Simplicial Complex]
Let $k$ be a field. A finite simplicial complex $\Delta$ is Cohen-Macaulay over $k$ if for every face $F \in \Delta$, including $F=\varnothing$, the reduced homology satisfies
\begin{align*}
\widetilde{H}_i(\operatorname{lk}_{\Delta}(F);k)=0
\end{align*}
for all $i < \dim \operatorname{lk}_{\Delta}(F)$.
[/definition]
This definition is phrased topologically, while the name comes from commutative algebra. The comparison requires an algebraic object that remembers exactly which vertex sets are faces and which are forbidden. Encoding nonfaces as squarefree monomial relations creates a quotient ring whose depth can be tested against the link homology conditions.
[definition: Stanley-Reisner Ring]
Let $\Delta$ be a simplicial complex on vertex set $V=\{v_1,\dots,v_n\}$ and let $k$ be a field. The Stanley-Reisner ideal of $\Delta$ is
\begin{align*}
I_{\Delta}= (x_{i_1}\cdots x_{i_r} : \{v_{i_1},\dots,v_{i_r}\} \notin \Delta) \trianglelefteq k[x_1,\dots,x_n].
\end{align*}
The Stanley-Reisner ring of $\Delta$ is $k[\Delta]=k[x_1,\dots,x_n]/I_{\Delta}$. Here $k[x_1,\dots,x_n]$ is the polynomial ring in commuting variables $x_1,\dots,x_n$ over $k$, and $I_{\Delta}\trianglelefteq k[x_1,\dots,x_n]$ means that $I_{\Delta}$ is an ideal of that ring.
[/definition]
The ring construction supplies the commutative-algebraic side of the story, while the previous link condition supplies the topological side. The problem is that depth is not visible from the face list by inspection, whereas links are combinatorial objects that can be studied with shellings and homology. Reisner's criterion is the bridge: it identifies the algebraic Cohen-Macaulay property of $k[\Delta]$ with the vanishing conditions on all links.
[quotetheorem:6512]
The commutative-algebra background behind this result is not developed in this course, since it uses local cohomology and the depth of graded rings. Its role here is to justify a purely topological test for the algebraic property, but the theorem is deliberately local: checking only the homology of $\Delta$ is not enough, because singular behaviour can be hidden inside links. The field $k$ is also part of the data, since torsion in integral homology may disappear or survive after changing coefficients. This is why shellable examples are especially convenient later: shellability gives the required vanishing over every field, while Reisner's criterion explains exactly what must be checked in non-shellable cases. The next examples show how the test behaves in small dimensions.
[example: Boundary Of A Simplex]
Let the $m$-simplex have vertex set $V$ with $|V|=m+1$, so its boundary complex is
\begin{align*}
\Delta=\{A\subset V:A\ne V\}.
\end{align*}
If $F\in \Delta$, then $F\subsetneq V$, and a face $G$ lies in $\operatorname{lk}_{\Delta}(F)$ exactly when $G\cap F=\varnothing$ and $G\cup F\ne V$. Since $G\cap F=\varnothing$ means $G\subset V\setminus F$, this gives
\begin{align*}
\operatorname{lk}_{\Delta}(F)=\{G\subset V\setminus F:G\ne V\setminus F\}.
\end{align*}
Thus the link is the boundary complex of the simplex on $V\setminus F$. If $q=|V\setminus F|$, then this link has dimension $q-2$ when $q\ge 2$, and it is the complex $\{\varnothing\}$ of dimension $-1$ when $q=1$.
For $q\ge 2$, the boundary of a simplex on $q$ vertices is a triangulated sphere $S^{q-2}$, so
\begin{align*}
\widetilde{H}_i(\operatorname{lk}_{\Delta}(F);k)=0 \text{ for every } i<q-2.
\end{align*}
For $q=1$, the link has dimension $-1$, so there are no integers $i<-1$ to check. Therefore every face link has vanishing reduced homology below its own dimension, and the boundary complex of an $m$-simplex is Cohen-Macaulay over every field.
[/example]
The example shows why spheres are basic Cohen-Macaulay complexes. A contrasting example should be pure but disconnected, since disconnectedness produces reduced homology below the top degree.
[example: Disconnected Pure Graph]
Let the two edges have vertex sets $\{a,b\}$ and $\{c,d\}$, so
\begin{align*}
\Delta=\{\varnothing,\{a\},\{b\},\{a,b\},\{c\},\{d\},\{c,d\}\}.
\end{align*}
The facets are exactly $\{a,b\}$ and $\{c,d\}$, and both have dimension $2-1=1$, so $\Delta$ is pure of dimension $1$.
The two edges are separate path components. Hence $H_0(\Delta;k)\cong k^2$, with one copy of $k$ for the component containing $\{a,b\}$ and one copy for the component containing $\{c,d\}$. The reduced group is the kernel of the augmentation map
\begin{align*}
\epsilon:k^2\to k,\qquad \epsilon(u,v)=u+v.
\end{align*}
Since $(1,-1)\in \ker(\epsilon)$ and $(1,-1)\ne 0$, we have $\widetilde{H}_0(\Delta;k)\ne 0$.
For the empty face,
\begin{align*}
\operatorname{lk}_{\Delta}(\varnothing)=\{G\in \Delta:G\cap \varnothing=\varnothing \text{ and } G\cup \varnothing\in \Delta\}=\Delta.
\end{align*}
Thus the empty-face link has dimension $1$, but its reduced homology in degree $0<1$ is nonzero. Therefore the local vanishing condition in *Reisner criterion* fails, so this pure graph is not Cohen-Macaulay over $k$.
[/example]
This failure explains why purity alone is too weak. Cohen-Macaulayness forces connectedness in dimension at least $1$, and it imposes analogous connectivity conditions on every link.
## Posets, Intervals, and Order Complexes
The next question is how a property of simplicial complexes should be assigned to a poset. Since the topology of a poset is read from its chains, the first construction turns chains into faces.
[definition: Order Complex]
Let $P$ be a finite poset. The order complex $\Delta(P)$ is the simplicial complex whose vertices are the elements of $P$ and whose faces are the finite chains $x_0 < x_1 < \cdots < x_r$ in $P$.
[/definition]
The order complex translates a poset into topology, but bounded endpoints make the whole complex a cone and erase the topology we want. This motivates deleting the endpoints and focusing on the open interval between two comparable elements.
[definition: Open Interval In A Poset]
Let $P$ be a poset and let $x<y$ in $P$. The open interval from $x$ to $y$ is
\begin{align*}
(x,y)=\{z \in P : x<z<y\}.
\end{align*}
[/definition]
The order complex of $(x,y)$ has facets given by maximal chains strictly between $x$ and $y$. A single global order complex can miss local failures: an interval may have disconnected or low-dimensional link behaviour even when the whole poset looks well behaved. The problem is to define a poset-level condition that requires the Cohen-Macaulay test on every interval complex, not just on the proper part once.
[definition: Cohen-Macaulay Poset]
Let $P$ be a finite graded poset and let $k$ be a field. The poset $P$ is Cohen-Macaulay over $k$ if for every interval $[x,y] \subset P$, the order complex $\Delta((x,y))$ is Cohen-Macaulay over $k$.
[/definition]
This interval definition is local by design. Small rank intervals require a convention for the empty open interval, so covers should not be treated as pathologies.
[remark: Endpoint Conventions]
If $y$ covers $x$, then $(x,y)=\varnothing$ and $\Delta((x,y))$ has dimension $-1$ under the usual convention. This interval causes no obstruction to Cohen-Macaulayness.
[/remark]
The convention keeps atoms and covers compatible with the general theory, but it also raises the practical question of what must be checked in an interval of each rank difference. The useful form is the uniform vanishing criterion from [citetheorem:6512], with the top dimension determined by rank. That formulation turns the definition into the test used in applications.
This tells us that Cohen-Macaulayness is a strengthened form of homological regularity for all intervals, not just for the whole poset. The graded hypothesis is doing real work: without a common rank difference, there is no single top dimension in which the interval homology should concentrate. The statement does not say that the top homology is nonzero; contractible Cohen-Macaulay intervals can have all reduced homology vanish. The main combinatorial method for proving the condition is the already quoted [citetheorem:6511], because shellings give a controlled way to attach facets and then repeat the argument inside links.
This theorem is the bridge from combinatorial constructions to Cohen-Macaulayness. Purity is essential in the statement, because a non-pure shelling-style attachment order can still leave links with facets of different dimensions, which violates the local uniformity required by Reisner's criterion. Shellability is stronger than Cohen-Macaulayness, so the theorem is not reversible: there are Cohen-Macaulay complexes that are not shellable. In practice we often prove EL-shellability or CL-shellability and then invoke this result rather than compute homology directly, especially for lattices where interval shellings are easier to construct than homology bases.
[example: Geometric Lattices]
Let $L$ be a finite geometric lattice, for example the lattice of flats of a matroid ordered by inclusion. The standard EL-labelling of a geometric lattice restricts to an EL-labelling on every open interval, so each open interval order complex is shellable. By *Shellability Implies Cohen-Macaulayness*, each $\Delta((x,y))$ is Cohen-Macaulay over every field, and therefore $L$ is Cohen-Macaulay over every field.
For the Boolean lattice $B_n$, the open interval between the bottom and top elements is
\begin{align*}
(\varnothing,[n])=\{A\subseteq [n]:\varnothing\subsetneq A\subsetneq [n]\}.
\end{align*}
A face of $\Delta((\varnothing,[n]))$ is a chain
\begin{align*}
A_0\subsetneq A_1\subsetneq \cdots \subsetneq A_r
\end{align*}
of nonempty proper subsets of $[n]$. These chains are exactly the faces in the barycentric subdivision of the boundary of the simplex with vertex set $[n]$, because the nonempty proper faces of that simplex are precisely the nonempty proper subsets of $[n]$. The boundary of an $(n-1)$-simplex is homeomorphic to $S^{n-2}$, and barycentric subdivision preserves homeomorphism type, so $\Delta((\varnothing,[n]))$ is homeomorphic to $S^{n-2}$.
[/example]
Geometric lattices therefore supply the main family of Cohen-Macaulay lattices arising from matroids. The same mechanism applies to face lattices because polytopal boundaries admit shellings.
[example: Face Lattices Of Shellable Polytopes]
Let $Q$ be a convex polytope and let $L(Q)$ be its face lattice, with $\varnothing$ as the bottom element and $Q$ as the top element. To show that $L(Q)$ is Cohen-Macaulay over every field, it is enough to check every interval $[F,G]\subseteq L(Q)$.
Fix $F<G$ in $L(Q)$. The elements of the open interval are exactly the faces $H$ satisfying
\begin{align*}
F\subsetneq H\subsetneq G.
\end{align*}
If $G$ covers $F$, then $(F,G)=\varnothing$, so its order complex has dimension $-1$ and gives no obstruction. Otherwise, the interval $[F,G]$ is the face lattice of the face figure of $G$ along $F$: faces strictly between $F$ and $G$ correspond to the nonempty proper faces of that face figure, ordered by inclusion. Hence $\Delta((F,G))$ is the barycentric subdivision of the boundary complex of that face figure.
The face figure is again a convex polytope. By the Bruggesser-Mani shelling theorem, its boundary complex is shellable, and barycentric subdivision preserves shellability. Therefore $\Delta((F,G))$ is a pure shellable simplicial complex. By *Shellability Implies Cohen-Macaulayness*, $\Delta((F,G))$ is Cohen-Macaulay over every field. Since this holds for every interval $[F,G]$, the face lattice $L(Q)$ is Cohen-Macaulay over every field.
[/example]
These examples show that Cohen-Macaulay posets include many posets with strong geometric meaning. The remaining sections explain why the class is stable under operations used to build new posets from old ones.
## Rank Selection
A recurring operation in enumerative combinatorics is to keep only selected ranks of a graded poset. The problem is that deleting ranks can damage connectivity and purity, so we need a theorem rather than a definition alone.
[definition: Rank-Selected Subposet]
Let $P$ be a finite graded poset with rank function $\rho:P\to \{0,1,\dots,n\}$, and let $S \subset \{0,1,\dots,n\}$. The rank-selected subposet is
\begin{align*}
P_S=\{x\in P : \rho(x)\in S\},
\end{align*}
with the order inherited from $P$.
[/definition]
Rank selection is central because many flag-enumerative invariants count chains whose ranks lie in a fixed set $S$. Deleting ranks can easily damage topology: intervals may lose connecting chains, facets may change dimension, and local links may acquire new lower homology. The theorem addresses exactly this obstruction for Cohen-Macaulay posets by showing that the local vanishing condition is stable under rank selection.
[quotetheorem:8131]
[citeproof:8131]
The theorem turns a local topological condition into a source of flag-enumerative regularity. The Cohen-Macaulay hypothesis is essential: deleting ranks from an arbitrary graded poset can disconnect an interval or create lower-dimensional facets, so the conclusion is not a purely formal property of rank selection. The theorem also does not preserve the original dimension or the original top homology; it says that the selected poset is Cohen-Macaulay relative to its own ranks. This is precisely what is needed for flag $h$-vectors, because those invariants count chains with prescribed ranks rather than chains of every rank. It also explains why rank-selected Boolean and partition lattices retain good homological behaviour.
[example: Rank-Selected Boolean Lattices]
Let $B_n$ be the Boolean lattice of all subsets of $[n]$, ranked by cardinality:
\begin{align*}
\rho(A)=|A|.
\end{align*}
For $S\subset \{1,\dots,n-1\}$, the rank-selected subposet is therefore
\begin{align*}
(B_n)_S=\{A\subseteq [n]: |A|\in S\}.
\end{align*}
The Boolean lattice $B_n$ is a geometric lattice, so it is Cohen-Macaulay over every field. Applying *[Baclawski Rank-Selection Theorem](/theorems/8131)* to $B_n$ gives that $(B_n)_S$ is Cohen-Macaulay over the same field.
In the special case $S=\{r\}$, every element of $(B_n)_S$ has cardinality $r$. If $A,B\in (B_n)_S$ and $A\subsetneq B$, then $|A|<|B|$, contradicting $|A|=|B|=r$. Thus no two distinct elements are comparable, so $(B_n)_{\{r\}}$ is an antichain. When $S$ has several ranks, a face of the order complex is exactly a chain
\begin{align*}
A_0\subsetneq A_1\subsetneq \cdots \subsetneq A_t
\end{align*}
with $|A_j|\in S$ for every $j$. Thus rank selection keeps precisely those Boolean-lattice chains whose subset sizes lie in the prescribed set $S$, and Baclawski's theorem says that this restriction preserves Cohen-Macaulayness.
[/example]
The single-rank case illustrates that Cohen-Macaulayness adapts to the dimension of the selected poset rather than preserving the original dimension. More structured examples come from partition lattices, where ranks record successive mergers of blocks.
[example: Rank-Selected Partition Lattices]
Let $\Pi_n$ be ordered by refinement, so $\pi\le \sigma$ means that every block of $\pi$ is contained in a block of $\sigma$. Its rank function is
\begin{align*}
\rho(\pi)=n-\#\text{blocks}(\pi).
\end{align*}
Thus a partition of rank $s$ has exactly $n-s$ blocks, because $\rho(\pi)=s$ is equivalent to $\#\text{blocks}(\pi)=n-s$.
The partition lattice $\Pi_n$ is geometric, so it is Cohen-Macaulay over every field by the standard EL-shellability of geometric lattices and *Shellability Implies Cohen-Macaulayness*. For $S\subset\{1,\dots,n-2\}$, the rank-selected subposet is
\begin{align*}
(\Pi_n)_S=\{\pi\in \Pi_n:\rho(\pi)\in S\}=\{\pi\in \Pi_n:n-\#\text{blocks}(\pi)\in S\}.
\end{align*}
Equivalently,
\begin{align*}
(\Pi_n)_S=\{\pi\in \Pi_n:\#\text{blocks}(\pi)\in \{n-s:s\in S\}\}.
\end{align*}
Since $\Pi_n$ is Cohen-Macaulay over every field, *Baclawski Rank-Selection Theorem* gives that $(\Pi_n)_S$ is Cohen-Macaulay over the same field.
A face of the order complex $\Delta((\Pi_n)_S)$ is therefore a chain
\begin{align*}
\pi_0<\pi_1<\cdots<\pi_t
\end{align*}
such that $n-\#\text{blocks}(\pi_j)\in S$ for every $j$. Moving upward in the partition lattice merges blocks, so the rank value $n-\#\text{blocks}(\pi_j)$ records how many block mergers have occurred from the discrete partition. Thus rank selection keeps exactly the chains whose prescribed merger counts lie in $S$, and Cohen-Macaulayness gives homological control over those selected chains.
[/example]
Rank selection is especially useful when studying flag $h$-vectors and descent statistics, because the selected ranks encode which descent sets or chain types are being counted.
## Products and Other Preservation Theorems
The final question is whether Cohen-Macaulayness behaves well under standard poset constructions. Product constructions are unavoidable: intervals in products split into products of intervals, and order complexes interact with joins.
[definition: Product Of Posets]
Let $P$ and $Q$ be posets. Their product $P\times Q$ is the poset on pairs $(p,q)$ with
\begin{align*}
(p,q)\le (p',q') \iff p\le p' \text{ in } P \text{ and } q\le q' \text{ in } Q.
\end{align*}
[/definition]
The definition is combinatorial, but the preservation statement is topological. Product intervals decompose into coordinate intervals, and their order complexes are controlled by joins, whose homology is governed by the [Kunneth formula](/theorems/3933).
[quotetheorem:8132]
[citeproof:8132]
The product theorem is often applied together with rank selection. Both hypotheses are needed: if one factor has an interval with low-dimensional homology, the corresponding product interval retains that local defect inside its links. The theorem also does not identify the product interval complex with a plain join of the two interval complexes; the correct topology passes through the product triangulation and its links. For instance, multigraded Boolean-type posets can be built as products and then restricted to selected total ranks.
[example: Products Of Boolean Lattices]
Let $B_a$ be the Boolean lattice of subsets of $[a]$ and let $B_b$ be the Boolean lattice of subsets of $[b]$, with $B_a\times B_b$ ordered componentwise. Define
\begin{align*}
\Phi:B_a\times B_b\to B_{a+b},\qquad \Phi(A,B)=A\cup \{a+j:j\in B\}.
\end{align*}
The two sets in the union are disjoint, because $A\subseteq \{1,\dots,a\}$ while $\{a+j:j\in B\}\subseteq \{a+1,\dots,a+b\}$. Hence $\Phi(A,B)$ is a subset of $[a+b]$.
The map is bijective. Given $C\subseteq [a+b]$, set
\begin{align*}
A=C\cap [a].
\end{align*}
Set also
\begin{align*}
B=\{j\in [b]:a+j\in C\}.
\end{align*}
Then $A\subseteq [a]$, $B\subseteq [b]$, and
\begin{align*}
\Phi(A,B)=(C\cap [a])\cup (C\cap \{a+1,\dots,a+b\})=C.
\end{align*}
Thus every $C$ has a preimage, and the formulas for $A$ and $B$ show that this preimage is unique.
The map also preserves and reflects order. If $(A,B)\le (A',B')$, then $A\subseteq A'$ and $B\subseteq B'$, so
\begin{align*}
A\cup \{a+j:j\in B\}\subseteq A'\cup \{a+j:j\in B'\}.
\end{align*}
Conversely, if $\Phi(A,B)\subseteq \Phi(A',B')$, then intersecting both sides with $[a]$ gives $A\subseteq A'$, and intersecting both sides with $\{a+1,\dots,a+b\}$ gives $\{a+j:j\in B\}\subseteq \{a+j:j\in B'\}$, hence $B\subseteq B'$. Therefore $\Phi$ is a poset isomorphism $B_a\times B_b\cong B_{a+b}$.
Since Boolean lattices are geometric lattices, $B_{a+b}$ is Cohen-Macaulay over every field. Equivalently, because $B_a$ and $B_b$ are Cohen-Macaulay over every field, *Products Preserve Cohen-Macaulayness* gives directly that $B_a\times B_b$ is Cohen-Macaulay over every field; this second argument applies even when the two factors are unrelated Cohen-Macaulay posets.
[/example]
Products build new examples, but substructures and opposite orders appear just as often in calculations. The final preservation theorem is needed to justify passing to an interval or reversing the order without rechecking all homology groups.
[quotetheorem:8133]
[citeproof:8133]
These closure results make Cohen-Macaulayness usable in constructions rather than only in isolated examples. The interval part is almost built into the definition, but it is important because many arguments pass to a local interval without changing the ambient notation. The dual part uses the fact that order complexes do not remember the direction in which a chain is read; this would fail for properties depending on labelled edge directions rather than on the underlying chains. Together with the product and rank-selection theorems, these results explain why shellability remains the most efficient proof method: once a poset is shellable, many standard constructions preserve a large part of the resulting Cohen-Macaulay structure.
[remark: Dependence On The Field]
Cohen-Macaulayness may depend on the field $k$, because Reisner's criterion uses homology with coefficients in $k$. Shellable complexes avoid this dependence, since the shelling argument gives the required vanishing over every field.
[/remark]
The field dependence matters most for non-shellable examples with torsion in link homology. In the standard families of this course, geometric lattices and shellable face lattices are Cohen-Macaulay over every field, so the coefficient field is usually suppressed after being fixed.
# 11. $P$-Partitions and Order Polynomials
After the topological focus of Chapters 8 through 10, $P$-partitions return to enumeration by turning the qualitative data of a finite poset into enumerative functions and generating functions. The guiding question is: how many ways are there to assign positive integers to the elements of a poset while respecting the order, and how does this count change when equalities are allowed or forbidden? This chapter connects order-preserving maps, linear extensions, descents of permutations, and reciprocity for order polynomials.
## Order-Preserving Maps and Labeled Posets
A poset gives only partial information: if two elements are incomparable, their assigned values may occur in either order, while comparable elements must respect the order. The first concrete difficulty is to count assignments without accidentally treating incomparable elements as if they were ordered. Order-preserving maps solve this by counting maps into a finite chain that obey exactly the comparability constraints and no others.
[definition: Order-Preserving Map]
Let $P$ be a finite poset and let $[m]=\{1,\dots,m\}$ with its usual total order. An order-preserving map from $P$ to $[m]$ is a function $f:P\to [m]$ such that $x\le_P y$ implies $f(x)\le f(y)$.
[/definition]
The condition says that comparable elements may be tied, but they may not be reversed. This is the weak version of order enumeration, and it is already rich enough to detect the shape of $P$ through the way chains and antichains constrain the available choices.
[example: Chains and Antichains]
For the chain $P=C_n$ with $x_1<\cdots <x_n$, an order-preserving map $f:C_n\to [m]$ is exactly a choice of integers
\begin{align*}
1\le f(x_1)\le \cdots \le f(x_n)\le m.
\end{align*}
Set $a_i=f(x_i)$. To count such weakly increasing sequences, define
\begin{align*}
b_1=a_1,\quad b_2=a_2-a_1,\quad \ldots,\quad b_n=a_n-a_{n-1},\quad b_{n+1}=m+1-a_n.
\end{align*}
Then $b_1\ge 1$, $b_{n+1}\ge 1$, and $b_2,\ldots,b_n\ge 0$, while
\begin{align*}
b_1+b_2+\cdots+b_n+b_{n+1}=m+1.
\end{align*}
After setting $c_1=b_1-1$, $c_{n+1}=b_{n+1}-1$, and $c_i=b_i$ for $2\le i\le n$, this is equivalent to
\begin{align*}
c_1+c_2+\cdots+c_{n+1}=m-1
\end{align*}
with all $c_i\ge 0$. The number of nonnegative solutions is the number of placements of $n$ dividers among $m+n-1$ positions, hence
\begin{align*}
\Omega_{C_n}(m)=\binom{m+n-1}{n}.
\end{align*}
For the antichain $P=A_n$, no two distinct elements are comparable, so the order-preserving condition imposes no inequalities. Each of the $n$ elements may be assigned any of the $m$ values in $[m]$, independently, giving
\begin{align*}
\Omega_{A_n}(m)=\underbrace{m\cdot m\cdots m}_{n\text{ factors}}=m^n.
\end{align*}
Thus chains force sorted choices, while antichains leave all choices independent.
[/example]
The chain and antichain show the two extremes: total comparability forces a sorted sequence, while no comparability gives independent choices. The next refinement asks what changes when comparable elements are not allowed to share a value; this separates weak order preservation from strict order preservation.
[definition: Strict Order-Preserving Map]
Let $P$ be a finite poset. A strict order-preserving map from $P$ to $[m]$ is a function $f:P\to [m]$ such that $x<_P y$ implies $f(x)<f(y)$.
[/definition]
Strict maps exist only when the target chain is long enough to accommodate every chain in $P$. Comparing the strict and weak versions on the same basic examples gives the first hint that the two counts are related by binomial reciprocity rather than by a separate enumeration problem.
[example: Strict Maps on a Chain]
For the chain $C_n$ with $x_1<\cdots <x_n$, a strict order-preserving map $f:C_n\to [m]$ is exactly a choice of integers
\begin{align*}
1\le f(x_1)<\cdots <f(x_n)\le m.
\end{align*}
Set $a_i=f(x_i)$. The strict inequalities imply that the image set is
\begin{align*}
\{a_1,\dots,a_n\}\subseteq [m]
\end{align*}
with $n$ distinct elements. Conversely, if $S=\{s_1,\dots,s_n\}\subseteq [m]$ with $s_1<\cdots<s_n$, then defining $f(x_i)=s_i$ gives
\begin{align*}
f(x_1)=s_1< s_2=f(x_2)<\cdots <s_n=f(x_n),
\end{align*}
so $f$ is strict order-preserving. These two constructions are inverse to each other, hence the number of strict maps is the number of $n$-element subsets of $[m]$:
\begin{align*}
\overline{\Omega}_{C_n}(m)=\binom{m}{n}.
\end{align*}
For the antichain $A_n$, there are no comparable distinct elements, so the condition $x<_P y\implies f(x)<f(y)$ imposes no restrictions. Each of the $n$ elements may be assigned any of the $m$ values independently, giving
\begin{align*}
\overline{\Omega}_{A_n}(m)=\underbrace{m\cdot m\cdots m}_{n\text{ factors}}=m^n.
\end{align*}
Strictness therefore changes the chain count from weakly increasing choices to subsets, but it leaves antichain enumeration unchanged.
[/example]
These examples use the poset order alone, so each comparable pair is treated in the same way. That is too coarse for enumerators connected to permutation descents: the chain $x<y<z$ would only produce weak sequences or only produce strict sequences, but it would not remember whether the word of labels is $123$ or $321$. To make descent data visible, we need to keep the partial order and a chosen total naming of the elements at the same time. P-partitions fix the missing information by adding this second ordering of the elements, so the next definition packages a poset together with a bijective labeling.
[definition: Labeled Poset]
A labeled poset is a pair $(P,\omega)$ where $P$ is a finite poset and $\omega:P\to \{1,\dots,|P|\}$ is a bijection.
[/definition]
The label order gives a total ordering that can disagree with the partial order. This disagreement is exactly what a $P$-partition measures: comparable pairs following the labels allow equality, while comparable pairs opposing the labels force a rise.
[definition: P-Partition]
Let $(P,\omega)$ be a labeled finite poset. A $P$-partition is a function $\sigma:P\to \mathbb N$ such that for all $x<_P y$, the following conditions hold:
\begin{align*}
\omega(x)<\omega(y) \implies \sigma(x)\le \sigma(y),
\end{align*}
and
\begin{align*}
\omega(x)>\omega(y) \implies \sigma(x)<\sigma(y).
\end{align*}
[/definition]
This convention uses positive integers, so finite restrictions such as $\sigma(x)\le m$ are needed when we want finite counts. The special case in which the labeling never disagrees with the poset order recovers ordinary weak order-preserving maps; this motivates naming such labelings separately before comparing labeled and unlabeled enumeration.
[definition: Natural Labeling]
Let $P$ be a finite poset. A labeling $\omega:P\to \{1,\dots,|P|\}$ is natural if $x<_P y$ implies $\omega(x)<\omega(y)$.
[/definition]
For a natural labeling, $P$-partitions are exactly weak order-preserving maps from $P$ to $\mathbb N$. Other labelings record where strict inequalities must be placed, and the same underlying poset can therefore have several different bounded $P$-partition enumerators.
[example: A Three-Element V]
Let $P$ have elements $a,b,c$ with $a<c$ and $b<c$, while $a$ and $b$ are incomparable. With the natural labeling $\omega(a)=1$, $\omega(b)=2$, and $\omega(c)=3$, the two comparable relations have increasing labels:
\begin{align*}
\omega(a)=1<3=\omega(c)
\end{align*}
\begin{align*}
\omega(b)=2<3=\omega(c).
\end{align*}
By the definition of a $P$-partition, these two comparisons impose
\begin{align*}
\sigma(a)\le \sigma(c)
\end{align*}
\begin{align*}
\sigma(b)\le \sigma(c).
\end{align*}
There is no condition comparing $\sigma(a)$ and $\sigma(b)$, because $a$ and $b$ are incomparable.
If we count bounded $P$-partitions with values in $[m]$, then choosing $\sigma(c)=t$ leaves $\sigma(a)$ and $\sigma(b)$ independently chosen from $\{1,\dots,t\}$. Hence the natural labeling gives
\begin{align*}
\sum_{t=1}^m t\cdot t=\sum_{t=1}^m t^2
\end{align*}
bounded $P$-partitions.
Now change only the labeling, taking $\omega(c)=1$, $\omega(a)=2$, and $\omega(b)=3$. The same two poset relations now have decreasing labels:
\begin{align*}
\omega(a)=2>1=\omega(c)
\end{align*}
\begin{align*}
\omega(b)=3>1=\omega(c).
\end{align*}
Therefore the $P$-partition conditions become
\begin{align*}
\sigma(a)<\sigma(c)
\end{align*}
\begin{align*}
\sigma(b)<\sigma(c).
\end{align*}
If $\sigma(c)=t$, then $\sigma(a)$ and $\sigma(b)$ are independently chosen from $\{1,\dots,t-1\}$, so the bounded count is
\begin{align*}
\sum_{t=1}^m (t-1)(t-1)=\sum_{t=1}^m (t-1)^2.
\end{align*}
Thus the underlying poset is unchanged, but the labeling changes weak inequalities into strict inequalities and therefore changes the finite bounded enumeration.
[/example]
## Order Polynomials and Reciprocity
Once order-preserving maps have been isolated, the central enumerative question is whether their number as a function of $m$ is actually polynomial. The answer is yes, and the strict version is controlled by a reciprocity law.
[definition: Order Polynomial]
Let $P$ be a finite poset. The order polynomial is the function
\begin{align*}
\Omega_P:\mathbb Z_{>0}\to \mathbb Z_{\ge 0}
\end{align*}
defined by letting $\Omega_P(m)$ be the number of order-preserving maps $f:P\to [m]$.
[/definition]
The notation would be misleading if the count were only an arbitrary function of the positive integer $m$. The main question is why weakly order-preserving maps to a chain should vary polynomially with $m$ at all. Linear extensions provide the mechanism: once the elements are listed compatibly with the order, the map is counted by weakly increasing sequences with prescribed inequalities.
[quotetheorem:8134]
[citeproof:8134]
The finiteness of $P$ is essential. For the infinite antichain $A=\{a_1,a_2,\dots\}$, every function $A\to [m]$ is order-preserving, so for $m\ge 2$ there are infinitely many such maps rather than a finite value of a finite-degree polynomial. For the infinite chain $\mathbb N$ with its usual order, the set of weakly increasing maps $\mathbb N\to [m]$ is infinite as soon as $m\ge 2$, since the jump positions between levels may be chosen in infinitely many ways. The target $[m]$ being a chain is also part of the statement; maps into a general finite poset need not be controlled by the same linear-extension binomial expansion. The theorem does not yet count strict maps or explain negative evaluations of the polynomial. It tells us that weak order enumeration is polynomial, and its proof points forward to the strict theory because the same linear extensions record where inequalities may be forced.
[definition: Strict Order Polynomial]
Let $P$ be a finite poset. The strict order polynomial is the function
\begin{align*}
\overline{\Omega}_P:\mathbb Z_{>0}\to \mathbb Z_{\ge 0}
\end{align*}
defined by letting $\overline{\Omega}_P(m)$ be the number of strict order-preserving maps $f:P\to [m]$.
[/definition]
The weak and strict counts are not independent. The preceding polynomiality result lets us evaluate $\Omega_P(m)$ away from positive integers, and the natural question is what the negative evaluations count. The answer is that they encode strict maps, up to the sign determined by $|P|$.
[quotetheorem:8135]
[citeproof:8135]
The hypothesis that $P$ is finite is needed because the sign $(-1)^n$ and the degree $n$ have no meaning for an infinite poset. The positivity of $m$ is also essential for the enumerative interpretation: $\Omega_P(-m)$ is a polynomial evaluation, not a count of maps into an actual negative chain. The theorem does not say that weak and strict maps are naturally bijective for the same $m$; for a chain, weak maps are counted by $\binom{m+n-1}{n}$ while strict maps are counted by $\binom{m}{n}$. Its role is to connect order polynomials with the broader pattern of Ehrhart-type reciprocity: $\Omega_P(m)$ counts lattice points in dilates of the order polytope, while $\overline{\Omega}_P(m)$ counts lattice points satisfying the strict inequalities, corresponding to the relative interior. Negative evaluation is therefore the order-polytope version of recovering interior lattice points from an Ehrhart polynomial.
The simplest cases recover familiar binomial identities and check that the sign convention is the right one.
[example: Reciprocity for Chains]
For a chain $C_n$, the weak and strict order polynomials are
\begin{align*}
\Omega_{C_n}(m)=\binom{m+n-1}{n}, \qquad \overline{\Omega}_{C_n}(m)=\binom{m}{n}.
\end{align*}
Evaluating the weak polynomial at $-m$ gives
\begin{align*}
\Omega_{C_n}(-m)=\binom{-m+n-1}{n}.
\end{align*}
Using the product formula for binomial coefficients,
\begin{align*}
\binom{-m+n-1}{n}=\frac{(-m+n-1)(-m+n-2)\cdots(-m)}{n!}.
\end{align*}
Each of the $n$ factors contributes a minus sign, so
\begin{align*}
\binom{-m+n-1}{n}=(-1)^n\frac{(m-n+1)(m-n+2)\cdots m}{n!}.
\end{align*}
Reordering the factors in the numerator gives
\begin{align*}
\binom{-m+n-1}{n}=(-1)^n\frac{m(m-1)\cdots(m-n+1)}{n!}.
\end{align*}
Since
\begin{align*}
\binom{m}{n}=\frac{m(m-1)\cdots(m-n+1)}{n!},
\end{align*}
we obtain
\begin{align*}
(-1)^n\Omega_{C_n}(-m)=(-1)^n\binom{-m+n-1}{n}=\binom{m}{n}=\overline{\Omega}_{C_n}(m).
\end{align*}
For an antichain $A_n$, there are no order relations, so weak and strict maps are both arbitrary functions $A_n\to [m]$. Hence
\begin{align*}
\Omega_{A_n}(m)=m^n
\end{align*}
and
\begin{align*}
\overline{\Omega}_{A_n}(m)=m^n.
\end{align*}
The reciprocal evaluation is
\begin{align*}
(-1)^n\Omega_{A_n}(-m)=(-1)^n(-m)^n.
\end{align*}
Because $(-m)^n=(-1)^n m^n$, this becomes
\begin{align*}
(-1)^n(-m)^n=(-1)^n(-1)^n m^n=(-1)^{2n}m^n=m^n.
\end{align*}
Thus both the chain and antichain cases match the sign rule in *[Order Polynomial Reciprocity](/theorems/8135)*.
[/example]
The examples show the whole polynomial, but often the most important information is the top-degree term. Since large $m$ makes the order constraints behave like regions inside the cube $[m]^P$, it is natural to ask whether the leading coefficient is determined by compatible total orders. This motivates the leading coefficient theorem.
[quotetheorem:8136]
[citeproof:8136]
The finiteness hypothesis is again essential: $e(P)$ and $n!$ are finite combinatorial quantities only for finite posets. The theorem also depends on using the ordinary order polynomial; a labeled $P$-partition enumerator with forced strict inequalities can have the same degree but different lower-order terms. It does not determine the whole polynomial. For example, the two-element chain has
\begin{align*}
\Omega_{C_2}(m)=\binom{m+1}{2}=\frac{m^2+m}{2},
\end{align*}
while the reverse-labeled two-element chain has bounded $P$-partition enumerator
\begin{align*}
\binom{m}{2}=\frac{m^2-m}{2}.
\end{align*}
Both have leading coefficient $1/2$, but their linear terms differ. The leading coefficient is best viewed as the normalized volume of the order polytope, and in this course it serves as the bridge from counting maps to counting compatible total orders.
The standard test cases make the formula concrete.
[example: Leading Coefficients for Basic Posets]
For the chain $C_n$, there is only one linear extension, namely the order $x_1,x_2,\dots,x_n$, so $e(C_n)=1$. Its order polynomial is
\begin{align*}
\Omega_{C_n}(m)=\binom{m+n-1}{n}=\frac{m(m+1)\cdots(m+n-1)}{n!}.
\end{align*}
The numerator is a product of $n$ monic linear factors in $m$, so its highest-degree term is $m^n$. Therefore the coefficient of $m^n$ in $\Omega_{C_n}(m)$ is
\begin{align*}
\frac{1}{n!}.
\end{align*}
For the antichain $A_n$, there are no order relations, so every permutation of the $n$ elements is a linear extension. Hence
\begin{align*}
e(A_n)=n!.
\end{align*}
Also every function $A_n\to [m]$ is order-preserving, giving
\begin{align*}
\Omega_{A_n}(m)=m^n.
\end{align*}
Thus the coefficient of $m^n$ is $1$, which equals
\begin{align*}
\frac{e(A_n)}{n!}=\frac{n!}{n!}=1.
\end{align*}
For the three-element V-poset with $a<c$ and $b<c$, the element $c$ must appear last in every linear extension, while $a$ and $b$ may appear in either order. Thus the two linear extensions are
\begin{align*}
a,b,c
\end{align*}
and
\begin{align*}
b,a,c.
\end{align*}
So $e(P)=2$. The leading-coefficient prediction is therefore
\begin{align*}
\frac{e(P)}{3!}=\frac{2}{6}=\frac{1}{3}.
\end{align*}
This matches the direct count: if $f(c)=t$, then $f(a)$ and $f(b)$ are independently chosen from $[t]$, so
\begin{align*}
\Omega_P(m)=\sum_{t=1}^m t^2=\frac{m(m+1)(2m+1)}{6}.
\end{align*}
The highest-degree term of the last expression is
\begin{align*}
\frac{2m^3}{6}=\frac{1}{3}m^3.
\end{align*}
Thus the chain, antichain, and V-poset all realize the leading coefficient $e(P)/n!$ in concrete form.
[/example]
## Linear Extensions, Descents, and the Fundamental Theorem
The previous section used linear extensions as a proof device. The deeper result is that bounded $P$-partitions decompose into disjoint pieces indexed by linear extensions, and descents describe the strict inequalities inside each piece.
[definition: Linear Extension as a Word]
Let $(P,\omega)$ be a labeled poset with $n=|P|$. A linear extension of $P$ is written as a word $\pi=\pi_1\cdots \pi_n$ in the labels $\omega(P)$ such that if $x<_P y$, then $\omega(x)$ appears before $\omega(y)$ in $\pi$.
[/definition]
Writing a linear extension as a word lets us import permutation statistics into poset enumeration. A count that remembers only the number of linear extensions loses information: the words $123$ and $321$ are both single extensions of a three-element chain, but they lead to weak and strict sequence counts. The statistic needed here records exactly where the label word decreases, because those are the positions that force strict inequalities in the associated sequence. This motivates defining the descent set before stating the decomposition theorem.
[definition: Descent Set]
Let $\pi=\pi_1\cdots \pi_n$ be a permutation of $\{1,\dots,n\}$. Its descent set is
\begin{align*}
\operatorname{Des}(\pi)=\{i\in \{1,\dots,n-1\}:\pi_i>\pi_{i+1}\}.
\end{align*}
The number of descents is $\operatorname{des}(\pi)=|\operatorname{Des}(\pi)|$.
[/definition]
A descent means that the labels decrease when the linear extension moves from one element to the next. In a $P$-partition, such a descent is exactly where equality between adjacent entries must be forbidden, while non-descents allow weak inequalities. The problem is to separate all assignments by the compatible linear extension that records their tie-breaking order; the theorem gives that decomposition and makes descents the counting data.
[quotetheorem:8137]
[citeproof:8137]
The labeling hypothesis is necessary because descents are not defined for an unlabeled poset; without labels the theorem cannot decide which adjacent equalities should be forbidden. The finiteness hypothesis is needed so that the elements can be arranged into a finite word and a finite sequence. The theorem does not claim that every permutation occurs, only those words that are linear extensions of $P$; for a chain there is just one such word. It is the main refinement of order enumeration because it records the descent pattern of each compatible total order. To turn that structural decomposition into an actual counting formula, we now impose the finite bound $\sigma(P)\subset [m]$ and count the allowed sequences extension by extension.
[quotetheorem:8138]
[citeproof:8138]
The positivity of $m$ is needed because $[m]$ is a finite chain only for positive integers in this notation. The labeled-poset hypothesis is also essential: replacing $(P,\omega)$ by the underlying poset alone loses the descent statistic, as the natural and reverse labelings of a chain give different bounded counts. The theorem does not say the enumerator is independent of the labeling; that independence holds only in the natural-labeling case, where it reduces to the order polynomial $\Omega_P(m)$. More broadly, the formula is the finite shadow of the $P$-partition generating function, a quasisymmetric generating function whose monomials are grouped by descent sets.
For non-natural labelings the formula is therefore a refined invariant of the labeled poset, and different labelings of the same underlying poset can separate descent behavior.
[example: Three-Element Labelings and Descents]
Let $P$ be the chain $x<y<z$. Since every linear extension must list $x$ before $y$ before $z$, the underlying chain has only one possible extension order.
With the natural labeling $\omega(x)=1$, $\omega(y)=2$, and $\omega(z)=3$, this extension order becomes the word $123$. Its descent set is empty, because
\begin{align*}
1<2
\end{align*}
and
\begin{align*}
2<3.
\end{align*}
Thus $\operatorname{des}(123)=0$. By *Bounded P-Partition Enumerator*, the bounded $P$-partition count is
\begin{align*}
\binom{m+3-1-\operatorname{des}(123)}{3}.
\end{align*}
Substituting $\operatorname{des}(123)=0$ gives
\begin{align*}
\binom{m+3-1-0}{3}=\binom{m+2}{3}.
\end{align*}
Equivalently, the inequalities are
\begin{align*}
1\le \sigma(x)\le \sigma(y)\le \sigma(z)\le m,
\end{align*}
so the natural labeling recovers weak chain enumeration.
With the reverse labeling $\omega(x)=3$, $\omega(y)=2$, and $\omega(z)=1$, the same extension order $x,y,z$ becomes the word $321$. Its descent set is
\begin{align*}
\operatorname{Des}(321)=\{1,2\},
\end{align*}
because
\begin{align*}
3>2
\end{align*}
and
\begin{align*}
2>1.
\end{align*}
Hence $\operatorname{des}(321)=2$. Applying *Bounded P-Partition Enumerator* again gives
\begin{align*}
\binom{m+3-1-\operatorname{des}(321)}{3}.
\end{align*}
Substituting $\operatorname{des}(321)=2$ gives
\begin{align*}
\binom{m+3-1-2}{3}=\binom{m}{3}.
\end{align*}
In this case both comparable relations have decreasing labels, so the $P$-partition inequalities are
\begin{align*}
1\le \sigma(x)<\sigma(y)<\sigma(z)\le m.
\end{align*}
Thus the same three-element chain gives weak chain enumeration under the natural labeling and strict chain enumeration under the reverse labeling.
[/example]
The chain examples are extreme because the poset has only one linear extension, so every labeling effect is concentrated in one descent set. The V-poset shows a milder and more representative phenomenon: the underlying partial order leaves room for several extensions, and the chosen labels determine how their descent sets are distributed. This matters because two labelings can have the same number of extensions but different descent enumerators, so the bounded $P$-partition count sees more than $e(P)$.
[example: V-Poset Descent Enumerator]
Let $P$ have $a<c$ and $b<c$, and use the natural labeling $\omega(a)=1$, $\omega(b)=2$, and $\omega(c)=3$. In any linear extension, $c$ must occur after both $a$ and $b$, while $a$ and $b$ may occur in either order. Therefore the two extension orders are $a,b,c$ and $b,a,c$, giving the label words $123$ and $213$.
For $123$, there is no descent because $1<2$ and $2<3$, so $\operatorname{des}(123)=0$. For $213$, there is a descent in position $1$ because $2>1$, and no descent in position $2$ because $1<3$, so
\begin{align*}
\operatorname{Des}(213)=\{1\}
\end{align*}
and $\operatorname{des}(213)=1$. By *Bounded P-Partition Enumerator*, the bounded $P$-partition count is
\begin{align*}
\binom{m+3-1-\operatorname{des}(123)}{3}+\binom{m+3-1-\operatorname{des}(213)}{3}.
\end{align*}
Substituting the two descent numbers gives
\begin{align*}
\Omega_P(m)=\binom{m+2}{3}+\binom{m+1}{3}.
\end{align*}
This matches the direct count from the inequalities. Since the labeling is natural, the relations $a<c$ and $b<c$ impose
\begin{align*}
\sigma(a)\le \sigma(c)
\end{align*}
and
\begin{align*}
\sigma(b)\le \sigma(c).
\end{align*}
If $\sigma(c)=t$, then $\sigma(a)$ has $t$ choices in $[t]$ and $\sigma(b)$ has $t$ choices in $[t]$, independently, so there are $t^2$ choices with top value $t$. Summing over $t=1,\dots,m$ gives
\begin{align*}
\Omega_P(m)=\sum_{t=1}^m t^2.
\end{align*}
The binomial formula gives the same polynomial:
\begin{align*}
\binom{m+2}{3}+\binom{m+1}{3}=\frac{m(m+1)(m+2)}{6}+\frac{(m-1)m(m+1)}{6}.
\end{align*}
Factoring the common term gives
\begin{align*}
\frac{m(m+1)(m+2)}{6}+\frac{(m-1)m(m+1)}{6}=\frac{m(m+1)((m+2)+(m-1))}{6}.
\end{align*}
Since $(m+2)+(m-1)=2m+1$, this becomes
\begin{align*}
\binom{m+2}{3}+\binom{m+1}{3}=\frac{m(m+1)(2m+1)}{6}.
\end{align*}
Thus the descent enumerator and the direct bounded count describe the same enumeration, with the two linear extensions contributing one weak piece and one piece with a forced strict rise.
[/example]
The chapter therefore ends with a unifying picture. Order polynomials count weak maps, strict order polynomials count strict maps, reciprocity relates the two, and the fundamental theorem of $P$-partitions explains both through linear extensions and descents.
# 12. Synthesis: From Local Intervals to Global Enumeration
## Computing Invariants from Intervals
This chapter synthesizes the earlier material on finite posets, incidence algebras, Möbius inversion, order complexes, shellability, Cohen-Macaulayness, characteristic polynomials, and order polynomials. The guiding question is how a large finite poset can be reduced to smaller pieces without losing the invariant we want. The recurring answer is that intervals carry the local information, while convolution, products, and deletion-recursion tell us how to combine it. This section records the computational rules that turn the theory of previous chapters into an algorithmic toolkit.
The first invariant to organise is the Möbius function. Its defining recursion is local on intervals, so every computation begins by isolating the interval $[x,y]$ rather than by studying the ambient poset all at once.
[quotetheorem:8139]
[citeproof:8139]
This theorem is the basic divide-and-compute rule: once all proper subintervals have been evaluated, the top value follows by one summation. The finiteness hypothesis is doing real work here, because the recursion requires only finitely many intermediate elements and the incidence-algebra inverse is obtained by finite triangular inversion. The limitation is that the recursion alone may still require evaluating many unrelated intervals; products give a second source of simplification, replacing repeated interval computations by tensor products of smaller ones.
[quotetheorem:8140]
[citeproof:8140]
The product theorem applies only when the interval itself splits into independent coordinates. That hypothesis fails, for example, in a partition lattice interval where merging two blocks changes which later merges are available. When the splitting does exist, the computational gain is substantial; the Boolean lattice is the model example because every interval is again Boolean.
[example: Boolean Lattice $B_3$]
Let $B_3$ be the subset lattice of $\{1,2,3\}$ ordered by inclusion. If $A\subseteq B$, the map $C\mapsto C\setminus A$ sends the interval $[A,B]$ bijectively and order-preservingly onto the Boolean lattice of subsets of $B\setminus A$, with inverse $D\mapsto A\cup D$. Thus $[A,B]\cong B_k$, where $k=|B\setminus A|$.
For the one-element Boolean lattice $B_1=\{\varnothing,\{i\}\}$, the interval recursion gives
\begin{align*}
\mu_{B_1}(\varnothing,\{i\})=-\mu_{B_1}(\varnothing,\varnothing)=-1.
\end{align*}
Since $B_k\cong B_1^k$, *Multiplicativity over Products* gives
\begin{align*}
\mu_{B_3}(A,B)=\prod_{i\in B\setminus A}\mu_{B_1}(\varnothing,\{i\})=(-1)^k=(-1)^{|B\setminus A|}.
\end{align*}
The rank-$r$ elements of $B_3$ are the $r$-element subsets of $\{1,2,3\}$, so their numbers are
\begin{align*}
\binom{3}{0}=1,\quad \binom{3}{1}=3,\quad \binom{3}{2}=3,\quad \binom{3}{3}=1.
\end{align*}
Hence the rank generating function is
\begin{align*}
F_{B_3}(t)=1+3t+3t^2+t^3=(1+t)^3.
\end{align*}
For the top interval, the open interval $(\varnothing,\{1,2,3\})$ has six elements: the three singletons and the three two-element subsets. Its order complex has one vertex for each of these six elements, and its edges are the containment chains $\{i\}\subset \{i,j\}$. Each singleton is contained in exactly two two-element subsets, so the number of edges is $3\cdot 2=6$. There are no two-dimensional simplices because no chain in the open interval has three elements. Therefore
\begin{align*}
\chi(\Delta(\varnothing,\{1,2,3\}))=6-6=0.
\end{align*}
Since the complex is nonempty, its reduced Euler characteristic is
\begin{align*}
\widetilde{\chi}(\Delta(\varnothing,\{1,2,3\}))=\chi(\Delta(\varnothing,\{1,2,3\}))-1=0-1=-1.
\end{align*}
This agrees with the Möbius value computed above:
\begin{align*}
\mu_{B_3}(\varnothing,\{1,2,3\})=(-1)^3=-1.
\end{align*}
[/example]
The Boolean computation shows what happens when all choices are independent. The next obstruction is dependence: choosing an edge in a graph changes the rank behaviour of the remaining edge set, so a direct product decomposition no longer records the invariant. In graphical lattices and matroids, deletion and contraction provide the replacement for direct product factorisation.
[quotetheorem:8141]
[citeproof:8141]
The theorem gives a lattice-theoretic version of a familiar chromatic-polynomial recursion. The condition that $e$ is neither a loop nor a bridge is essential for the displayed two-term recursion: a loop never raises rank and forces the characteristic polynomial to vanish, while a bridge raises rank in a way that contributes the separate $q-1$ factor. It is also a reminder that characteristic polynomials are Möbius sums over flats, so recursive graph operations can be read directly inside a geometric lattice.
## Comparing Mobius, Shellability, Cohen-Macaulayness, and Order Polynomials
Which invariants are independent pieces of information, and which are shadows of the same structure? Möbius functions, shellings, Cohen-Macaulayness, and order polynomials measure different features, but they repeatedly meet on intervals. The main comparison is that topology explains signs and absolute values of Möbius numbers, while order polynomials encode enumeration of order-preserving maps.
For a bounded finite poset, the Möbius value of the whole poset is an Euler characteristic of the order complex of the open interval. This is the bridge between incidence algebra and topology.
[quotetheorem:8142]
[citeproof:8142]
This formula turns topology into a computational device by replacing an incidence-algebra sum with a homological invariant. The hypotheses are deliberately local: the order complex is formed from the open interval $(x,y)$, not from the whole ambient poset, so changing elements outside $[x,y]$ cannot change the value. The remaining obstruction is that Euler characteristics of arbitrary complexes can be hard to compute from the face poset alone; shellings supply a controlled construction that reads the relevant homology from combinatorial data.
[quotetheorem:6505]
[citeproof:6505]
The theorem explains why EL-labelings are more than a proof of Cohen-Macaulayness: they are a practical Möbius-computation machine. The dimension and purity assumptions matter because the sign $(-1)^d$ is the top homological degree; if the complex is not pure or not shellable, homology may occur in several dimensions and the Möbius number need not be a signed count of selected facets. The Boolean lattice again shows the entire comparison in a small case.
[example: EL Labeling of B Three]
Label each cover $A\lessdot A\cup\{i\}$ in $B_3$ by $i$. A maximal chain from $\varnothing$ to $\{1,2,3\}$ is obtained by adding the three elements one at a time, so it has the form
\begin{align*}
\varnothing \lessdot \{i_1\}\lessdot \{i_1,i_2\}\lessdot \{i_1,i_2,i_3\}=\{1,2,3\},
\end{align*}
where $(i_1,i_2,i_3)$ is a permutation of $(1,2,3)$. Its label sequence is exactly $(i_1,i_2,i_3)$. A falling maximal chain has strictly decreasing labels, so among the six permutations
\begin{align*}
123,\ 132,\ 213,\ 231,\ 312,\ 321
\end{align*}
the only falling one is $321$.
The open interval $(\varnothing,\{1,2,3\})$ has maximal chains of length $2$, so its order complex is $1$-dimensional. By *Falling-Chain Formula for the Möbius Function*, the Möbius number is $(-1)^d r$ with $d=1$ and with $r$ equal to the number of falling maximal chains. Here $r=1$, hence
\begin{align*}
\mu_{B_3}(\varnothing,\{1,2,3\})=(-1)^1\cdot 1=-1.
\end{align*}
This is the same value as $(-1)^3$, since $(-1)^3=-1$.
The same labeling works on any interval $[A,B]$. If $B\setminus A=\{j_1,\dots,j_k\}$, then maximal chains in $[A,B]$ correspond to the $k!$ orders in which the elements of $B\setminus A$ can be added, and the falling chains are exactly the decreasing orders of these labels. Thus there is one falling chain when $k\ge 1$. For $k\ge 2$, the open interval has dimension $k-2$, so the shelling formula gives
\begin{align*}
\mu_{B_3}(A,B)=(-1)^{k-2}\cdot 1=(-1)^k.
\end{align*}
For $k=1$, the interval recursion gives
\begin{align*}
\mu_{B_3}(A,B)=-\mu_{B_3}(A,A)=-1=(-1)^1,
\end{align*}
and for $k=0$ we have $\mu_{B_3}(A,A)=1=(-1)^0$. Therefore, for every $A\subseteq B$ in $B_3$,
\begin{align*}
\mu_{B_3}(A,B)=(-1)^{|B\setminus A|}.
\end{align*}
[/example]
This example computes one interval, but topological regularity asks for the same homological control on every interval and on the links that arise inside their order complexes. A single shellable top interval is not enough: a smaller interval can still carry unwanted lower-dimensional homology. That interval-by-interval demand is what turns shellability into Cohen-Macaulayness.
[quotetheorem:8143]
[citeproof:8143]
Cohen-Macaulayness controls topology of intervals, while order polynomials count maps from the poset into a chain. A direct count of strict maps is often less convenient than a count of weak maps, especially when incomparable elements are allowed to vary independently but comparable elements must satisfy inequalities. The comparison to make next is not homological but reciprocal: strict and weak enumerations are two evaluations of one polynomial.
[quotetheorem:8135]
[citeproof:8135]
This theorem is conceptually parallel to Möbius reciprocity: a weak enumeration has a signed strict counterpart. Finiteness is essential because the order polytope has dimension $n=|P|$, and that dimension is exactly what produces the sign $(-1)^n$ in Ehrhart reciprocity. Strictness applies only along comparable pairs: incomparable elements remain unconstrained relative to each other, which is why the antichain case below has the same weak and strict counts. The difference from Möbius inversion is that order-polynomial reciprocity comes from Ehrhart theory, while Möbius inversion comes from incidence algebra.
[example: Three Basic Order Polynomials]
For a chain $C_n=\{x_1<\cdots<x_n\}$, an order-preserving map $f:C_n\to\{1,\dots,m\}$ is the same thing as a weakly increasing sequence
\begin{align*}
1\le f(x_1)\le f(x_2)\le\cdots\le f(x_n)\le m.
\end{align*}
Writing $a_i=f(x_i)$, these sequences are counted by choosing a multiset of size $n$ from $\{1,\dots,m\}$, so
\begin{align*}
\Omega_{C_n}(m)=\binom{m+n-1}{n}.
\end{align*}
A strictly order-preserving map is the same thing as a strictly increasing sequence
\begin{align*}
1\le a_1<a_2<\cdots<a_n\le m,
\end{align*}
so it is determined by choosing the $n$ distinct values appearing in the sequence:
\begin{align*}
\Omega_{C_n,\mathrm{str}}(m)=\binom{m}{n}.
\end{align*}
For an antichain $A_n$, there are no comparability relations to impose. Each of the $n$ elements may be sent independently to any of the $m$ values, hence
\begin{align*}
\Omega_{A_n}(m)=m^n.
\end{align*}
Strictness adds no inequalities, because there are no comparable distinct elements in an antichain, so
\begin{align*}
\Omega_{A_n,\mathrm{str}}(m)=m^n.
\end{align*}
Now let $P$ be the V-shaped poset with relations $a<b$ and $a<c$. If $f(a)=j$, then order preservation requires
\begin{align*}
j\le f(b)\le m
\end{align*}
and
\begin{align*}
j\le f(c)\le m.
\end{align*}
There are $m-j+1$ choices for $f(b)$ and independently $m-j+1$ choices for $f(c)$, so
\begin{align*}
\Omega_P(m)=\sum_{j=1}^{m}(m-j+1)^2.
\end{align*}
With the substitution $r=m-j+1$, the values of $r$ run through $m,m-1,\dots,1$, hence
\begin{align*}
\Omega_P(m)=\sum_{r=1}^{m}r^2.
\end{align*}
Using the finite square-sum identity gives
\begin{align*}
\Omega_P(m)=\frac{m(m+1)(2m+1)}{6}.
\end{align*}
For the strict count on the same V-shaped poset, if $f(a)=j$, then strict order preservation requires
\begin{align*}
j<f(b)\le m
\end{align*}
and
\begin{align*}
j<f(c)\le m.
\end{align*}
There are $m-j$ choices for each of $f(b)$ and $f(c)$, independently, so
\begin{align*}
\Omega_{P,\mathrm{str}}(m)=\sum_{j=1}^{m}(m-j)^2.
\end{align*}
With $r=m-j$, the values of $r$ run through $m-1,m-2,\dots,0$, and the $r=0$ term contributes $0$, so
\begin{align*}
\Omega_{P,\mathrm{str}}(m)=\sum_{r=1}^{m-1}r^2.
\end{align*}
Applying the same square-sum identity with $m-1$ in place of $m$ gives
\begin{align*}
\Omega_{P,\mathrm{str}}(m)=\frac{(m-1)m(2m-1)}{6}.
\end{align*}
These three computations show the same principle in increasing complexity: chains impose total inequality constraints, antichains impose none, and the V-shaped poset imposes two independent upper choices after the lower value has been fixed.
[/example]
## Capstone Computations in Lattices
How do the separate techniques behave when a poset carries several structures at once? The best final tests are examples where incidence algebra, topology, and lattice theory all produce the same invariant. We now run full computations for three standard families.
Partition lattices are the first example where intervals are not merely Boolean but still factor into recognisable pieces. A partition interval decomposes according to how blocks are merged, and that decomposition reduces the global factorial formula to the shelling computation for the top interval.
[quotetheorem:8144]
[citeproof:8144]
For $n=4$, the formula is small enough to check against the actual ranks. The lattice has ranks with sizes $1,6,7,1$, and the top Möbius number is $-6$. The seven rank-two partitions split into two interval types, so the check must track lower interval Möbius values rather than rank size alone.
[example: Partition Lattice $\Pi_4$]
By *[Möbius Function of the Partition Lattice](/theorems/8144)*, the top interval of $\Pi_4$ satisfies
\begin{align*}
\mu_{\Pi_4}(\hat{0},\hat{1})=(-1)^{4-1}(4-1)!=(-1)^3\cdot 3!=-6.
\end{align*}
We can recover the same value from the interval recursion by summing the Möbius values below $\hat{1}=1234$.
The bottom element is $\hat{0}=1|2|3|4$, so
\begin{align*}
\mu(\hat{0},\hat{0})=1.
\end{align*}
A rank-one partition is obtained by choosing the two elements to merge, so there are $\binom{4}{2}=6$ such partitions. If $\rho$ is one of them, then the interval $[\hat{0},\rho]$ has only $\hat{0}$ and $\rho$, and the interval recursion gives
\begin{align*}
\mu(\hat{0},\rho)=-\mu(\hat{0},\hat{0})=-1.
\end{align*}
Thus the total rank-one contribution is
\begin{align*}
6\cdot(-1)=-6.
\end{align*}
There are two types of rank-two partitions. First, a partition of type $2+2$ pairs the four elements into two unordered pairs. Choosing the first pair gives $\binom{4}{2}=6$ choices, and each pairing is counted twice, once for each of its two blocks, so there are
\begin{align*}
\frac{1}{2}\binom{4}{2}=3
\end{align*}
partitions of type $2+2$. For example, if $\tau=12|34$, then $[\hat{0},\tau]\cong \Pi_2\times \Pi_2$. Each $\Pi_2$ factor has Möbius value $-1$ from bottom to top, so multiplicativity gives
\begin{align*}
\mu(\hat{0},12|34)=(-1)(-1)=1.
\end{align*}
The total contribution from the three type-$2+2$ partitions is
\begin{align*}
3\cdot 1=3.
\end{align*}
Second, a partition of type $3+1$ is determined by choosing its three-element block, so there are
\begin{align*}
\binom{4}{3}=4
\end{align*}
such partitions. For example, if $\tau=123|4$, then $[\hat{0},\tau]\cong \Pi_3\times \Pi_1$. The $\Pi_3$ factor contributes
\begin{align*}
(-1)^{3-1}(3-1)!=(-1)^2\cdot 2!=2,
\end{align*}
and the $\Pi_1$ factor contributes $1$, so
\begin{align*}
\mu(\hat{0},123|4)=2\cdot 1=2.
\end{align*}
The total contribution from the four type-$3+1$ partitions is
\begin{align*}
4\cdot 2=8.
\end{align*}
Therefore the sum of all Möbius values strictly below $\hat{1}$ is
\begin{align*}
1+6(-1)+3(1)+4(2)=1-6+3+8=6.
\end{align*}
Applying the interval recursion to the top element gives
\begin{align*}
\mu(\hat{0},\hat{1})=-\sum_{\hat{0}\le \sigma<\hat{1}}\mu(\hat{0},\sigma)=-6.
\end{align*}
The recursive computation matches the closed formula for $\Pi_4$, and the split into $2+2$ and $3+1$ intervals explains why rank sizes alone do not determine the sum.
[/example]
Graphical lattices provide the final synthesis because their characteristic polynomials connect flat lattices, Möbius sums, and graph colorings. The triangle graph is the smallest non-tree example with a nontrivial deletion-contraction step.
[example: Lattice of Flats of a Triangle]
Let $G=C_3$ be the triangle with edge set $E=\{e_1,e_2,e_3\}$. In the graphical matroid of $C_3$, the rank of an edge set is the size of a maximal forest inside that edge set, so $r(\varnothing)=0$, $r(\{e_i\})=1$, and $r(E)=2$. The set $\varnothing$ is a flat because adding any edge raises the rank from $0$ to $1$, and each singleton $\{e_i\}$ is a flat because adding any other edge gives a two-edge path and raises the rank from $1$ to $2$. A two-element set is not a flat: if $\{e_i,e_j\}$ has two edges, then $r(\{e_i,e_j\})=2$, while adding the remaining edge gives $E$ and $r(E)=2$, so the remaining edge lies in its closure. Thus the flats are exactly $\varnothing$, the three singletons, and $E$, with rank sizes $1,3,1$.
We compute the Möbius values from the bottom using the interval recursion. First,
\begin{align*}
\mu(\varnothing,\varnothing)=1.
\end{align*}
For a singleton flat $\{e_i\}$, the only flat strictly between $\varnothing$ and $\{e_i\}$ is absent, so
\begin{align*}
\mu(\varnothing,\{e_i\})=-\mu(\varnothing,\varnothing)=-1.
\end{align*}
For the top flat $E$, the flats strictly below $E$ are $\varnothing$ and the three singletons, hence
\begin{align*}
\mu(\varnothing,E)=-\left(\mu(\varnothing,\varnothing)+\sum_{i=1}^{3}\mu(\varnothing,\{e_i\})\right).
\end{align*}
Substituting the values already computed gives
\begin{align*}
\mu(\varnothing,E)=-\left(1+3(-1)\right)=-(-2)=2.
\end{align*}
The lattice has rank $2$, so its characteristic polynomial is
\begin{align*}
\chi_{L(G)}(q)=\sum_{F\in L(G)}\mu(\varnothing,F)q^{2-r(F)}.
\end{align*}
Using the three rank levels separately,
\begin{align*}
\chi_{L(G)}(q)=1\cdot q^2+3(-1)\cdot q+2\cdot 1.
\end{align*}
Therefore
\begin{align*}
\chi_{L(G)}(q)=q^2-3q+2.
\end{align*}
Factoring the quadratic,
\begin{align*}
q^2-3q+2=q^2-q-2q+2=q(q-1)-2(q-1)=(q-1)(q-2).
\end{align*}
The chromatic polynomial of a triangle is $q(q-1)(q-2)$, since the three vertices can be colored in $q$, then $q-1$, then $q-2$ choices. Dividing by $q$ gives $(q-1)(q-2)$, matching the lattice characteristic polynomial.
[/example]
The same calculation can be redone recursively. Deleting one edge of $C_3$ gives a path on three vertices, while contracting it gives two parallel edges on two vertices; the deletion-contraction formula recovers $q^2-3q+2$ after passing to the lattice characteristic polynomial.
[remark: What the Capstone Examples Show]
The Boolean lattice is governed by products of two-element choices, so every invariant factors into binomial data. The partition lattice is governed by interval factorisation over blocks, with shellability supplying the factorial Möbius values. The graphical lattice is governed by matroid deletion and contraction, with characteristic polynomials acting as the common language between lattice intervals and graph enumeration.
[/remark]
The course ends with a reusable workflow. First identify the intervals that control the invariant. Next test for product decompositions, EL-labelings, or deletion-contraction recursions. Then compare the answer against topological data from the order complex and enumerative data such as rank generating functions or order polynomials. When these perspectives agree, the computation is not only a number but a structural explanation of why the poset behaves as it does.
## Beyond And Connections
The topics in this course connect directly to [Algebraic Combinatorics I: Symmetric Functions](/page/Algebraic%20Combinatorics%20I%3A%20Symmetric%20Functions), where tableaux and symmetric functions provide the companion representation-theoretic side of the subject. The enumerative methods here also rely on ordinary generating series; the [Power Series](/page/Power%20Series) page gives the analytic and formal background for manipulating those series.
The lattice and closure chapters point toward [Matroid Theory](/page/Matroid%20Theory), where geometric lattices become a systematic language for dependence. On the theorem side, [Möbius Inversion on a Finite Poset](/theorems/7126) is the algebraic engine behind incidence algebra calculations, while [Shellable Complexes Are Wedges of Spheres](/theorems/6505), [Shellable Complexes Are Cohen-Macaulay](/theorems/6511), and [Reisner's Criterion](/theorems/6512) are the topological and commutative-algebraic endpoints of the order-complex chapters.
## References
For further background, see Richard P. Stanley, *Enumerative Combinatorics, Volume 1*, for Möbius inversion, $P$-partitions, and order polynomials; Richard P. Stanley, *Combinatorics and Commutative Algebra*, for shellability and Cohen-Macaulay complexes; Anders Bjorner, Michel Las Vergnas, Bernd Sturmfels, Neil White, and Gunter M. Ziegler, *Oriented Matroids*, for geometric lattices and matroidal examples; and Kenneth P. Bogart, *Introductory Combinatorics*, for chain and antichain decomposition theorems.
Contents
- Introduction
- What Does A Poset Remember?
- Why Intervals Matter?
- How Do Chains And Antichains Measure Size?
- When Does A Poset Become A Lattice?
- Where Topology Enters
- How The Course Is Organised
- 1. Finite Posets and Their Invariants
- From Order Relations to Diagrams
- Ideals, Filters, and Linear Extensions
- Rank and Graded Posets
- Order Complexes and Counting Chains
- Operations on Finite Posets
- 2. Chains, Antichains, and Decomposition Theorems
- Width, Height, and Decompositions
- Dilworth's Theorem and Mirsky's Theorem
- Sperner Posets and the LYM Inequality
- 3. Möbius Functions and Inversion
- Cumulative Functions on Locally Finite Posets
- Möbius Inversion on Posets
- Structure Theorems and Computations
- 4. Incidence Algebras
- Incidence Algebra and Convolution
- Chain-Counting Functions and Exponentials
- Reduced Incidence Algebras and Hereditary Intervals
- 5. Lattices and Closure Operators
- Meets and Joins as Order-Theoretic Operations
- Closure Operators and Moore Families
- Distributive Lattices and Order Ideals
- 6. Geometric Lattices and Matroids
- Atomic Semimodular Lattices as Flat Lattices
- Characteristic Polynomials and the Möbius Function
- Deletion, Contraction, and Crapo's Beta Invariant
- Hyperplane Arrangements and Intersection Lattices
- 7. Eulerian Posets and Flag Enumeration
- Flag Enumeration in Graded Posets
- Eulerian Posets and Alternating Intervals
- Bayer-Billera Relations
- The $ab$-Index and the $cd$-Index
- What the $cd$-Index Measures
- 8. Order Complexes and Poset Topology
- Chains as Simplices
- Barycentric Subdivision, Links, and Intervals
- Reduced Euler Characteristic and the Möbius Function
- Contractibility and Homotopy Equivalences
- Crosscuts in Lattices
- Wedges of Spheres and Cohen-Macaulay Posets
- Partition and Subspace Lattices
- 9. Shellability
- Shelling Orders for Pure Simplicial Complexes
- Lexicographic Shellability of Posets
- Homological and Enumerative Consequences
- 10. Cohen-Macaulay Posets
- Cohen-Macaulay Simplicial Complexes and Reisner's Criterion
- Posets, Intervals, and Order Complexes
- Rank Selection
- Products and Other Preservation Theorems
- 11. $P$-Partitions and Order Polynomials
- Order-Preserving Maps and Labeled Posets
- Order Polynomials and Reciprocity
- Linear Extensions, Descents, and the Fundamental Theorem
- 12. Synthesis: From Local Intervals to Global Enumeration
- Computing Invariants from Intervals
- Comparing Mobius, Shellability, Cohen-Macaulayness, and Order Polynomials
- Capstone Computations in Lattices
- Beyond And Connections
- References
Algebraic Combinatorics II: Posets and Lattices
Also known as: Algebraic Combinatorics II: Posets and Lattices, Algebraic Combinatorics II, Posets and Lattices, Posets and lattices notes, Algebraic combinatorics posets and lattices
Content
Problems
History
Created by admin on 6/20/2026 | Last updated on 6/20/2026
Prerequisites (0/8 completed)
Log in to track your prerequisite progress.
Prerequisites Graph
Interactive dependency map showing prerequisite concepts
Loading dependency graph...
Theorem
Definition
Current
Requires
Rate this page
★
★
★
★
★
Poor
Excellent