Binomial coefficients are the numbers that count how many ways a finite set can be sampled when order is ignored. They appear whenever a product is expanded, a subset is chosen, a lattice path is traced, or a probability distribution records the number of successes in repeated independent trials. In this sense they sit at the crossing point of combinatorics, [Set](/page/Set), polynomial algebra, probability, and induction.
The central question is simple: if a set has $n$ distinct elements, how many subsets have exactly $k$ elements? The answer is written $\binom{n}{k}$ and called a binomial coefficient. The same number also measures the coefficient of $x^k y^{n-k}$ in $(x+y)^n$, the number of shortest lattice paths from $(0,0)$ to $(k,n-k)$, and the probability weights in the binomial distribution. These different appearances are not coincidences; they are several views of the same counting principle.
## Definition
The most direct way to meet the binomial coefficient is through subsets. A finite set has many ordered listings of the same chosen elements, so counting ordered arrangements first overcounts each unordered choice. The definition divides out precisely that overcounting.
[definition: Binomial Coefficient]
Let $n \in \mathbb{N} \cup \{0\}$ and let $k \in \mathbb{Z}$. If $0 \le k \le n$, then
\begin{align*}
\binom{n}{k} &= \frac{n!}{k!(n-k)!}.
\end{align*}
If $k < 0$ or $k > n$, then
\begin{align*}
\binom{n}{k} &= 0.
\end{align*}
[/definition]
The convention outside the range $0 \le k \le n$ is not merely cosmetic. It lets recurrence formulas such as Pascal's identity be stated without repeatedly separating boundary cases.
To use the coefficient in counting arguments, the objects being counted must be named. The relevant objects are subsets with prescribed cardinality, and this excludes both order and repetition. Naming them also makes it possible to compare binomial coefficients with permutations and ordered lists.
[definition: K Element Subset]
Let $X$ be a finite set and let $k \in \mathbb{N} \cup \{0\}$. A $k$-element subset of $X$ is a subset $A \subset X$ such that $|A| = k$.
[/definition]
The factorial formula counts a number, but many arguments need to identify the actual choices represented by that number. Without this interpretation, it is easy to confuse unordered selections with ordered lists or selections with repetition. For a finite set $X$, the unordered choices of size $k$ are exactly the $k$-element subsets of $X$, so $\binom{n}{k}$ can be read as the size of that family.
[definition: Subset Counting Interpretation]
Let $X$ be a finite set with $|X| = n$, and let $k \in \mathbb{N} \cup \{0\}$. The subset counting interpretation of $\binom{n}{k}$ is
\begin{align*}
\binom{n}{k} &= |\{A \subset X : |A| = k\}|.
\end{align*}
[/definition]
The formula using factorials and the subset count are equivalent: list the chosen elements in order to get $n(n-1)\cdots(n-k+1)$ ordered selections, then divide by the $k!$ orders of the same selected set. This comparison is the first recurring theme: binomial coefficients convert ordered counting into unordered counting.
[example: Choosing a Committee]
Let $X = \{1,2,3,4,5\}$, and count the $2$-element committees from $X$. If we first count ordered pairs of distinct elements, there are $5$ choices for the first entry. After that entry has been chosen, there are $4$ remaining choices for the second entry, so the number of ordered pairs is
\begin{align*}
5 \cdot 4 = 20.
\end{align*}
Each $2$-element committee $\{a,b\}$ with $a \ne b$ appears as exactly two ordered pairs: $(a,b)$ and $(b,a)$. Therefore the number of committees is
\begin{align*}
\frac{20}{2} = 10.
\end{align*}
The factorial formula gives the same value:
\begin{align*}
\binom{5}{2} = \frac{5!}{2!(5-2)!}.
\end{align*}
Since $5-2=3$, this is
\begin{align*}
\binom{5}{2} = \frac{5!}{2!3!}.
\end{align*}
Expanding the factorials,
\begin{align*}
\binom{5}{2} = \frac{5 \cdot 4 \cdot 3 \cdot 2 \cdot 1}{(2 \cdot 1)(3 \cdot 2 \cdot 1)}.
\end{align*}
Canceling the common factor $3 \cdot 2 \cdot 1$ leaves
\begin{align*}
\binom{5}{2} = \frac{5 \cdot 4}{2 \cdot 1}.
\end{align*}
Thus
\begin{align*}
\binom{5}{2} = \frac{20}{2} = 10.
\end{align*}
The ten committees are the ten $2$-element subsets of $X$, not the twenty ordered pairs.
[/example]
This example is deliberately elementary, but it exposes the basic correction factor. Whenever a selected group can be listed in many orders, a binomial coefficient often appears after dividing by those internal orders.
A table of binomial coefficients reveals patterns that are hidden in isolated formulas. The triangular arrangement below is useful because it displays boundary values, symmetry, and the additive recurrence in a single object. It also gives a compact way to navigate examples before proving identities.
[definition: Pascal Triangle]
Pascal's triangle is the triangular array whose entry in row $n$ and position $k$ is $\binom{n}{k}$ for $n \in \mathbb{N} \cup \{0\}$ and $0 \le k \le n$.
[/definition]
The rows begin with $1$, then $1,1$, then $1,2,1$, then $1,3,3,1$, then $1,4,6,4,1$. They encode both arithmetic and combinatorics: every internal entry is the sum of the two entries above it, while each row gives the coefficients in a binomial expansion.
## Equivalent Characterisations
### Algebraic Coefficients
A concept becomes powerful when the same number solves several different counting problems. The coefficient definition explains why the name contains the word "binomial": these numbers are the coefficients that occur when a two-term expression is raised to a power.
[quotetheorem:750]
This theorem says that $\binom{n}{k}$ counts the number of times the monomial $x^k y^{n-k}$ appears when choosing either $x$ or $y$ from each of $n$ factors. It connects finite counting to polynomial algebra.
[example: Expanding a Fourth Power]
For commuting variables $x$ and $y$, the *[Binomial Theorem](/theorems/750)* with $n=4$ gives
\begin{align*}
(x+y)^4 = \sum_{k=0}^{4} \binom{4}{k}x^k y^{4-k}.
\end{align*}
Expanding the five terms of the sum,
\begin{align*}
(x+y)^4 = \binom{4}{0}y^4+\binom{4}{1}xy^3+\binom{4}{2}x^2y^2+\binom{4}{3}x^3y+\binom{4}{4}x^4.
\end{align*}
Using $\binom{4}{k}=\frac{4!}{k!(4-k)!}$ for $0 \le k \le 4$,
\begin{align*}
\binom{4}{0}=\frac{4!}{0!4!}=1,\quad \binom{4}{1}=\frac{4!}{1!3!}=4,\quad \binom{4}{2}=\frac{4!}{2!2!}=\frac{24}{4}=6,\quad \binom{4}{3}=\frac{4!}{3!1!}=4,\quad \binom{4}{4}=\frac{4!}{4!0!}=1.
\end{align*}
Substituting these coefficients,
\begin{align*}
(x+y)^4 = y^4 + 4xy^3 + 6x^2y^2 + 4x^3y + x^4.
\end{align*}
The coefficient $6$ of $x^2y^2$ counts the choices of the two factors that contribute $x$: positions $\{1,2\}$, $\{1,3\}$, $\{1,4\}$, $\{2,3\}$, $\{2,4\}$, and $\{3,4\}$.
[/example]
The algebraic coefficient and the subset count are the same count in different language. This is why binomial coefficients are unavoidable in generating functions and many identities in [Sequence](/page/Sequence) theory.
### Paths and Bernoulli Trials
Some counting problems are easier to see as movement rather than selection. A shortest path on a rectangular grid is completely determined by the positions at which it moves horizontally, or equivalently by the positions at which it moves vertically. This turns a geometric question into the same subset-counting question.
[definition: Monotone Lattice Path]
A monotone lattice path from $(0,0)$ to $(a,b)$, where $a,b \in \mathbb{N} \cup \{0\}$, is a finite sequence of $a+b$ moves in $\mathbb{Z}^2$ consisting of exactly $a$ moves of the form $(i,j) \mapsto (i+1,j)$ and exactly $b$ moves of the form $(i,j) \mapsto (i,j+1)$.
[/definition]
Once the path is encoded by the positions of its horizontal moves, it becomes a subset-counting problem. This is a useful model because many recurrence arguments can be pictured as paths splitting according to their last step.
[quotetheorem:7726]
This characterisation explains why binomial coefficients occur in grid problems, random walks with two possible moves, and recursive algorithms whose state changes by two elementary operations.
Probability gives another reason to separate the number of patterns from the weight of each pattern. In repeated two-outcome experiments, the coefficient counts where the successes happened, while multiplication supplies the probability attached to a fixed pattern. To state this cleanly, the single two-outcome experiment needs a name.
[definition: Bernoulli Trial]
A Bernoulli trial with success probability $p \in [0,1]$ is a random experiment with two outcomes, success and failure, such that the probability of success is $p$ and the probability of failure is $1-p$.
[/definition]
The probability law for repeated Bernoulli trials needs both multiplicity and weight. For a fixed value of $k$, there are many possible locations for the successes, and independence makes each such location pattern have the same probability. The next formula combines these two pieces into the standard probability mass function.
[quotetheorem:4867]
This formula is the bridge from binomial coefficients to the binomial distribution, where the coefficients provide the combinatorial multiplicities behind the probability mass function.
## Boundary Cases and Counting Pitfalls
### Endpoints of the Row
Boundary cases are important because they prevent formulas from being misread. Choosing nothing and choosing everything are both single choices.
[example: Boundary Values]
Let $X$ be an $n$-element set, where $n \in \mathbb{N} \cup \{0\}$. A $0$-element subset of $X$ must contain no elements, so it is exactly $\varnothing$. Hence
\begin{align*}
\{A \subset X : |A| = 0\} = \{\varnothing\}.
\end{align*}
This set has one element, so the subset-counting interpretation gives
\begin{align*}
\binom{n}{0} = |\{\varnothing\}| = 1.
\end{align*}
An $n$-element subset of $X$ must contain all $n$ elements of $X$. Since it is also required to be a subset of $X$, the only possibility is $X$ itself. Thus
\begin{align*}
\{A \subset X : |A| = n\} = \{X\}.
\end{align*}
Therefore
\begin{align*}
\binom{n}{n} = |\{X\}| = 1.
\end{align*}
For the out-of-range values, the definition of the binomial coefficient sets $\binom{n}{k}=0$ whenever $k<0$ or $k>n$. Since $-1<0$ and $n+1>n$,
\begin{align*}
\binom{n}{-1}=0.
\end{align*}
Also,
\begin{align*}
\binom{n}{n+1}=0.
\end{align*}
The first two identities count the unique empty choice and the unique whole-set choice; the last two record that one cannot choose a negative number of elements or more elements than the set contains.
[/example]
These boundary values are the edges of Pascal's triangle. They also make induction proofs cleaner because the recurrence can be extended to endpoints without special language.
### Ordered Choices Versus Subsets
A failure example clarifies why the word "subset" matters. If order or repetition is allowed, a different formula is needed.
[example: Ordered Selections Are Not Binomial Coefficients]
Let $X=\{1,2,3,4\}$. An ordered selection of length $2$ without repetition is a pair $(a,b)$ with $a,b \in X$ and $a \ne b$. There are $4$ choices for the first entry. Once the first entry has been chosen, exactly one element of $X$ is unavailable, so $4-1=3$ choices remain for the second entry. Hence the number of ordered selections is
\begin{align*}
4 \cdot 3 = 12.
\end{align*}
The corresponding binomial coefficient counts $2$-element subsets of $X$, so its factorial value is
\begin{align*}
\binom{4}{2} = \frac{4!}{2!(4-2)!}.
\end{align*}
Since $4-2=2$, this becomes
\begin{align*}
\binom{4}{2} = \frac{4!}{2!2!}.
\end{align*}
Expanding the factorials gives
\begin{align*}
\binom{4}{2} = \frac{4 \cdot 3 \cdot 2 \cdot 1}{(2 \cdot 1)(2 \cdot 1)}.
\end{align*}
Cancel one factor $2 \cdot 1$ from the numerator and denominator:
\begin{align*}
\binom{4}{2} = \frac{4 \cdot 3}{2 \cdot 1}.
\end{align*}
Therefore
\begin{align*}
\binom{4}{2} = \frac{12}{2} = 6.
\end{align*}
The factor of $2$ is exactly the number of orders of each chosen pair: for example, $(1,2)$ and $(2,1)$ are different ordered selections, but both determine the same subset $\{1,2\}$. Thus $\binom{4}{2}$ counts unordered choices, not ordered arrangements.
[/example]
This distinction separates binomial coefficients from permutations. Many mistakes in elementary combinatorics come from using $\binom{n}{k}$ when the positions, order, or labels of the chosen objects still matter.
## Core Identities and Growth
### Symmetry and Pascal Recurrence
The first structural property is symmetry. Choosing a committee can be recorded by naming its members, but it can also be recorded by naming the people left out. The complement construction gives a size-preserving translation between these two descriptions.
[quotetheorem:7727]
Symmetry is more than a factorial identity. It says that the complement map $A \mapsto X \setminus A$ pairs $k$-element subsets with $(n-k)$-element subsets.
The next property is Pascal's identity, the additive rule that builds the entire triangle from its boundary values. It is the main recurrence for binomial coefficients.
[quotetheorem:7728]
Combinatorially, this splits subsets of an $n$-element set according to whether they contain a distinguished element. Algebraically, it is the coefficient comparison behind multiplying $(x+y)^{n-1}$ by $x+y$.
### Row Sums and Signed Cancellation
To count all choices at once, we need the object that stores every subset of a given set. The binomial row divides this object into layers according to subset size. This is why row sums of binomial coefficients naturally lead to the power set.
[definition: Power Set]
Let $X$ be a set. The power set of $X$, denoted $\mathcal{P}(X)$, is the set of all subsets of $X$:
\begin{align*}
\mathcal{P}(X) &= \{A : A \subset X\}.
\end{align*}
[/definition]
The power set is the natural container for all the choices counted row by row. The following identity is the numerical form of that partition by subset size.
[quotetheorem:7729]
This identity is also obtained from the binomial theorem by setting $x=1$ and $y=1$. The two readings correspond to two perspectives: count all subsets directly, or evaluate a generating polynomial.
Signed versions of the same row sum are useful when objects are counted with cancellation. The simplest such cancellation compares even-sized subsets with odd-sized subsets. This prepares the alternating signs that appear in inclusion-exclusion arguments.
[quotetheorem:7730]
The cancellation says that a nonempty finite set has as many even-cardinality subsets as odd-cardinality subsets. Such signed counts are a basic tool in inclusion-exclusion arguments.
### Convolution and Central Growth
A set split into two disjoint parts creates a new kind of counting question: choose a total of $n$ objects, but allow the split between the two parts to vary. Summing over all possible splits should recover the count of choosing $n$ objects from the combined set. The next identity is the standard form of that principle.
[quotetheorem:7731]
Convolution formulas explain how binomial coefficients interact across decomposed sets, but growth questions require looking inside a single row. Which entries dominate as the row grows? The answer is found near the middle, where the number of selected elements is as balanced as possible.
[definition: Central Binomial Coefficient]
For $n \in \mathbb{N} \cup \{0\}$, the central binomial coefficient is
\begin{align*}
\binom{2n}{n}.
\end{align*}
[/definition]
These coefficients count balanced choices: $n$ selected elements from a $2n$-element set, or monotone paths from $(0,0)$ to $(n,n)$. Their size is much larger than a typical fixed-edge entry in Pascal's triangle.
A rough bound is often enough to show the scale of the middle of Pascal's triangle. Since the full row indexed by $2n$ sums to $4^n$, and since the central entry is the largest entry in that row, it must be comparable to a positive fraction of $4^n$. The following estimate records that useful scale without requiring asymptotic analysis.
[quotetheorem:7732]
Sharper estimates use [Stirling's formula](/theorems/1109), but even these bounds already show the exponential scale. The denominator in the lower bound comes from the fact that $\binom{2n}{n}$ is the largest entry in the row whose entries sum to $4^n$.
## Beyond and Connected Topics
Binomial coefficients are basic finite-counting invariants. They belong first to enumerative combinatorics, where the goal is to count structured finite objects, but they also serve as coefficients in algebra and weights in probability.
Their relationship with [Set](/page/Set) theory and the power set is direct: the row $\binom{n}{0},\binom{n}{1},\ldots,\binom{n}{n}$ partitions $\mathcal{P}(X)$ by cardinality when $|X|=n$. This makes them a refined count of subsets rather than only a total count.
Their relationship with polynomial algebra is encoded by the binomial theorem. In generating functions, binomial coefficients become coefficients of formal [Power Series](/page/Power%20Series), and identities among them often become products of [Sequence](/page/Sequence) generating series.
Their relationship with probability is most visible through Bernoulli trials. The coefficient $\binom{n}{k}$ counts the number of success patterns with $k$ successes, while $p^k(1-p)^{n-k}$ gives the probability of each such pattern under independence.
Their relationship with induction comes through Pascal's identity. The recurrence builds row $n$ from row $n-1$, making binomial coefficients one of the standard examples where induction mirrors a combinatorial decomposition.
Their relationship with inclusion-exclusion appears through alternating binomial sums. Signed combinations of binomial coefficients measure cancellation among overlapping conditions.
Several standard generalisations preserve the same idea while changing the objects being counted. Multinomial coefficients count ways to split a finite set into several labelled parts, Gaussian binomial coefficients count subspaces of a finite [vector space](/page/Vector%20Space), and binomial series extend the coefficients $\binom{n}{k}$ from nonnegative integer exponents to more general algebraic expansions.
## References
Richard P. Stanley, *Enumerative Combinatorics, Volume 1* (2012).
Herbert S. Wilf, *generatingfunctionology* (1994).
Ronald L. Graham, Donald E. Knuth, and Oren Patashnik, *Concrete Mathematics* (1994).