On some proofs of the Schroeder-Bernstein theorem.
A version of this article originally appeared in The Oxford Invariants Society magazine The Invariant in Trinity Term 2015.
1. Motivation: Sets, sizes and injections
How do we know whether two sets are the same size, in other words, that they have the same number of elements? For small, finite sets, it is straightforward: count the number of elements in each set and if the answer is the same, then they have the same size. But what about sets that are very large, or infinite? Using the Diagonal Argument (1), Georg Cantor (1845-1918) proved that there are different sizes of infinity: the set of natural numbers is ‘countably’ infinite whilst the set of real numbers is bigger—‘uncountably’ infinite. Much of set theory research is concerned with investigating the size and ordering of different infinite sets. In the face of these complications, we need a better way to compare the sizes of different sets.
Cantor defined two sets as having the same cardinality (same size) if and only if there exists a bijective function between them, i.e. a one-to-one correspondence between their elements. In practice, however, it can oftentimes be tricky to define the necessary bijection—and easier to define an injective function from each set into the other. If there exist injective mappings in each direction, then we intuitively feel that the two sets ought to be of equal size. But is this true?
The Schroeder-Bernstein theorem says Yes: if there exist injective functions and
between sets
and
, then there exists a bijection
and so, by Cantor’s definition,
and
are the same size (
). Furthermore, if we go on to define
as having cardinality greater than or equal to
(
) if and only if there exists an injection
, then the theorem states that
and
together imply
. Thus, in set theory the ordering of infinite cardinalities can work as we expect.
2. At the turn of the twentieth century
Like many important theorems in mathematics, the Schroeder-Bernstein theorem has a convoluted history and is arguably misnamed. Cantor first proposed the theorem in 1887 in a letter to Richard Dedekind (1831-1916), but did not publish a proof at that time; the theorem is nevertheless often referred to as the Cantor-Schroeder-Bernstein theorem, or simply the Cantor-Bernstein theorem. Ernst Schroeder (1841-1902) offered a proof in 1896, but this quickly turned out to be erroneous. The following year, both Schroeder and Felix Bernstein (1878 – 1956), who was then an undergraduate student of Cantor’s, independently found correct proofs. Bernstein’s proof became widely known when it was presented by Cantor at the International Congress of Mathematicians in 1897.
Dedekind, in the meantime, had quietly written a proof back in 1887 that, for reasons of his own, he neither published nor notified Cantor about. This proof was not discovered until 1908. Dedekind went on to prove the theorem at least once more, but his name, of course, is never associated with it!
3. ‘A proof of the Schroeder-Bernstein theorem’
Twentieth century proofs of the Schroeder-Bernstein theorem accumulated. R. H. Cox of the University of Kentucky offered a little-known alternative proof in a short note published by The American Mathematical Monthly in May 19683 (2). With a certain amount of paraphrasing, we explain and summarise his approach here. In the opinion of R. H. Cox, this proof ‘is more easily followed and smoother than any of the usual arguments’ (we leave up to the reader to decide whether not they agree).
First we establish the following lemma:
Lemma. If is a set, and , and is an injective function from A into B, then there exists a injective function from onto , that is, is a bijection.
Proof. If , then the identity function is such an
. Therefore, suppose
is a proper subset of
. Define
such that
is the identity function and, for each positive integer
,
. Let
be the set of all elements
for which there exists a non-negative integer
and an element
such that
. Then the following hold:
, since for all
,
;
- For all
,
and
are unique. This follows from the assumption that
is injective;
, by the definition of
—for every
,
for some
, some non-negative integer
and some
. Therefore
.
For example, in Fig. 1, since ,
, and
, so
,
, and
are in
. Since
and
, so
and
are in
.

Figure 1. Some elements in subset C of A.
For each , define function
as follows:
Then is a bijection
:
- Injection: for all
, this follows from injectivity of
; for
this follows from identity;
- Surjection: if
and
, then for some positive
,
, and some
,
where
i.e.
. Therefore
is accounted for in the first part of the definition of
; if
, again this follows from identity.
Theorem. For sets ,
, if functions
and
are injective, then there is a bijection between
and
.
Proof. Since is an injective function from
to
, then by the lemma there is a bijection
. Thus,
is a bijection between
and
(see Fig. 2).

Figure 2.
4. On other proofs and some proof properties
There are many ways to prove the Schroeder-Bernstein theorem. It is worth surveying some of the earliest approaches taken soon after the theorem was first proposed by Cantor. Back then, it was known as the ‘Equivalence theorem’ and was usually stated in its two-set formulation: for sets and
, if there exists a bijection between
and a proper subset
of
and a bijection between
and a proper subset
of
, then there exists a bijection between
and
(4). Cantor himself derived the theorem as a consequence of the well-ordering of the cardinals, a statement that he was not able to prove and which later turned out to be equivalent to the Axiom of Choice (AC). Bernstein, in his proof, noted that given the two-set formulation, every proper subset of
must correspond by a bijection to some proper subset of
, which for proper subset
we can call
, and that
has the same size as
and hence the same size as
. He iterated this process to set up an infinite sequence of nested sets
,
,
,
,… such that all sets of even index have the same size as
and all sets of odd index have the same size as
. This construction allows for a reformulation of sets
and
in such a way as to demonstrate their size equivalence. It is said that Bernstein thought of this proof while shaving, and that the idea of infinitely nested subsets may have been inspired by nested infinite reflections seen in two mirrors facing each other (5).
On the other hand, the proof currently presented on the Wikipedia entry for the Schroeder-Bernstein theorem (6), attributed to Julius König (1848-1913), has some similarities to Cox’s proof above. Assuming (without loss of generality) that and
are disjoint, it is possible to construct for each
and each
a unique sequence of elements by repeatedly applying
and
to go right and
and
to go left (a given sequence may terminate to the left where
or
is not defined). It turns out that the sequences form a partition of the (disjoint) union of
and
, and, proceeding by different cases of sequence termination, for each case either
or
will provide the necessary bijection. Quite a different proof flavour is found in the Oxford undergraduate mathematics Part B Set Theory course (7); here, the proof rests on Tarski’s fixed point theorem (Knaster-Tarski theorem) applied to the power set lattice as a lemma.
It is also interesting to consider some properties of proofs of the Schroeder-Bernstein theorem. Although Cantor originally derived the theorem (indirectly) as corollary of the Axiom of Choice, its proof is actually independent of AC; see, for example, Cox’s proof above. However, there is no way to avoid the Law of the Excluded Middle in the process of proving the theorem, meaning that its various proofs are non-constructive. The theorem is therefore not provable under intuitionistic (constructive) logic.
Taking a categorical approach, we observe that we have so far considered the classical Schroeder-Bernstein theorem as originally formulated for the category of sets with the injection relation. The theorem can be generalised to the Schroeder-Bernstein property: for mathematical objects and
, if there exists a monomorphism (injection, or embedding) from
to
and also from
to
, then there exists an isomorphism between them. Formulated in this way, the Schroeder-Bernstein property holds for categories of vector spaces, compact metric spaces and well-ordered sets, but not, for example, for general topological spaces.
Finally—in case anyone was worried—the correctness of the classical Schroeder-Bernstein theorem has been established through proof formalisation in almost all the most popular proof-checking software systems, including HOL Light, Mizar, Coq, Is- abelle, Metamath and ProofPower (8). We can breathe a sigh of relief!
~ Footnotes & References ~
(1) G. Cantor (1892) ‘Ueber eine elementare Frage der Mannigfaltigkeitslehre’. Jahresbericht der Deutschen Mathematiker-Vereinigung 1890–1891 1:75-78.
(2) I encountered a citation to Cox’s article in S. R. Lay’s excellent undergraduate textbook, ‘Analysis: With an Introduction to Proof’ (Pearson), currently in its 5th edition.
(3) R. H. Cox (1968) ‘A Proof of the Schroeder-Bernstein Theorem’. The American Mathematical Monthly, 75(5):508.
(4) Proofs of the Cantor-Bernstein Theorem: A Mathematical Excursion. A. Hinkis. 2013, Birkhaeuser: Basel.
(5) Ibid.
(6) http://en.wikipedia.org/wiki/SchrderBernstein_theorem
(7) The Schroeder-Bernstein theorem sees its first mention, without proof, in the current Oxford undergraduate mathematics syllabus in Prelims: Analysis I: Sequences and Series non-examinable section dealing with countability and density properties of and
. The first full proof of the theorem is encountered in the Part B Set Theory course.