* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Large cardinals and the Continuum Hypothesis
Survey
Document related concepts
Transcript
Large cardinals and the Continuum Hypothesis RADEK HONZIK Charles University, Department of Logic, Celetná 20, Praha 1, 116 42, Czech Republic [email protected] Abstract. This is a survey paper which discusses the impact of large cardinals on provability of the Continuum Hypothesis (CH). It was Gödel who first suggested that perhaps “strong axioms of infinity” (large cardinals) could decide interesting set-theoretical statements independent over ZFC, such as CH. This hope proved largely unfounded for CH – one can show that virtually all large cardinals defined so far do not affect the status of CH. It seems to be an inherent feature of large cardinals that they do not determine properties of sets low in the cumulative hierarchy if such properties can be forced to hold or fail by small forcings. The paper can also be used as an introductory text on large cardinals as it defines all relevant concepts. AMS subject code classification: 03E35,03E55. Keywords: Large cardinals, forcing. Acknowledgement: The author acknowledges the generous support of JTF grant Laboratory of the Infinite ID35216. 1 Introduction The question regarding the size of the continuum – i.e. the number of the reals – is probably the most famous question in set theory. Its appeal comes from the fact that, apparently, everyone knows what a real number is and so the question concerning their quantity seems easy to understand. While there is much to say about this apparent simplicity, we will not discuss this issue in this paper. We will content ourselves by stating that the usual axioms of set theory (ZFC) do not decide the size of the continuum, except for some rather trivial restrictions.1 Hence it is consistent, assuming the consistency of ZFC, that the number of reals is the least possible, i.e. the cardinal ℵ1 , but it can be something much larger, e.g. ℵℵ1 . The statement that the number of reals is the least possible is known as the Continuum Hypothesis, CH, for short: CH: |R| = 2ℵ0 = ℵ1 . CH was made famous by David Hilbert who included this problem as the first one on his list of mathematical problems for the 20th century (see for instance [5]). Since ZFC does not decide CH, are there any natural candidates for axioms which do? That is, is there a statement ϕ without apparent connection to CH which 1 The cofinality of the size of the continuum must be uncountable. 1 decides CH one way or the other? In fact there are many of these, such as MA or PFA,2 but we will require ϕ to be one of a more special kind. In 1946, that is well before the development of forcing, Gödel entertained the idea of so called stronger axioms of infinity deciding CH (and other independent statements as well):3 It is not impossible that [...] some completeness theorem would hold which would say that every proposition expressible in set theory is decidable from the present axioms plus some true assertion about the largeness of the universe of all sets ([1]). A natural way to arrive to “true assertions” about largeness of the universe of sets is to take up analogies with natural numbers. When we compare the theory of arithmetics such as PA with the theory of sets such as ZFC, we can show that the only important strengthening of ZFC over PA is the addition of the axiom of infinity. The axiom can be formulated in many ways, but for our purposes we adopt the following definition: (*) Axiom of Infinity: There is an ordinal ω which is the domain of a model for the formalization of PA. Because of this axiom, ZFC can not only prove some logical arithmetical statements which PA itself cannot prove (unless it is inconsistent), such as Con(P A), but also some purely number-theoretical statements as well (such as Goodstein’s theorem, see for instance [17]). Gödel suggested that perhaps by adding a stronger axiom of infinity to ZFC, this new theory might decide new statements interesting to set theoreticians.4 Can we find such an axiom, perhaps similar to (**) or (***) below, which will decide CH? (**) A strong Axiom of Infinity: There is a regular cardinal κ such that hVκ , ∈i is a model of the formalization of ZFC. or (***) A still stronger Axiom of Infinity: There is a regular cardinal κ such that hVκ , ∈i is a model of the formalization of ZFC + (**). Where Vκ is an initial part of the universe of sets (see Definition 2.1) and is the analogue of ω for sets. Remark. This paper is in a sense a continuation of [6] which contains an introduction to the axioms of set theory, discusses the basic set-theoretical notions and not so briefly reviews basics of forcing. Of course, any of the standard texts such as [7] or [10] contains all the prerequisites to this article. A standard reference book for large cardinals is [8] where an interested reader can find more details. 2 See footnote 27. As regards the intuitive “truth” of such axioms, or why they should be preferable to other types of axioms, see a discussion for instance in [2]. 4 Such extensions will always decide new statement, such as Con(ZF C), but these are considered too “logical” and not properly set-theoretical. 3 2 2 How to find large cardinals In this section we survey large cardinals which can be considered as candidates for the stronger axioms of infinity. The selection is rather arbitrary, but does attempt to do justice to the most important concepts. 2.1 Inaccessible cardinals In theSpresence of the Axiom of Foundation,5 the universe V is equal to the union V = α∈ORD Vα , where the initial segments Vα are defined by recursion along the ordinal numbers ORD as follows: Definition 2.1 V0 = ∅, Vα+1 = P(Vα ), S Vα , for λ limit ordinal, Vλ = Sα<λ V = α∈ORD Vα . If κ satisfies (**) above, we say that Vκ is a “natural model” of set theory. To obtain such a model in set theory, we must transgress the power of the plain ZFC theory – this is a consequence of the second Gödel theorem. What are the properties of a cardinal κ such that Vκ satisfies (**) above? We postulated that it must be regular (we will later see that we cannot avoid this assumption), but what else? Definition 2.2 We say that a cardinal µ is strong limit if for all ν < µ, 2ν < µ. Notice that every strong limit cardinal is also limit (i.e. does not have an immediate cardinal predecessor). Lemma 2.3 Assume κ satisfies (**). Then κ is strong limit. Proof. Assume µ < κ is given. ZFC proves that there is a cardinal ν ≥ µ+ and a bijection f : ν → P(µ). Since hVκ , ∈i is a model of ZFC, we have hVκ , ∈i |= “There is a bijection between P(µ) and some ν ≥ µ+ ”. Since Vκ is transitive, and P(µ) = (P(µ))Vκ ,6 any such bijection in Vκ is really (in V ) a bijection between P(µ) and some ordinal ν in Vκ . As ν < κ, 2µ < κ. Notice that for a regular κ, if µ < κ, then P(µ) ∈ κ (see Footnote 6); however, this does not generally imply that κ is strong limit because the existence of a bijection 5 This axioms states that sets are “well-behaved”; for instance sets x such as x ∈ x are prohibited by this axiom. 6 (P(µ))Vκ is the powerset of µ in the sense of hVκ , ∈i. Note that for every limit ordinal α, if β < α, then (P(β))Vα = P(β) because P(β) ⊆ Vβ+1 , and so P(β) ∈ Vβ+2 ⊆ Vα . 3 between P(µ) and some ν in Vκ depends on the truth of the Replacement schema in Vκ . In fact, we state without a proof that if κ is a regular cardinal, then all axioms of ZFC, except possibly some instances of the Schema of Replacement, are true in hVκ , ∈i. Lemma 2.3 motivates the following definitions: Definition 2.4 We say that a cardinal κ > ω is weakly inaccessible if it is regular and limit. Definition 2.5 We say that a cardinal κ > ω is strongly inaccessible if it is regular and strongly limit. Theorem 2.6 (i) Every cardinal satisfying (**) is strongly inaccessible. (ii) Every strongly inaccessible cardinal satisfies (**). Proof. Ad (i) Obvious from the definitions and Lemma 2.3. Ad (ii) (Sketch). For every regular κ, hVκ , ∈i is a model of ZFC without Schema of Replacement (this is easy to check). Strong limitness is used to ensure that Replacement holds as well. Although it may not be immediately apparent, the weakly inaccessible cardinal is not weaker in terms of consistency strength than the strongly inaccessible cardinal. Let ∃κ ψw (κ) denote the sentence “there exists a weakly inaccessible cardinal”, and similarly for the strongly inaccessible ∃κ ψs (κ). Lemma 2.7 Con(ZFC + ∃κ ψw (κ)) ↔ Con(ZFC + ∃κ ψs (κ)). Proof. The more difficult direction is from left to right. Assume κ is weakly inaccessible. Let L be the universe of constructible set, defined by Gödel. We know that L satisfies ZFC and also GCH.7 It is immediate to see that in L, κ is strongly inaccessible because being a limit cardinal together with GCH implies the desired property of strong limitness. Therefore by Gödel’s theorem and Lemma 2.7 and Theorem 2.6(ii): Corollary 2.8 If ZFC is consistent, it does not prove the existence of a weakly inaccessible cardinal. One can also show that if ZFC is consistent, so is the theory ZFC + “there is no strongly inaccessible cardinal”, and that ZFC does not prove the implication CON(ZFC) → CON(ZFC+ “there is a strongly inaccessible cardinals”). Usually, when we talk about an inaccessible cardinal, we mean the strongly inaccessible, and assumption of existence of such a cardinal number is taken to be the first step in defining strong axioms of infinity. Thus we can reformulate: (**)r : (Strong Axiom of Infinity) There is a (strongly) inaccessible cardinal. 7 The Generalized Continuum Hypothesis which states that for every cardinal µ, 2µ = µ+ . 4 Remark 2.9 One might wonder if we can remove the assumption of regularity in (**) and have an equivalent notion. We cannot: if κ is strongly inaccessible, we can use the standard Löwenheim-Skolem argument to obtain an elementary substructure hVα , ∈i ≺ hVκ , ∈i with α > ω, and cf(α) = ω. Thus hVα , ∈i is a model of ZFC, but α is not regular.8 That is why we need to explicitly postulate the regularity of κ in (**). What about (***)? Well, it is not difficult to see that if κ < κ0 are two strongly inaccessible cardinals, then hVκ0 , ∈i is the desired model for (***). This is the case because hVκ0 , ∈i |= “κ is a strongly inaccessible cardinal”. Thus we may reformulate: (***)r : (Still stronger Axiom of Infinity) There are two (strongly) inaccessible cardinals. We could repeat this argument many times over, obtaining stronger and stronger axioms of infinity, in the hope of deciding more and more sentences. However, there is a limit to this recursion – so called Mahlo cardinals (see the next section). 2.2 Mahlo cardinals We include this cardinal only because it is in a sense a limit to the process of arriving to a large cardinal by a process “from below”. Recall that above we have considered one, two, three, and so on inaccessible cardinals. What if we consider ℵ0 or ℵ1 many of them? Do we get something yet stronger? We do, but there is a natural limit to this type of strengthening of the notion of a strong axiom of infinity. Consider an inaccessible cardinal κ such that κ is the κ-th inaccessible cardinal – clearly, it is a limit of the process of taking more and more inaccessible cardinals as far as their number is concerned. A Mahlo cardinal is even stronger (although it may not be apparent without a more detailed look which we will not provide here): Definition 2.10 A cardinal κ is a Mahlo cardinal if the set of inaccessible cardinals smaller than κ is stationary in κ.9 2.3 Analogies with ω We said above that Mahlo cardinals are a limit to arriving to larger cardinals “from below” by repeating certain continuous processes applied to inaccessible cardinals. But what other options do we have? Mathematicians found out that it is useful to consider the usual properties of ω and try to generalize them in a suitable fashion. In fact, inaccessible cardinals can be regarded in this way – either as a generalization 8 But it is a singular strong limit cardinal. We will not define the notion of a stationary set here; any standard set-theoretical textbook contains this definition. Roughly speaking, a set is stationary in κ if it intersects every continuous enumeration of unboundedly many elements below κ. In particular, every stationary subset of a regular cardinal κ has size κ. 9 5 of the concept of a “model” for a given theory (see above in (**) and (***)), or combinatorially – notice that ω itself is regular and strong limit, i.e. no finite subset of ω is cofinal in ω and ∀n < ω 2n < ω. We generalize10 three other properties of ω:11 (C) ω is compact in the sense of the compactness theorem for the first-order predicate logic. (M) There is a two-valued non-trivial measure on ω, i.e. a non-principal ultrafilter on ω. This measure is ω-complete: for every finite number of elements in the ultrafilter, their intersection is still in the ultrafilter. (R) The Ramsey property holds for ω: ω → (ω)rk , for r, k < ω. Compactness (C). The classical predicate calculus satisfies compactness: for every language and for every set of formulas A (of arbitrary size) in that language if every finite subset B ⊆ A has a model, so does A. In order to generalize this property, we consider an extension of the classical logic denoted as Lκ,κ , where κ is a regular cardinal, as follows. A language in Lκ,κ can have up to κ many variables and an unlimited number of non-logical symbols (functions, constants, predicates). We also allow conjunctions and disjunctions of length < κ and quantifications over < κ many variables.12 The classical logic can be denoted as Lω,ω under this notation. Now we can formulate the generalization of the compactness theorem in two ways: (wC) κ > ω is called weakly compact iff whenever A is any collection of sentences in Lκ,κ with at most κ many non-logical symbols if every B ⊆ A of size < κ has a model, so does A. (sC) κ > ω is called strongly compact iff whenever A is any collection of sentences in Lκ,κ if every B ⊆ A of size < κ has a model, so does A. We will discuss the relationship between (wC) and (sC) later in the text. Measure (M). One can find a non-principal ultrafilter on ω, i.e. a set U ⊆ P(ω) such that for all A, B subsets of ω: (i) (ii) (iii) (iv) If A ∈ U and A ⊆ B, then B ∈ U . If A, B ∈ U , then A ∩ B ∈ U . For no n < ω, {n} ∈ U . For all A, either A ∈ U or ω \ A ∈ U . Note that by induction, (ii) implies that if A0 , . . . , An are sets in U for n < ω, then their intersection is in U – this property can be called ω-completeness to emphasize the analogy with κ-completeness for a cardinal κ > ω introduced below. U is nonprincipal because it is not generated by a single number (property (iii)); (iii) together with other properties implies that every set A ∈ U is infinite. 10 We assume AC, the Axiom of Choice, in formulating these generalizations. Note that a priori there is no guarantee that we get anything like a large cardinal in this fashion; the generalization may turn out to be mathematically trivial and uninteresting. The fact that we do get large cardinals seems to indicate that these generalizations are mathematically relevant. 12 For instance “∃β<α xβ ϕ”, α < κ, quantifies over α-many variables in ϕ. 11 6 (M) κ > ω is called measurable iff there is a κ-complete non-principal ultrafilter U on κ: (i) If A ∈ U and A ⊆ B, then B ∈ U . T (ii) If µ < κ, and {Aξ | ξ < µ} are sets in U , then ξ<µ Aξ is in U . (iii) For no ξ < κ, {ξ} ∈ U . (iv) For all A, either A ∈ U or κ \ A ∈ U . Such an ultrafilter U is often called a measure because it “measures” subsets of κ by a two-valued κ-complete measure: if A ∈ U , then measure of A is 1, if A ∈ 6 U, then its measure is 0. Ramsey partitions (R). Let f be a function from [ω]r to k, where [ω]r denotes the set of all subsets of ω with exactly r elements, and k = {0, . . . , k − 1} is a set of size k (r ≥ 1 and k ≥ 2 to avoid trivialities). Definition 2.11 We say that A ⊆ ω is homogeneous for f : [ω]r → k if |rng(f [A]r )| = 1. Ramsey proved in 1930 that for any such f there is an infinite homogeneous subset, in the arrow notation: ω → (ω)rk , for r, k < ω. The argument is by induction on r, and the nontrivial step is to show ω → (ω)22 . This we read that for any partition of two-element subsets of ω to two sets we can find an infinite homogeneous set. We therefore attempt to generalize: (wR) A cardinal κ > ω is called weakly Ramsey if κ → (κ)22 , i.e. for every partition of two-element subsets of κ to two sets we can find a homogeneous set of size κ. We later learn that this generalization is not getting us a new concept, so we will need to strengthen it. That is why we call this property (wR) and not (R). See the next section for the now standard definition of the Ramsey cardinal. 2.4 Compact, measurable, and Ramsey cardinals As we mentioned above, there is a priori no guarantee that the cardinals defined above under (wC), (sC), (M), and (wR) are even inaccessible. However, as it turns out, they are not only inaccessible but even Mahlo. By way of illustration, we show that a measurable cardinal κ is inaccessible. Theorem 2.12 Every measurable cardinal is inaccessible. Proof. Let U be a non-principal κ-complete ultrafilter witnessing measurability of κ. First notice that by κ-completeness and non-principality of U , all elements in U have size κ. κ is clearly regular, otherwise if {ξα | α < cf(κ)} is cofinal in κ for cf(κ) < κ, 7 S then while for each α < cf(κ), ξα 6∈ U , α<cf(κ) ξα = κ ∈ U , contradicting the κ-completeness of U .13 As regards strong limitness of κ, assume for contradiction that for some λ < κ, we have 2λ ≥ κ, and let f : κ → P(λ) be an injection. For a fixed α < λ, we can consider two subsets of κ given by f : X0α = {ξ < κ | α ∈ f (ξ)} and X1α = {ξ < κ | α 6∈ f (ξ)}. For each α < λ, exactly one of the two setsTX0α and X1α is in U ; let us denote this set as X α . By κ-completeness of U , X = α<λ X α must be in U . However X can have at most one element since f is an injection – if ξ 6= ζ are in X, then f (ξ) 6= f (ζ) and hence at some α < λ, ξ must be in X0α and ζ in X1α (or conversely). This contradicts the non-principality of U . It follows that κ is strong limit, and hence inaccessible. With nice combinatorial arguments, not always trivial ones, one can show that every strongly compact cardinal is measurable, every measurable is weakly compact, and every weakly compact is Mahlo, and every Mahlo is inaccessible. Thus disparate combinatorial notions gave rise to a linearly ordered scale of cardinals. What about the weakly Ramsey cardinal? With a little work, it can be shown that the definition (wR) is in fact equivalent to (wC). And so the classes of weakly compact cardinals and weakly Ramsey cardinals are the same. However, there is a way how to generalize the Ramsey property and obtain something stronger than a weakly compact cardinal: (R) A cardinal κ > ω is called Ramsey if κ → (κ)<ω 2 , i.e. for every partition of all finite subsets of κ to two sets we can find a homogeneous set A of size κ.14 Many questions concerning these cardinals are quite difficult. For instance, it has long been open (from 1930’s to 1960’s) whether the least inaccessible cardinal can be measurable. By a new method using elementary embeddings and ultrapowers developed by Scott, it was proved in the 1960’s that measurable cardinals are quite large – they can never by the least inaccessible, or the least weakly compact cardinal. In fact if κ is measurable, then it is the κ-th weakly inaccessible cardinal, and more. We will touch briefly on the method of elementary embeddings in Section 3.4. Finally, let us note that measurable cardinals were first used – before the introduction of the Cohen’s method of forcing – to argue for the consistency of the statement V 6= L, i.e. that there exists a non-constructible set. It was Scott [15] who showed in 1961 that if there exists a measurable cardinal, then V 6= L. Nowadays large cardinal which imply that V 6= L are called “large” large cardinals, while others are called “small” large cardinals. Inaccessible, Mahlo, and weakly compact cardinals are “small”, while Ramsey, measurable and strongly compact are “large”. 2.5 Motivation We showed that by a natural attempt to generalize properties which hold for ω, we arrive to interesting notions in set theory which form a linear scale, as regards the strength of the notions. This is often taken as a heuristic point in favour of the naturalness of the definitions. Not least because by the linearity, no two large 13 Notice that κ-completeness can be equivalently expressed as follows: whenever µ < κ and S {Xα | α < µ} are sets not in U , the the union α<µ Xα is not in U , either. 14 For every n < ω, |rng(f [A]n )| = 1. 8 cardinals are inconsistent together – so far, no large cardinal was found that prohibits the existence of some other large cardinals. The properties which can be generalized range from purely logical (such as the inaccessible cardinal witnessing (**), or (wC) and (sC)), to combinatorial (wR), (R) and measure-theoretic (M). On the downside, all these notions substantially increase the consistency strength of the relevant theories, thus increasing the risk of introducing a contradiction. It is conceivable, but not considered probable now, that ZFC is consistent, while ZFC + “there is an inaccessible” is not. Or that ZFC + “there is a weakly compact cardinal” is consistent while ZFC + “there is a measurable cardinals” is not. See Section 4 for more discussion on consistency strength. Such discussion are not of logical interest only. It can be shown for instance that a certain weakening of the GCH, denoted as SCH,15 is provable in ZFC if ZFC refutes the existence of inaccessible cardinals.16 However, with some large cardinals around, SCH cannot be proved, and is therefore independent over the theory ZFC + certain large cardinals.17 3 Large cardinals and CH As we mentioned earlier, Gödel expressed his hopes that perhaps large cardinals could provide a natural extension of ZFC with interesting set-theoretical consequences such as determining the truth or falsity of CH. However, with the development of forcing on the way, Levy and Solovay in 1967 [11] came with arguments which are almost universal and show that truth or falsity of CH is unaffected by large cardinals. In the following sections, we assume some basic understanding of forcing on the reader’s part. 3.1 How to force CH and ¬CH A standard forcing notion to force CH, which we will denote as PCH , is composed of functions f : ω1 → 2 with the domain of f being at most countable. The extension is by reverse inclusion. PCH adds a new subset of ω1 , and collapses 2ω to ω1 in the process.18 PCH is called the Cohen forcing for adding a subset of ω1 . To force ¬CH, we will use ω2 copies of the Cohen forcing which adds a new subset of ω. Formally, a condition in P¬CH is a function with finite domain from ω2 to 2. One can show that P¬CH preserves cardinals and forces 2ω = ω2 . For our purposes notice that |PCH | = 2ω and |P¬CH | = ω2 , i.e. both forcings are quite small, certainly smaller than the first inaccessible. 15 GCH, the Generalized Continuum Hypothesis, states that for all cardinals κ, 2κ = κ+ . SCH, the Singular Cardinal Hypothesis, states that for all singular cardinals κ, 2κ = max(2cf(κ) , κ+ ). 16 This is true for larger cardinals than just inaccessibles. 17 For instance if ZFC + (sC) is consistent, so is ZFC + ¬SCH. 18 Notice that for every X ⊆ ω in V , it is dense in PCH that there exists some α < ω1 and p such that p restricted to [α, α + ω) is a characteristic function of X. The function defined in a generic extension which takes every α < ω1 to a subset of ω given by the restriction of the generic filter to [α, α + ω) is therefore onto (2ω )V . It follows that 2ω of V is collapsed to ω1 . 9 3.2 Inaccessible and Mahlo cardinals and CH We have defined above two “small” forcings which can force CH and ¬CH, PCH and P¬CH , respectively. As it turns out, for the preservation of large cardinals, it suffices to assume that the forcing in question has size < κ. Theorem 3.1 Let P be a forcing of size < κ and let G be a P -generic filter. Assume κ is inaccessible or Mahlo in V , then κ is inaccessible or Mahlo, respectively, in V [G]. In particular, these large cardinals do not decide CH. Proof. First notice that the theorem really implies that these large cardinals do not decide CH. Suppose for contradiction that one of these cardinals decides CH; for example let us assume that ZFC + “there is an inaccessible” proves CH. Assume there is an inaccessible and force with P¬CH ; we obtain a generic extension where ¬CH holds and there is still an inaccessible. This a contradiction. Let us now turn to the proof of the rest of the theorem. By standard forcing technique, if λ < κ is given, then there are just 2|P |λ -many nice names for subsets of λ in V [G]. Since µ = 2|P |λ < κ by inaccessibility of κ, we have V [G] |= 2λ ≤ µ < κ, i.e. κ remain inaccessible in V [G].19 To argue for preservation of Mahloness, we show as a lemma that forcings with κ-cc preserve stationarity of subsets of κ. Lemma 3.2 Assume Q is a forcing notion. If Q is κ-cc, then it preserves stationary subsets of κ. Proof. Let V [E] be a Q-generic extension and S stationary subset. We wish to show that S is still stationary in V [E]: that is, we need to show that if C ∈ V [E] is closed unbounded, then S ∩ C 6= ∅. Fix a closed unbounded C and let p ∈ E force this: p Ċ is a closed unbounded subset of κ̌. Denote D = {ξ < κ | p ξˇ ∈ Ċ}. Note that D ⊆ C and D ∈ V . We prove that D is a closed unbounded subset of κ. Now the claim follows because D ∈ V , and so D ∩ S 6= ∅. To prove D is closed unbounded, it suffices to argue that it is unbounded (closure is easy). Let α < κ be given. By induction construct for each n < ω a maximal antichain An = hqnξ | ξ < αn i of elements below p and an increasing sequence of ordinals hβnξ | ξ < αn i, where αn < κ (this is possible by κ-cc), such that: (a) (b) (c) (d) 19 β00 ≥ α; for each n, hβnξ | ξ < αn i is strictly increasing; if m < n then all elements in the βn -sequence are above the βm -sequence; qnξ β̌nξ ∈ Ċ. Note also that all cardinals ≥ |P |+ , and hence also µ, remain cardinals in V [G]. 10 Since for every n < ω, αn < κ, S ξ n<ω {βn | ξ < αn } is bounded in κ. We show that δ = sup{βnξ | n < ω, ξ < αn } is in D, that is p δ̌ ∈ Ċ. By forcing theorems, it suffices to show that whenever F is a Q-generic and p ∈ F , then δ ∈ Ċ F . Since each An is maximal below p, F ∩ An is non-empty for each n < ω. It follows that there is a sequence hqn | n < ωi of conditions in F which force that elements of Ċ are unbounded below δ. Hence δ ∈ Ċ F as required. Since our forcing P has size < κ, it certainly has the κ-cc, and therefore the set of regular cardinals below κ is still stationary in V [G]. That is κ is still Mahlo in V [G]. 3.3 Weakly compact and measurable cardinals and CH By way of example, we show that if P has size < κ, then κ is still weakly compact or measurable in V [G] if it was weakly compact or measurable, respectively, in V . In Theorems 3.3 and 3.4 we will give direct arguments, while in Section 3.4 we will put large cardinals into a more general picture so that we can formulate a uniform approach to preservation of large cardinals. Theorem 3.3 Assume κ is weakly compact in V and P has size < κ, and G is P -generic. Then κ is weakly compact in V [G]. Proof. As a fact we state that κ is weakly compact iff (3.1) κ → (κ)nλ , for every n < ω, λ < κ. Let us fix in V [G] a function f : [κ]2 → 2; it suffices to find in V [G] a homogeneous set X ⊆ κ of size κ. By Forcing theorem, there is p ∈ P such that p f˙ : [κ]2 → 2. Define back in V , h : [κ]2 → P(P × 2) by h(s) = {hq, ii | q ≤ p & q f˙(š) = i}. Since |P(P × 2)| < κ, we can apply (3.1) and find a homogeneous set X ⊆ κ for the function h. We claim that p X̌ is homogeneous for f˙, or equivalently X is homogeneous for f in V [G]. The homogeneity of X for h means that for all s ∈ [X]2 , h(s) is equal to some fixed set of the form A = {hq, ii | q ≤ p & q f˙(š0 ) = i}, for some s0 ∈ [κ]2 . Notice that because p forces that f˙ is a function, there can be no “contradictory pairs” hq, 0i and hq, 1i in A; that is for each q ≤ p occurring at the first coordinate of a pair in 11 A there is unique i(q) such that hq, i(q)i is in A. Assume F is any P -generic with p ∈ F . For each s ∈ [X]2 , there is some q(s) ∈ F such that hq(s), i(q(s))i is in A. If s1 , s2 are in [X]2 , then q(s1 ) and q(s2 ) are compatible in F by some r which thus decides both f˙(š1 ) and f˙(š2 ); furthermore, there is a unique i(r) such that hr, i(r)i is in A and so i(q(s1 )) = i(q(s2 )) = i(r). This proves that p forces that X is homogeneous for f˙. Theorem 3.4 Assume κ is measurable in V and P has size < κ. Then κ is measurable in V P . Proof. Let G be a P -generic filter, and let U be a κ-complete non-principal ultrafilter on κ in V . We will show that W = {A ⊆ κ | ∃B ∈ U B ⊆ A} is a κ-complete non-principal ultrafilter in V [G]; we say that W is generated by U . It is easy to show that W is non-principal, closed upwards, and κ-complete – that is that is a κ-complete non-principal filter: (i) Non-principality. Since U is non-principal and every element of W is above an element of U , the argument follows. (ii) κ-completeness. Fix in V [G] a sequence hAξ | ξ < λi, λ < κ of sets in W . By definition of W , there is p ∈ G such that (3.2) p “There exists a sequence hḂξ | ξ < λi of sets in U such that for every ξ < λ, Ḃξ ⊆ Ȧξ .” By |P | < κ, there is for each ξ and Ḃξ a family Bξ of size < κ of sets in U such that p Ḃξ ∈ B̌ξ . T By κ-completeness of U , for every ξ, bξ = Bξ is in U . TheTsequence hbξ | ξ < λi exists in V , and therefore by κ-completeness of U in V , ξ<λ bξ is in U . It follows \ \ p b̌ξ ⊆ Ȧξ for every ξ < λ and p b̌ξ ⊆ Ȧξ , ξ<λ and hence T ξ<λ Aξ ξ<λ is in W . It remains to show that W is an ultrafilter. Let Ẋ be a name for a subset of κ. For each p ∈ P , let Ap = {α < κ | p decides if α̌ ∈ Ẋ}. Notice that D = {p ∈ P | Ap ∈ U } is dense in P. This is because for each q ∈ P , [ Ap = κ p≤q and by κ-completeness of U and the fact that |P | < κ, there must be some p ≤ q such that Ap ∈ U . Let r be in D ∩ G – then Ar ∈ U where Ar can be written as a disjoint union of A0 = {α < κ | r α̌ ∈ Ẋ} and A1 = {α < κ | r α̌ 6∈ Ẋ}. If A0 ∈ U , then Ẋ G ∈ W , and if A1 ∈ U , then κ \ Ẋ G ∈ W . 12 3.4 A uniform approach So far we have argued that inaccessible, Mahlo, and measurable cardinals do not decide CH. This, per se, is not an argument that other large cardinals cannot behave differently in this respect – after all, every argument we gave was unique to a given large cardinal concept, and not directly generalizable to other large cardinals. As it turns out, however, many large cardinals can be formulated in terms of elementary embeddings, and there is a uniform approach which shows that such cardinals do not affect CH. Among the cardinals with definitions through elementary embeddings are weakly compact cardinals, measurable cardinals, strongly compact cardinals, supercompact cardinals and many others. Definition 3.5 Let M and N be two transitive classes. We say that j : M → N is an elementary embedding if for every formula and every n-tuple m0 , . . . , mn of elements in M , if ϕM (m0 , . . . , mn ), then ϕN (j(m0 ), . . . , j(mn )). The notation ϕM is defined recursively and subsists in replacing every occurrence of an unbounded quantifier Qx with Qx ∈ M . Note that M, N and j may be proper classes.20 We say that κ is a critical point of j : M → N if for all α < κ, j(α) = α, and j(κ) > κ. One can show that if j is not the identity it has a critical point which is always a regular uncountable cardinal in M . Theorem 3.6 The following are equivalent for a cardinal κ > ω: (i) κ is measurable. (ii) There is an elementary embedding j : V → M with critical point κ, where M is some transitive class. Proof. Ad (i)→(ii). (Sketch) A generalization due to Scott [15] of the ultrapower construction can be used to form the ultrapower of the whole universe V via a κcomplete ultrafilter U witnessing the measurability of κ. One can show that this construction is well defined and yields a proper class ultrapower model, denoted as UltU (V ). Since the U is ω1 -complete, one can further show that that the ultrapower is well-founded and can therefore be collapsed using the Mostowski collapsing function. Thus there is an elementary embedding j : V → UltU (V ) ∼ =M to a transitive isomorphic image of UltU (V ). κ-completeness of U is invoked to prove that j is the identity below κ, and j(κ) > κ+ . 20 There are some logical issues here because ZFC does not formalize satisfaction for proper classes, and hence one should be careful in saying that some ϕ holds in M , or that j is elementary. The relativization ϕM solves the issue to a certain extent, but it is not entirely optimal (for instance the property “j is elementary” is a schema of infinitely many sentences). Luckily, as always with issues like these, there are ways to make these concepts completely correct from the formal point of view. See for instance [4] for a nice discussion of approaches to formalizing large cardinal concepts which refer to elementary embeddings. 13 Ad (ii)→(i). Let j : V → M be elementary with critical point κ. Let us define U = {X ⊆ κ | κ ∈ j(X)}. We will show that U is a κ-complete non-principal ultrafilter. It is non-principal because for every α < κ, j({α}) = {α} and therefore {α} 6∈ U . κ-completeness follows by the following argument: if {Aξ | ξ < µ} are sets in U for µ < κ, then j({Aξ | ξ < µ}) = {j(Aξ ) | ξ < µ} because j(µ) = µ and therefore the j-image of the system {Aξ | ξ < µ} is just the system of the j-images of the individual sets. Therefore \ κ∈ j(Aξ ) ξ<µ and hence \ Aξ ∈ U. ξ<µ Thus U is a κ-complete non-principal filter. It remains to show that U is an ultrafilter – but this is easy: if X ⊆ κ is given, then κ = X ∪ (κ \ X), and so κ ∈ j(κ) = j(X) ∪ j(κ \ X) by elementarity. Hence κ ∈ j(X) or κ ∈ j(κ \ X). Notice that U is generated by a single element – κ. But it is not principal because κ is not in the range of j. If ξ is in the range of j, then any attempt to define an ultrafilter as we did ends up with a principal ultrafilter because the singleton of j −1 (ξ) would be in the filter. Conversely, if we defined our U with any other ξ in the interval [κ, j(κ)), we would get a non-principal κ-complete ultrafilter by an identical argument. The importance of U , as generated by κ, is that U is normal, but this goes beyond the scope of this paper. The above theorem provides a new tool to show that a measurable cardinal is preserved by forcing. It suffices to show that in a generic extension V [G], there exists an elementary embedding with critical point κ. The following lemma is very useful for finding elementary embeddings in the generic extensions. Lemma 3.7 (Silver) Assume j : M → N is an elementary embedding between transitive classes M, N . Let P ∈ M be a forcing notion and let G be P -generic over M .21 Assume further that H is j(P )-generic over N such that (3.3) {j(p) | p ∈ G} ⊆ H. Then there exists elementary embedding j ∗ : M [G] → N [H] such that: (i) j ∗ restricted to M is equal to j, (ii) j ∗ (G) = H. 21 This means that G meets every dense open set which is an element of M . 14 We call j ∗ a lifting of j to M [G]. Proof. We first show how to define j ∗ . Let x be an element of M [G] and let ẋ be a name for x so that ẋG = x. We set j ∗ (ẋG ) = (j(ẋ))H . This definition is correct because by elementarity j(ẋ) is a j(P )-name; further if ẏ is another name for x, ẏ G = ẋG = x, then there is some p ∈ G such that p ẏ = ẋ. By elementarity, j(p) j(ẋ) = j(ẏ). By (3.3), j(p) ∈ H and therefore (j(ẋ))H = (j(ẏ))H . j ∗ is elementary by the following implications: (3.4) ϕM [G] (x, . . .) → ∃p ∈ G p ϕ(ẋ, . . .) → ∃p ∈ G j(p) ϕ(j(ẋ), . . .) → ϕN [H] (j ∗ (x), . . .), where the last implication follows by (3.3). Ad (i). For x ∈ M , j ∗ (x) = (j(x̌))H = j(x), by the properties of the canonical name x̌. Ad (ii). Let ġ be a canonical name for the generic filter, i.e. a name which always interprets by the generic filter. Then ġ G = G, and j ∗ (G) = (j(ġ))H = H. Silver’s “lifting lemma” allows us to reprove Theorem 3.4 in a more straightforward way: Theorem 3.8 Assume κ is measurable in V , P has size < κ and G is P -generic. Then κ is measurable in V [G]. Proof. By Theorem 3.6, there is an embedding j : V → M with critical point κ (this embedding exists, i.e. is definable, in V ). Since j is the identity below κ, one can easily show that Vκ+1 = (Vκ+1 )M and j(x) = x for every x ∈ Vκ . In particular j(P ) = P because |P | < κ implies that we can assume P ∈ Vκ .22 It follows by Silver’s lemma, when we substitute G for H, that there exists a lifting j ∗ : V [G] → M [G], where j ∗ (G) = G. Since j ∗ is definable in V [G], it shows by Theorem 3.6 that κ is still measurable in V [G]. 22 If |P | < κ, then there is an isomorphic copy of P which is in Vκ . 15 3.5 Other large cardinals Many large cardinals can be formulated in terms of elementary embeddings and hence the proof in Theorem 3.8 can be straightforwardly generalized to argue that these cardinals do not decide CH. Here is a list of some more known large cardinals defined using elementary embeddings satisfying certain properties, where κ > ω: • κ is weakly compact iff κ is inaccessible and for every transitive model M of ZF without the powerset axiom such that κ ∈ M , M is closed under < κsequences and |M | = κ, there is an elementary embedding j : M → N , N transitive, with critical point κ. • κ is strongly compact iff for every γ > κ there is an elementary embedding j : V → M with critical point κ, j(κ) > γ, and for any X ⊆ M with |X| ≤ γ, there is a Y ∈ M such that Y ⊇ X and (|Y | < j(κ))M . • κ is supercompact iff for every γ > κ there is an elementary embedding j : V → M with critical point κ, j(κ) > γ, and γ M ⊆ M .23 • κ is strong iff for every γ > κ there is an elementary embedding j : V → M with critical point κ, j(κ) > γ, and Vγ ⊆ M . Even Ramsey cardinals can be formulated in terms of elementary embeddings, see for instance [13]. All the cardinals considered so far are linearly ordered in terms of strength: for instance every supercompact is strongly compact, and every strongly compact is strong. Note that by a celebrated result by Kunen [9], there can be, in ZFC, no cardinal κ such that there exists an elementary embedding j : V → V with critial point κ. This sets an upper bound on the large cardinal concept which we can consider.24 4 On the consistency strength Large cardinals are interesting set-theoretical objects with beautiful combinatorics and surprising connections among themselves; for instance many of these can be defined in apparently disparate ways – using elementary embedding, satisfaction in various structures, or by partition properties. However, this does not fully explain the willingness with which large cardinals are almost universally accepted by the set theoreticians. To explicate the wider role of large cardinals we need to introduce the notion of a consistency strength over ZFC. Definition 4.1 A statement σ in the language of set theory is stronger in terms of consistency then another statement σ 0 if CON(ZFC + σ) → CON(ZFC + σ 0 ). 23γ M ⊆ M is true if for every sequence of length γ of elements in M , the whole sequence is in M . This a non-trivial requirement because the sequence itself is in general only in V , and not in M. 24 Rather surprisingly, it is still open whether this limiting result holds in ZF. 16 We denote here this relation by σ 0 ≤c σ. Statements are called equiconsistent if CON(ZFC + σ) ↔ CON(ZFC + σ 0 ). For instance, GCH ≡c ¬CH ≡c V = L ≡c V 6= L ≡c ♦ ≡c “There are no ω1 -Souslin trees”.25 Moreover we have (4.5) CON(ZF − Axiom of Foundation ) → CON(ZFC + σ) for any σ from the class [GCH]≡c .26 Note that the relation of equiconsistency ≡c is an equivalence relation, and the relation ≤c is an ordering on the equivalence classes given by ≡c . What is the structure of this ordering? In principle, it might be highly non-linear. However, large cardinal concepts can be used to show that it is in fact mostly linear: for many combinatorial statements σ and σ 0 considered in practice, we either have σ ≤c σ 0 or σ 0 ≤c σ. The key here is that large cardinal concepts themselves are linearly ordered under ≤c , and very often one can show that a statement σ is equiconsistent with a certain large cardinal axiom. By way of example, considered the following three statements (see [7] for the definitions of the concepts mentioned): (A) (Over ZF) All sets of reals are Lebesgue measurable. (B) (Over ZFC) Every ω2 -tree has a cofinal branch. (C) (Over ZFC) SCH fails. A priori, they might be incomparable under ≤c ; however, one can prove: Theorem 4.2 (Solovay [18], Shelah [16]) (A) ≡c “there exists an inaccessible cardinal”. Theorem 4.3 (Mitchell [12]) (B) ≡c “there exists a weakly compact cardinal”. Theorem 4.4 (Mitchell [14], Gitik [3]) (C) ≡c “there exists a measurable cardinal of Mitchell order o(κ) = κ++ ”. Corollary 4.5 GCH <c (A) <c (B) <c (C). 25 26 This can be shown using Gödel’s class of constructible sets L, or by forcing. Notice that by (4.5), [GCH]≡c is equal to [ν]≡c for any ν such that ZFC ` ν. 17 The above theorems are proved using two complementary methods: (i) forcing over a model with the given large cardinal, and (ii) technique of inner models to find a large cardinal (in some model of set theory) from the given combinatorial statement. For instance Theorem 4.3 is proved by iterating a certain forcing notion (such as the Sacks forcing at ω) along κ, where κ is weakly compact: this gives (B) ≤c “there exists a weakly compact cardinal”. Conversely, one can show that if (B) holds, then ω2 of V is a weakly compact cardinal in L, and hence there is a model with weakly compact cardinal. This gives: “there exists a weakly compact cardinal” ≤c (B). Problems arise when the large cardinal in question is inconsistent with L (such as a measurable cardinal), then to obtain the consistence of the large cardinals, a generalization of L must be defined which allows large cardinals. This is the field of inner model theory. So far, inner models were devised for infinitely many Woodin cardinals (Woodin cardinals are much stronger than measurable cardinals in terms of consistency strength), but not – crucially – for strongly compact or supercompact cardinals. This inability to find suitable inner models for such large cardinals is one of the most pressing problems in current set theory. Because of this, the following is still open for a certain important combinatorial statement denoted as PFA (Proper Forcing Axiom):27 Open question. We know: PFA ≤c “there exists a supercompact cardinal”. Does the converse hold as well, i.e. is PFA equiconsistent with a supercompact cardinal? There is a general agreement that this is the case, but we cannot prove it.28 The following is also long open, probably for the similar reason as the case of PFA: Open question. By definition, every supercompact cardinal is strongly compact. We also know that κ can be measurable + strongly compact but not supercompact. However, we do not know, but consider probable: Are strongly compact and supercompact cardinals equiconsistent? 5 Conslusion Large cardinals considered in this article do not decide CH one way or another. In fact no commonly considered large cardinals decide CH, which can be shown by similar methods.29 However, notice that we cannot prove a statement such as “no large cardinal decides CH” because in this statement we quantify over a vague domain of “large cardinals” and hence such a statement is not in the language of set 27 PFA, a strengthening of MA – the Martin’s Axiom –, implies 2ω = ω2 and thus decides CH. However, PFA itself is not a large cardinal axiom in the strict sense. Also, PFA trivially implies 2ω > ω1 the way it is set up, so what is surprising is that it also implies 2ω ≤ ω2 , and not that it implies failure of CH. MA, on the other hand, is consistent with any reasonable value of 2ω > ω1 . 28 We do know that PFA implies consistency of many Woodin cardinals, and so PFA is sandwiched between “many Woodins” and “supercompact”. But this gap is quite substantial. 29 We consider κ to be large when it is at least inaccessible. If we drop this requirement, the situation is more complex. 18 theory. It may be, but it is not considered probable, that a new large cardinal will be devised which will be more susceptible to effects of small forcings. At present, no such cardinal is known. 19 References [1] In S. Feferman, J. Dawson, S. Kleene, G. Moore, and J. Van Heijenoort, editors, Kurt Gödel. Collected Works. Volume II. Oxford University Press, 1990. [2] Sy-David Friedman and Tatiana Arrigoni. Foundational implications of the inner model hypothesis. Annals of Pure and Applied Logic, 163:1360–1366, 2012. [3] Moti Gitik. The negation of singular cardinal hypothesis from o(κ) = κ++ . Annals of Pure and Applied Logic, 43:209–234, 1989. [4] Joel David Hamkins, Greg Kirmayer, and Norman Lewis Perlmutter. Generalizations of the Kunen inconsistency. To appear in Annals of Pure and Applied Logic. [5] David Hilbert. Mathematical problems. Bull. Amer. Math. Soc., 8:437–479, 1902. [6] Radek Honzik. Quick guide to independence results in set theory. Miscellanea Logica, 7:89–131, 2009. [7] Tomáš Jech. Set Theory. Springer, 2003. [8] Akihiro Kanamori. The Higher Infinite. Springer, 2003. [9] K. Kunen. Elementary embeddings and infinitary combinatorics. The Journal of Symbolic Logic, 407–413:21–46, 1971. [10] Kenneth Kunen. Set Theory: An Introduction to Independence Proofs. North Holland, 1980. [11] Azriel Levy and Robert M. Solovay. Measurable cardinals and the continuum hypothesis. Israel Journal of Mathematics, 5:234–248, 1967. [12] W. J. Mitchell. Aronszajn trees and the independence of the transfer property. Annals of Mathematical Logic, 5:21–46, 1972/1973. [13] W. J. Mitchell. Ramsey cardinals and constructibility. The Journal of Symbolic Logic, 44(2):260–266, 1979. [14] William J. Mitchell. The core model for sequences of measures. I. Math. Proc. Camb. Phil. Soc., 95:229–260, 1984. [15] Dana S. Scott. Measurable cardinals and constructible sets. l’Académie Polonaise des Sciences, 9:521–524, 1961. Bulletin de [16] S. Shelah. Can you take Solovay’s inaccessible away? Israel Journal of Mathematics, 48:1–47, 1984. [17] Antonı́n Sochor. Klasická matematická logika. Karolinum, 2001. [18] Robert M. Solovay. A model of set theory in which every set of reals is Lebesgue measurable. Annals of Mathematics, 92:1–56, 1970. 20