Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
handout/homework 2 - A Fregean functional semantics, part 1: functions, names, and predicates In the first handout we gave rough translations of the parts of speech of first order predicate logic in the following manner. (x) = “for all x,” (x) = “there exists an x such that,” (meaning, there exists at least one x such that) (A) = “It is not the case that, (A)” (A B) = “(If A, then B)” (A B) = “(A, or B)” (A B) = “(A, and B)” (Rx) = “x has the property R,” (Rxy) = “x is in the relation R to y” (Rxyz) = “w, x, y, and z are in the relation R” (Rxyzz1) = “x, y, z, and z1 are in the relation R” Etc. (one can do this for any finite number of variables or constants) We did not rigorously define a grammar for such a language though. Why do such a thing? Remember that Frege developed his language in the hope that one could formalize mathematical reasoning so that each step in a mathematical proof is shown to be clearly unproblematic. Then the hope was that difficult mathematical theorems could be shown to be provable from clear premises that are obviously true. To the extent that one can do these two things (provide a logic where each step in a proof is clearly unproblematic and also specify unproblematic premises), then one will have shown that the mathematical theory in question is not philosophically problematic. But this will only work if the grammar for the logic can itself be characterized such that the question of whether a sentence is a sentence of the logical language is itself clearly unproblematic. The limiting case of “unproblematic” here (and in the case of a system of proof) is if we can specify dumb mechanical procedures that determine whether a string of symbols is a sentence of the language or not (or a proper step in a proof, for that matter). This is always possible for a logical language if one can provide a recursive specification of the set of sentences in the language. Moreover, in order to come up with a precise notion of proof, it is absolutely essential that certain forms of ambiguity are prohibited. For example, consider the following English language sentence. I’m going to the store, and she’s buying smokes, or I’m watching T.V. This sentence is ambiguous. If we use parentheses to disambiguate, on the one hand it could mean “(I’m going to the store) and (she’s buying smokes or I’m watching T.V).” From this reading of the sentence we would know that I’m going to the store. If we parsed the sentence as “(I’m going to the store and she’s buying smokes) or (I’m watching T.V),” we wouldn’t know that I’m going to the store. 1 By formalizing the logical languages, we prohibit all such ambiguities. Vocabulary: All capital English letters (A, B, C. . . .Z), as well as all numerically subscripted capital Zs (Z1,Z2, Z3. . .), are predicates, each one of one or more places. The lower case English letters (a, b, c, d), as well as all numerically subscripted lower case ds (d1,d2, d3. . .), are proper names. The lower case English letters (x, y, z), as well as numerically subscripted lower case zs (z1,z2, z3. . .), are variables. , , , and are propositional connectives (respectively: negation, conditional, disjunction, and conjunction). and are quantifiers (respectively: universal and existential). ( and ) are parentheses. Formation rules: (1) If Φ is an n-place predicate and every one of α1. . . .αn is either a variables or proper name, then Φ(α1. . . .αn) is a sentence of L. (2) If is a sentence of L, then is a sentence of L. (3) If and are sentences of L, then ( is a sentence of L. (4) If and are sentences of L, then ( is a sentence of L. (5) If and are sentences of L, then ( is a sentence of L. (6) If is a wff and α is a propositional variable, then α( is a sentence of L. (7) If is a wff and α is a propositional variable, then α( is a sentence of L. (8) All and only the sentences of L of L are generated by the above 7 rules. If we were to spend the time checking, we would see that all of the logical sentences given in the previous handout can be constructed from the above vocabulary and rules. The logical tradition that stems from Frege and others is philosophically interesting for a number of reasons. First and foremost, philosophers have always been interested in codifying valid argumentation. This is because philosophy itself consists in using reason to produce arguments concerning the things that centrally concern us as humans. However, the Fregean revolution is of special interest to philosophers of language since Frege’s language was the first to be provably compositional, in that (when understood via Frege’s semantic theory) it satisfies Miller’s Thesis 2: The semantic value of a complex expression is determined by the semantic value of its parts.1 Remember that the “semantic value” of an expression is defined as “that feature of it which determines whether sentences in which it occurs are true or false.” When put together with Thesis 2, this means that the truth or falsity of a sentence is a function of properties of parts of that sentence. Part of Frege’s genius is that his semantic theory for his logic showed in detail how this is the case (and we’ll learn this in these handouts). Again though, why does this matter? From the 2 perspective of the philosophy of mathematics and logic, it matters because logically valid proofs must show how it is not logically possible for the premises to be true and the conclusions false, and this ends up requiring a theory of how the logical parts of speech (the logical operators above) contribute to the truth or falsity of sentences in which they occur. From a broader perspective, compositionality has even more interest. In Descartes’ Discourse on Method2 the actual argument given for the conclusion that the mind/soul is different from the physical brain involves language. Descartes argues that the brain behaves purely mechanistically, like a complicated machine. But since human language is not something a machine could do, this proves that the human mind is not purely mechanistic. Therefore the human mind is more than just the brain. And the reason Descartes thought that human language was non-mechanical was because of the massive creativity involved in language use. Most sentences we understand and speak have never been uttered before, and will never be uttered again. Yet, for the most part, without effort we manage to use and understand language. Descartes thought it impossible that a piece of biological clockwork (the brain) could do anything like this.3 One way to respond to Descartes is to argue that the linguistic creativity he noticed is not quite as creative as he thought. In particular, if there are a finite number of words, a finite number of syntactic principles by which those words and the resulting phrases can be combined, and a finite number of semantic principles governing how meaning of phrases and sentences is a function of those words and how they are combined, then it seems more plausible that a machine-like entity (assuming with Descartes that the brain is machine-like) could master language. But Frege’s syntactic requirements for his logical language combined with his form of compositionality (the way in which Thesis 2 is worked out in his semantic theory) actually ends up showing in detail a language with a finite number of words, a finite number of syntactic principles by which those words and the resulting phrases can be combined, and a finite number of semantic principles governing how meaning of phrases and sentences is a function of those words and how they are combined. So if one could plausibly argue that human languages are similar enough to languages like Frege’s, then one would have a response to Descartes. This is in part why so much contemporary philosophy of language has been a footnote to Frege. One more piece of the puzzle- to answer Descartes’ challenge, it must be the case that the meaning (that which we understand) of sentences is provably compositional. But even if Frege’s theory of semantic content works, we just have that truth or falsity is compositional. Remember that the semantic content of a sentence for Frege is just truth or falsity (this is Miller’s Thesis 1). Well, arguably, part of the understanding of a sentence is the understanding of what it is for that sentence to be true or false. But then if truth or falsity itself is a function of the semantic content of the expressions of sentences, a big part of our understanding will be grasp of the rules that determine how this works. Here we want to see how Frege’s language4 is compositional. Then we will look at Frege’s own argument that the semantic account for his language as a language of mathematical proof is insufficient. 3 In mathematics, when we talk about some things being determined by other things, we usually are talking about functions. So Thesis 2 could have said something like, “The semantic value of a complex expression is a function of the semantic value of its parts.” Unfortunately, at the time Frege was writing, mathematics did not have a well worked out theory of functions (that we do is probably in part due to Frege’s advances), so we have to reconstruct his views a little bit here. In particular, today mathematical functions can be understood either extensionally or intentionally (we will see in a few days how this very distinction follows from Frege’s insights). Intensionally, a function is a rule or procedure that takes you from one ordered group of values to another value. So consider the following two functions: f(x) = (x + 4) – 2, g(x) = (x +6) – 4. Intuitively, they are different functions because they give different procedures for solving them. So: f(5) = (5 + 4) – 2 9–2 7 g(5) = (5 + 6) – 4 11 – 4 7 So we can say that by the intensional notion of functions, f and g are different functions. However, there is clearly a sense in which they are the same functions. For any numerical input, f and g return the same values. By the extensional notion of functions, when this happens we say that the two functions are actually one and the same, that we have one function characterized differently. When we think this way, we characterize functions just in terms of the set of inputs and outputs, for one-place functions like f and g, this will be a set of ordered pairs. If f and g are defined on the natural numbers ({0,1,2,3,4. . . .) then we have: f = g = {<0,2>,<1,3>,<2,4>,<3,5>. . . .} So extensionally, functions can be understood as just a set of inputs and outputs. Frege’s theory of semantic value is extensional in this sense, which is why we have: Thesis 6: Functions are extensional: if function f and g give the extension, then f = g.5 When modern mathematicians are being clear they almost always treat functions extensionally, referring explicitly to rules or procedures when intensional functions are being compared. For Frege, what is amazing is that his basic insights into the semantics of his language can be presented entirely in terms of extensional functions. Again, the basic compositionality claim 4 (Thesis 2) can be understood as the claim that the truth or falsity of a sentence is entirely a function of the semantic value of the parts of that sentence, and (for the language presented above) one can present Frege’s theory entirely in terms of our modern extensional notion of function. Thus, we get: Thesis 7: The semantic value of a predicate is a first-level function from objects to truthvalues; the semantic value of a sentential connective is a first-level function from truthvalues to truth-values; the semantic value of a quantifier is a second-level function from concepts to truth-values. Function, function, function. Pretty cool. Of course this only works if one can actually specify the functions in question. But for the above language, we can. To do this we will consider part of the fragment of the above language we gave in the previous handout. That is, our language will be just like the one defined above, except that the only proper names are c, m, and d, and the only predicates are M, F, H, C, T, and O. For simplicity, we will also amend the above to be able to stipulate that M, F, H, and C are one-placed predicates, in that they can only bind to one proper name or variable at a time (e.g. Mc), and T and O are two place predicates (e.g. Tdc). Here we will be able to show how if one knows the semantic values of the names and predicates6 then one can determine whether sentences in which they occur are true or false. First we have: Thesis 4: The semantic value of a proper name is the object which it refers to or stands for.7 So let’s assume that the semantic value of c is Charlie, m is Mary, and d is Dan. We can write this functionally. sv(c) = Charlie sv(m) = Mary sv(d) = Dan Again, we are pretending here that Charlie, Mary, and Dan are actual objects in the world. The semantic values of the names c, m, and d are the things to which those names refer. Then, the semantic values of predicates are functions from objects to truth values. Again, since we understand these extensionally, we can treat one place predicates as ordered pairs. For simplicity’s sake let’s assume the concrete objects in the universe just consist in Charlie, Mary, Dan, and a short, canine hermaphrodite we can call Frank, but who has no name in the language (surely most things in the universe don’t have names in any language). sv(M) = {<Charlie, True>,<Dan, True>,<Frank, True>,<Mary, False>} sv(F) = {<Charlie, False>,<Dan, False>,<Frank, True>,<Mary, True>} sv(H) = {<Charlie, False>,<Dan, True>,<Frank, False>,<Mary, True>} sv(C) = {<Charlie, True>,<Dan, False>,<Frank, True>,<Mary, False>} 5 From this one can determine the semantic values of an atomic formula (sentence with no quantifiers or propositional variables) by applying the semantic value of the predicate (which is a function) to the semantic value of the names. That is sv(Md) is equal to sv(M)(sv(d)). Since the semantic value of d is Dan, and the semantic value of M takes Dan to true, we know that the semantic value of Md is True. We can write this as a derivation in the following manner. 1. sv(Md) is = sv(M)(sv(d)) 2. sv(M)(sv(d)) = sv(M)Dan 3. sv(M)Dan = True by Frege’s theory by the model by the model Likewise, we can derive the semantic value of Hc as follows. 1. sv(Hc) is = sv(H)(sv(c)) by Frege’s theory 2. sv(H)(sv(c)) = sv(H)Charlie by the model 3. sv(H)Charlie = False by the model ------------Excercise1 Write out the above kind of derivations Mc, Fd, Hf, and Cm. Note that each one will have three steps, of the form, 1. sv(Φα) is = sv(Φ)(sv(α)) 2. sv(Φ)(sv(α)) = sv(Φ)Γ 3. sv(Φ)Γ = Σ by Frege’s theory by the model by the model, Where Φ is a predicate of the formal language, α is a name of the formal language, Γ names one of the objects denoting the names, and Σ is True or False. ------------All of the mathematical functions discussed above have been one-place functions, that is functions that take in one argument to return a value. However, most interesting mathematical functions take more than one argument. For example the addition and multiplication functions take two arguments and return values. And just like extensional one-place functions can be understood in terms of ordered pairs, two-place functions can be understood in terms of ordered triplets (e.g. + = {<0,0,0>, <0,1,1>, <1,1,2>, <1,0,1>, <0,2,2>, <1,2,3>, <2,2,4>, <2,1,3>, <2,0,3>, <0,3,3>. . .}). In fact any n place extensional function (function that takes n number of inputs) can be represented as a set of n+1-tuples in this manner. From this we can see how n-place predicates are handled in the Fregean way. Let’s say that “T” intuitively means “is taller than” and that Dan is taller than Mary, who taller than Charlie, who is taller than Frank. Then, the semantic value for T is the following. sv(T) = {<Charlie, Charlie, False>,<Charlie, Dan, False>,<Charlie, Frank, True>,<Charlie, Mary, False>,<Dan, Charlie, True>,<Dan, Dan, False>,<Dan, Frank, True>,<Dan, Mary, True>,<Frank, Charlie, False>,<Frank, Dan, False>,<Frank, Frank, False>,<Frank, Mary, False>,<Mary, Charlie, True>,<Mary, Dan, False>,<Mary, Frank, True>,<Marie, Mary, False>} 6 Let’s say that “S” intuitively means “is older than” and that Mary is older than Dan, who is older than Frank, who is older than Charlie. Then, the semantic value for O is the following. sv(O) = {<Charlie, Charlie, False>,<Charlie, Dan, False>,<Charlie, Frank, False>,<Charlie, Mary, False>,<Dan, Charlie, True>,<Dan, Dan, False>,<Dan, Frank, True>,<Dan, Mary, False>,<Frank, Charlie, True>,<Frank, Dan, False>,<Frank, Frank, False>,<Frank, Mary, False>,<Mary, Charlie, True>,<Mary, Dan, True>,<Mary, Frank, True>,<Marie, Mary, False>} But then we can do derivations just like we did above. Consider. by Frege’s theory by the model by the model 1. sv(Ocd) is = sv(O)(sv(c),sv(d)) 2. sv(O)(sv(c),sv(d)) = sv(O)(Charlie,Dan) 3. sv(O)(Charlie,Dan) = False Thus, since sv(Φαβ) is determined by applying the semantic value of Φ to the ordered pair containing α and β’s semantic values, every such derivation will look like the following, 1. sv(Φαβ) is = sv(Φ)(sv(α),sv(β)) 2. sv(Φ)(sv(Φα),sv(Φβ)) = sv(Φ)(Σ,Ψ) 3. sv(Φ)(Σ,Ψ) = Γ by Frege’s theory by the model by the model, Where Φ is a two-place predicate of the language, α and β are names of the language, Σ is equal to the semantic value of α, Ψ is equal to the semantic value of β, and Γ is either truth or falsity. I realize that sv(Φαβ) is = sv(Φ)(sv(α),sv(β)) is a mouthful. Think about addition again. When we represent addition as a set of ordered three-tuples (+ = {<0,0,0>, <0,1,1>, <1,1,2>, <1,0,1>, <0,2,2>, <1,2,3>, <2,2,4>, <2,1,3>, <2,0,3>, <0,3,3>. . .}) what we are saying is that + is applied to any of the first two members uniquely yields the third member. So the fact that <0,1,1> is a member of the set consisting of addition is the fact that +(<0,1>) = 1. Likewise, the fact that <Charlie, Dan, False> is a member of sv(0) is the fact that sv(0)(Charlie, Dan) = False. That’s all we’re doing in the derivations above. ------------Exercise 2 Do the above type derivations and the model above to determine the truth or falsity of the English language sentences “Charlie is taller than Mary,” “Dan is taller than Frank,” “Frank is older than Charlie,” “Mary is older than Dan.” Your derivations must be of the same form as the above. ------------1 Miller, 11. [citation needed] Modern philosophy of language arguably begins with Descartes’ argument. 3 Incidentally, this was the reason Cartesians thought that animals did not have minds. Since language was the reason minds are different from mere brains, and animals did not possess language, it followed that animals did not possess minds. 2 7 [should note with reference to Frege’s Begrifschrift (sp?) earlier] It is important to note that the actual logical symbols Frege used are completely different from the symbols that have become standard in modern logic. But in the other important respects, the language is the same. 5 Miller, 16. Miller notes that this isn’t quite right as a sketch of Frege’s views, since understanding functions this way is post-Fregean. As I understand Frege, he explicitly only thought of functions intensionally. This being said, once we understand them extensionally we can understand the simplicity and systematicity of Frege’s semantic theory much more clearly. 6 Footnote to self- interesting reason “course of values” for reference of predicates undermines compositionality if you don’t have multiple domains- with extensional function reading you get the anti-extension too and hence the whole domain. Frege had a fixed domain so the difference didn’t matter. Some theorems here?) 7 Miller, 12. 4 8