Download Correlation and Causation

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Philosophical Devices
Week 9 – Correlation & Causation
--------------------------------------------------------------------------------------------------------PROBABILISTIC INDEPENDENCE & DEPENDENCE
 Independence: p is probabilistically independent of q iffdf assuming q doesn’t alter the
probability of p – i.e. iff Pr(p|q) = Pr(p)
Now, we know that, by the definition of conditional probability,
Pr(p|q) = Pr(p & q)/Pr(q)
Further, if p is probabilistically independent of q, we also know that
Pr(p|q)= Pr(p)
By the transitivity of identity (i.e. because the relation picked out by ‘=’ is transitive), we
therefore can conclude that
Pr(p) = Pr(p & q)/Pr(q)
Multiplying both sides by Pr(q) gives us
Pr(p)  Pr(q) = Pr(p & q)
This is a necessary condition for probabilistic independence: any p and any q that satisfy the
above are probabilistically independent of one another.
Further, because order is irrelevant for both conjunction and multiplication, this shows that
independence is symmetric – i.e. if p is probabilistically independent of q, then q is
probabilistically independent of p!
Examples of probabilistically independent propositions:
1.
2.
3.

Dependence: p is probabilistically dependent on q iffdf assuming q does alter the
probability of p – i.e. iff Pr(p|q) ≠ Pr(p)

Positive Dependence: p and q are positively probabilistically dependent iffdf Pr(p & q)
> Pr(p)  Pr(q)
 Only occurs if either assuming p raises the likelihood of q or assuming q raises
the likelihood of p – i.e. either Pr(q|p)>Pr(q) or Pr(p|q)>Pr(p)

Negative Dependence: p and q are negatively probabilistically dependent iffdf Pr(p &
q) < Pr(p)  Pr(q)
 Only occurs if either assuming q decreases the likelihood of p or assuming p
decreases the likelihood of q – i.e. either Pr(q|p)<Pr(q) or Pr(p|q)<Pr(p)
Positive dependent propositions:
1.
2.
Negative dependent propositions:
1.
2.
Philosophical Devices
Week 10
2
Correlation & Causation
3.
3.
--------------------------------------------------------------------------------------------------------CORRELATION & CAUSATION

Correlations are objective probabilistic dependences between distinct properties of
individuals
(Note: an interesting side issue here is what sort of thing properties are, and whether an
individual’s possessing a property is the sort of thing to which we can legitimately attach an
objective chance value)

Positive correlation: Two properties Φ and  are positively correlated iffdf there is a
positive probabilistic dependence between their possession – i.e., letting ‘F’ be the
event of some individual possessing Φ and ‘G’ be some individual possessing ,
Pr(F|G)>Pr(F)
The implication of a correlation is that one property occurs more frequently when the other is
present. The same holds the other way too! (Question: why?)

Negative correlation: Two properties Φ and  are negatively correlated iffdf there is a
negative probabilistic dependence between their possession – i.e., letting ‘F’ be the
event of some individual possessing Φ and ‘G’ be some individual possessing ,
Pr(F|G)<Pr(F)
A frequently used (and correct) truism: correlation is not causation
Now, amongst normal humans, there is a clear correlation between feeling nauseous the next
day and drinking to excess tonight. And this correlation can be explained by a causal
connection between the two: tonight’s heavy partying causes tomorrow’s nausea.
There is also (again, amongst normal humans) a correlation between nauseous and feeling
feverish, but this correlation isn’t one of causation – the nausea doesn’t cause the fever, nor
the fever the nausea. Instead, both are symptoms of some common cause (e.g. having eaten
spoiled food, having drunk to excess)
Examples of correlation without causation:
1.
2.
3.
So, correlation doesn’t entail causation. However, arguably, correlation does imply a weaker,
disjuctive claim:
(CC) If A is correlated to B, then either (i) A causes B, (ii) B causes A, or (iii) there is
some C which causes both A and B
The support for this isn’t iron-clad, but still quite intuitive: if (CC) weren’t true, then the
correlation would be a startlingly odd coincidence. (Note that the credence one should attach
to the truth of (CC) is inversely proportional to the chances of there being such a coincidence,
Philosophical Devices
Week 10
3
Correlation & Causation
such that the greater likelihood that the correlation is a coincidence, the less credence we
should put in (CC)).
Better, once we grant that something like (CC) is true, we can use correlations to tell us
something about causation – namely, if we can rule out the possibility of a common cause,
then we can infer a direct causal connection between the correlated properties, and if we rule
out the possibility of a direct causal connection, then we can infer a common cause!
--------------------------------------------------------------------------------------------------------TRACKING DOWN CAUSES?
Common causes tend to ‘screen off’ correlations between their joint effects, such that the
correlation disappears once we control for the common cause.
Let’s go back to our above example. There is a correlation between A (being nauseous) and B
(feeling feverish), such that Pr(A|B)>Pr(A).
Now, to see whether C (having eaten bad Chinese food) is a common cause for both, we can
look separately at those people who have eaten bad Chinese and those who haven’t. If, when
we look at these two groups, we discover that
(i) in cases where people ate bad Chinese, adding the fact that they feel feverish
doesn’t make nausea more likely – i.e. Pr(A|B & C) = Pr(A|C)
(ii) in cases where people didn’t eat bad Chinese, adding the fact that they feel
feverish doesn’t make nausea more likely – i.e. Pr(A|B & ¬C) = Pr(A|¬C)
then we know that eating bad Chinese ‘screens off’ the nausea-fever correlation. This is
pretty good evidence that it is a common cause for our initial correlation.
We say a correlation is ‘spurious’ when it doesn’t correspond to any direct causal connection,
and the common cause is called a ‘confounding’ property.
Suppose that our efforts at screening off fail. That doesn’t mean we’ve shown that there is a
direct causal connection between the correlates!

The Easy Way Forwards: random experiment. Pick a sample group, then arrange for a
random subset of this group to ‘have’ the putative cause (this is the test group), and
some to not (the control group).
- Randomization ensures that the cause is not systematically correlated with any
other potentially causally potent properties
- Any resulting correlation implies a genuine causal connection!
- Variation: random clinical trial. Take a group of people who suffer from some
disease, then randomly select a subset to receive a potential treatment (test group),
while the rest do not (control group)
- Lurking problem: statistical fluctuations.
o Solution: bigger sample sizes!

The Hard Way Forwards: survey everything. Check all confounding properties that
could possibly be responsible for a spurious correlation, show that none of these
screen off.
Philosophical Devices
Week 10
4
Correlation & Causation
- Requires lots & lots of work
- Stronger than randomization?
--------------------------------------------------------------------------------------------------------SIMPSON’S PARADOX
Suppose a University advertises positions in the Department of History and in the Department of Geography.
Five men apply for the positions in History and one is hired, and eight women apply and two are hired. The
success rate for men is twenty percent, and the success rate for women is twenty-five percent. The History
Department has favoured women over men. In the Geography Department eight men apply and six are hired,
and five women apply and four are hired. The success rate for men is seventy-five percent and for women it is
eighty percent. The Geography Department has favoured women over men. Yet across the University as a whole
13 men and 13 women applied for jobs, and 7 men and 6 women were hired. The success rate for male
applicants is greater than the success rate for female applicants. (Bickle et al., 1975)
Explanation: more women are applying for jobs that are
harder to get. Of the women applying for jobs, more are
applying for jobs in History than in Geography, but
History hired only 3 out of 13 applicants, whereas
Geography hired 10 out of 13 applicants. Hence the
success rate was much higher in Geography, where there were more male applicants.
Men
History
1/5 <
Geography 6/8 <
University 7/13 >
Women
2/8
4/5
6/13