Download age (years) cancer deaths/year

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Genome (book) wikipedia , lookup

Cancer epigenetics wikipedia , lookup

RNA-Seq wikipedia , lookup

BRCA mutation wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Mutagen wikipedia , lookup

Mir-92 microRNA precursor family wikipedia , lookup

NEDD9 wikipedia , lookup

Oncogenomics wikipedia , lookup

Transcript
Biological physics & disease: A simple model for cancer mortality
1
Introduction
So far, this course has been mostly on understanding how living organisms work. Living organisms are amazing
things, and as we one of them, it is natural to wonder how our bodies work. However, much of the research
in biology is biomedical, and is less aimed at understanding basic biology than understanding disease, and
how best to treat it. In this part of the course, we will look a simple model of one of the two big killers in the
Western world: Cancer. The other big killer is heart diseases and other circulation diseases, such as strokes.
Most biomedical research is at least partly motivated by diseases, and these diseases are why it is relatively
generously (compared to most of physics) funded. This funding means that many jobs for scientists are in
biomedical research into disease.
2
Mortality data for cancer
cancer deaths/year
Data for total cancer mortality (i.e., deaths) in the UK, is plotted in Fig. 1. The basic observation is that
overall, cancer is very rare in children and young adults, but unfortunately the mortality rate rapidly increases
with age. Also, the rate is somewhat higher in men than in women.
0.040
male
0.035
female
0.030
0.025
0.020
0.015
0.010
0.005
0.0000 10 20 30 40 50 60 70 80 90
age (years)
Figure 1:
Cancer mortality in the UK. The plot is from data from Cancer Research UK,
http://www.cancerresearchuk.org/health-professional/cancer-statistics/mortality/age,
Accessed October 2016. Data is from the Office for National Statistics. The two curves show male and female
mortality (death) rates per year, as a function of age.
1
Figure 2: Lung cancer mortality in: people who smoke and continue to smoke; people who smoke but then
stop at the ages indicated; and non-smokers — people who never smoke. Plot by J Peto, British Cancer
Journal 104 (2011), p 389. Data from Halpern et al. (1993). Note that the rate in non-smokers is very
low, much less than 1 in 1000 even at 70 years of age (i.e., hC (t = 70 years) << 10−3 ), while the rate of
those who continue to smoke reaches hc (t = 70 years) ≈ 7 × 10−3 . The intermediate lines are for those who
smoke until they reach the specified range of years, 40–49, etc. Note that almost all smokers start smoking
at approximately the same time, as teenagers.
Brief Introduction to Cancer Our bodies are made of many trillions of cells. Cell growth and division,
i.e., one cell growing and then dividing to produce two daughter cells, is essential. It is required to make
our bodies of trillions of cells from one cell, the fertilised egg. The fertilised egg cell divides into two
daughter cells, which then divide again to form four cells, etc. Cell growth and division is also required
in adults as the cells in many of our tissues constantly turn over, i.e., cells die and are replaced. For
example, in our colons (bowels) the environment is tough on cells, so the cells facing inwards only live for
around 5 days before dying and so need to be constantly replaced.
So cell division is essential to our bodies developing and functioning. Normally it is very tightly
controlled, by complex mechanisms involving many genes. Cells only divide to produce more cells when
needed, e.g., to replace dead cells in the colon. However, if the complex mechanisms that regulate growth
and division are damaged, a set of cells may undergo uncontrolled cell growth. If this growth causes the
cells to invade and damage tissues, this is called cancer. Cancer is currently responsible for about 30% of
deaths in the UK. It and circulatory diseases (heart attacks, strokes etc) are the two big killers.
2.1
Smoking and lung cancer: Why if you smoke, you have already harmed yourself and
so should quit today to avoid further damage
Data for the lung-cancer mortality of smokers, non-smokers and those who smoke for a period, then quit, are
all shown in Fig. 2. Smoking greatly increases the probability of getting lung cancer, note that the death rate
for non-smokers (bottom curve) is very low, much lower than that of smokers who never quit (top curve).
But note the fact that if you smoke for say 35 years and give up, then after you have given up, you are
still much more likely to get lung cancer than if you never smoked. Almost all smokers start in their teens so
the 50–54 curve is for those who smoked for around 35 years.
Smoking permanently and irreversibly damages cells and so increases the cancer risk permanently. But
2
this increased risk is smaller the sooner the smoker stops, so it is worthwhile stopping smoking as early as
possible.
3
Cancer typically develops as a result of several mutations, not just a
single mutation
There is a large amount of evidence pointing to the fact that many cancers are the result of not one but
several mutations in the genes that control cell growth and division, i.e., that cancer is a multistep not a
single step process. A key piece of evidence is the fact that most cancers are very rare in young people but
the rate increases rapidly in old age. I have plotted cancer mortality statistics in the UK in Fig. 1.
3.1
Multistep model of cancer mortality
To understand Figs. 1 and 2, we need a model. It will inevitably be over simplified, but should help us to
understand the data. First we need to define a mortality rate as a function of time, hC (t), which is what is
plotted in Figs. 1 and 2. The mortality rate hC (t) is defined as the rate at which people who have survived
to age t, then die from a particular cancer at age t.
3.1.1
Model definition
Our model is that death from cancer is due to a multistep process, with K random steps (step = gene
mutation), i.e., if at time t, K or more random steps have occurred so, then so has death by cancer. Each
step is assumed independent and to occur at the same constant rate r. So after a time t, the average number
of steps that have occurred is hni = rt.
As we have assumed that each step occurs at the same rate and is independent, the probability that
exactly n steps have occurred at a time t, is just given by the Poisson distribution, with the mean number of
steps that have occurred at time t being rt, i.e.,
p(n, hni) =
hni
(rt)n
exp(−hni) =
exp(−rt)
n!
n!
Then the probability of having died of cancer, PC , in this model, is simply the probability that K or more
steps have occurred
∞
∞
X
X
(rt)n
PC (t) =
p(n, hni) =
exp(−rt)
n!
n=K
n=K
This is just the same sum we studied in the context of night vision.
To keep the maths simple, we assume that cancer mortality for a specific cancer (we will consider colon
cancer below) is relatively small, i.e., PC (t) 1, because the mean number of mutations is small, i.e., rt 1.
If we write out the terms in the sum explicitly
PC (t) =
(rt)K
(rt)K+1
exp(−rt) +
exp(−rt) + · · ·
K!
(K + 1)!
Because rt 1, then exp(−rt) ' 1. Also (rt)K+1 (rt)K , so the first term is much larger than subsequent
terms. This means we can keep only the first term, and PC (t) simplifies to
PC (t) '
3
(rt)K
K!
As PC (t) is the probability of death by age t of a cancer, the mortality rate hC (t) is just its time derivative
hC (t) =
dPC (t)
1
'
rK tK−1
dt
(K − 1)!
rt << 1
i.e., a power law, with exponent K − 1 for a K-step process.
3.1.2
Comparison of model with cancer data
Now, if the process was one step, K = 1, and
hC (t) ' r
K=1
i.e., a constant. For a one-step model of cancer mortality, the death rate is approximately a constant. As
we can see in Figs. 1, 2 and 3, the rate is absolutely not a constant, it starts off very low then in old age,
increases rapidly. Typically, cancer is very rare in children and young adults, is more common in the middle
age, and becomes rapidly more likely in old age. So, the one step model is not consistent with what we know
of the most common cancers.
In Fig. 3 we show data for the mortality from colon cancer (cancer of the bowel) plus a fit of a power law
to that data. The best fit is obtained with a power-law exponent ν = 5.0, which corresponds to a K = 6 step
process. Note that the fit is quite good. Our simple model, with 6 steps, reproduces the data on colon-cancer
quite well. Both the data and the fit give very low mortality rates in young ages before rapidly increasing
with age.
3.1.3
Conclusion
Clearly our model is grossly simplified but it gives a function that fits the data for colon cancer reasonably
well. A multistep model is also consistent with ex-smokers having higher rates of lung cancer than those who
have never smoked. While they smoked, some mutations occured, so meaning that fewer additional mutations
are needed for lung cancer to develop. There is also independent genetic evidence that most cancers do have
multiple mutations not one mutation.
0.0020
fit ν =5.0
colon cancer data (males, NY)
hC (t)
0.0015
0.0010
0.0005
0.00000
20
40
60
age t (years)
80
100
Figure 3: Data for mortality from colon cancer of males (green circles) plus a power law fit (red curve). The
data is for New York state (years 2009 to 2013), and the power law fit has a best-fit exponent ν = 5.0. Within
our simple model this implies that K = 6, i.e., that 6 mutations need to occur for colon cancer to occur.
4