Download Model-based source estimation during foodborne disease outbreaks

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Hospital-acquired infection wikipedia , lookup

Transmission (medicine) wikipedia , lookup

Surround optical-fiber immunoassay wikipedia , lookup

Infection wikipedia , lookup

Eradication of infectious diseases wikipedia , lookup

African trypanosomiasis wikipedia , lookup

Schistosomiasis wikipedia , lookup

Marburg virus disease wikipedia , lookup

Sociality and disease transmission wikipedia , lookup

Germ theory of disease wikipedia , lookup

Globalization and disease wikipedia , lookup

Infection control wikipedia , lookup

Foodborne illness wikipedia , lookup

Transcript
Model-based source estimation during
foodborne disease outbreaks
Juliane Manitz1 , Thomas Kneib1
1
Department of Statistics and Econometrics, University of Göttingen, Göttingen,
Germany
E-mail for correspondence: [email protected]
Abstract: The 2011 E. coli outbreak in Germany exposed the lack of timely
and efficient source detection as an integral part of mitigation strategies during
foodborne disease outbreaks. Conventional public health source detection procedures use case-control studies and tracings along the food shipping chain. Such
methods are typically very time-consuming and suffer from problems associated
with data collected from patient interviews, such as bias. We introduce a new
network theoretic method to estimate the spatial source of food-borne disease
outbreaks and similar dynamical contagion phenomena. Our method requires
only infection reports regularly collected by public health institutes and knowledge of the underlying food shipment network topology. We fit a hierarchical
Bayesian spatio-temporal model to the infection counts, which uses the shortest
path tree of the contaminated food shipping network as neighbourhood definition. Using Bayesian model comparison criteria, our method assignes an epicenter
plausibility to each outbreak source candidate. We test our method in a spatial
dynamic simulation model for foodborne diseases and specifically validate our
approach for the German E. coli outbreak in 2011.
Keywords: Complex network; Spatio-temporal model; foodborne disease.
1
Statement of the problem
Recently observed frequent foodborne disease outbreaks acutely demonstrated the need for timely and efficient source detection in the case of a
foodborne disease outbreak to public health bodies, risk assessment authorities, food industry, and society. Only efficient epidemic source detection
and timely removal of the contaminated product can prevent further disease
spread and impact on the population and economy.
Only in 66% of observed outbreaks, public health investigations were able to
find an evidence for the infection source (O’Brien et al., 2006). One reason is
the uncertain association between aetiology and food vehicle. Furthermore,
the multi-disciplinary nature of a foodborne disease outbreak investigation
task passes another major difficulty. It usually requires information from
many sources including interview-based data from case-control and cohort
2
Model-based source detection
studies as well as tracings along the food shipping chain. We eliminate this
source of bias by basing our new source estimation method only on the
topology of the underlying food shipping network and infection reports.
Our methodological developments were motivated during the 2011 EHEC/HUS outbreak in Germany, which has been the largest worldwide reported
E. coli outbreak regarding the number of severe HUS cases. The vast majority of the infections occurred in northern Germany, while other cases
were travel related (see Figure 1). The source in the Lower Saxonian district Uelzen as well as outbreak trend and sprouts as transmission vehicle
were hard to predict, because the outbreak was caused by a rare serotype
O104:H4 (Frank et al., 2011).
EHEC incidence in Germany week 21, 2011
EHEC incidence in Germany week 22 2011
(27,30]
(27,30]
(24,27]
(24,27]
(21,24]
(21,24]
(18,21]
(18,21]
(15,18]
(15,18]
(12,15]
(12,15]
(9,12]
(9,12]
(6,9]
(6,9]
(3,6]
(3,6]
(0,3]
(0,3]
FIGURE 1. EHEC incidence in Germany in 21st and 22nd calendar week, 2011
2
Deterministic source detection
First, we introduce the deterministic groundwork method for source detection during foodborne disease outbreaks. On the basis of these ideas, we
will develop the model-based source detection method in the next section.
We define a food shipping network G = (K, L) as a collection of nodes k ∈ K
connected by links L, where K 6= ∅ and L is a set of unordered pairs of
elements of K. In our context, nodes represent German districts while links
refer to their trade connections. Thus, the network by definition includes
all districts suspected to be the source of the outbreak, i.e. k0 ∈ K0 ⊆ K.
For simplicity, we construct the network using the gravity law of trade
(Anderson, 1979). Then, link weights are proportional to the connected
node population and inversely proportional to their Euclidian distance.
J. Manitz and T. Kneib
3
For outbreak source tracing, we define the effective distance, which combines the deterministic distance, measured as shortest path length, and
the corresponding path probability. In this way, we are able to reorganize
the spatial pattern of infection counts to a circular tree representation for
all source candidates k0 ∈ K0 as root. We minimize the effective network
distance of all potential sources to the median centre of reported epidemic
mass. Looking at the circular tree representation with the true source as
root node, it is easy to spot a circle-like pattern of infected nodes (see
Figure 2).
FIGURE 2. Deterministic source detection with application to EHEC/HUS data.
(A) German map with shortest path tree. (B) Circular shortest path tree representation with root in Uelzen. Color-coding corresponds to the aggregated EHEC
incidence during the first three outbreak weeks (19th-22nd calendar week 2011)
In extensive simulation studies, we used a novel spatial dynamic model
for foodborne diseases and showed high detection probabilities in a variety of scenarios. These simulated scenarios were also used to assess the
uncertainty of the estimated outbreak source through prediction with a
generalized linear model (McCullagh and Nelder, 1989).
3
Model-based source detection
A statistical method for source detection should be able to deal with different types of uncertainty. The shipping network is defined under uncertainty,
because its structures are highly adaptive to varying demand. During the
outbreak, public health institute investigations gain knowledge from riskoriented sampling. Moreover, the infectious disease counts show trend and
seasonality and suffer from under-reporting and reporting delay.
We assess this problem by a hierarchical Bayesian spatio-temporal model,
which distinguishes the regular background trend of sporadic cases and the
4
Model-based source detection
epidemic part given a specific source candidate using the network structure as neighbourhood definition. We fit this model for all potential source
candidates and derive a posterior plausibility for being the source causing
the given outbreak pattern.
We assume the number of infected ykt in district k at time t to be negative
Binomial distributed, i.e.
ykt |µkt , ν
log(µkt ) = ηkt
∼ NB(µkt , ν),
=
log(Ek ) + βt + sk|k0 + xk ,
t = 1, . . . , T, k ∈ K,
where θ kt = (β1 , . . . , βT , s1|k0 , . . . , sK|k0 , x1 , . . . , xK , ν). Thereby, βt describes the epidemic time trend, sk|k0 the dispersal of the contaminated
infection vehicle alog the food shipping chain, and xk the local dispersal
of sproradic cases. Furthermore, the model is rescaled by offset Ek , the
population in district k.
The typical epidemic curve can be adapted by a time trend prior, which
follows a random walk of order one, i.e.
βt |βt−1 , τβ ∼ N(βt−1 , τβ−1 )
Furthermore, we model the dispersal of the contaminated food item on a
shortest path tree with source candidate k0 ∈ K0 by
!
X wkl
1
sl ,
,
sk|k0 |sl , k 6= l, w, k0 , τs ∼ N
wk+
wk+ τs
k∼l
P
where wk+ = k wkl . The weights wkl represent the link weights in the
shortest path tree with root k0 .
For the regular background trend of sporadic cases, we assume local spatial
dispersal. Thus, we define a standard Besag model prior for xk
!
1
1 X
xl ,
,
xk |xl , k 6= l, τx ∼ N
nk
nk τx
k∼l
where k ∼ l represent links in the local neighborhood structure.
On another level, we plan to incorporate uncertainty about the food shipping network structure by assigning priors to the shortest path tree weights
wkl
wkl ∼ Ga(ν/2, ν/2).
The estimation of the model becomes very complex, but can be solved
elegantly using a strategy suggested by Brezger et al. (2007).
We fit this Bayesian model with epidemic dispersal along differing shortest
path trees. They represent the efficient food shipping network of a contaminated infection vehicle with root in a potential source candidates k0 ∈ K0 .
Comparing these model fits for different sources, we can assign a plausibility to each tested potential source candidate for being the true source of
the observed outbreak.
J. Manitz and T. Kneib
4
5
Conclusion
Altogether, the introduced model-based source detection method for foodborne disease outbreaks uses minimal data basis and introduces a networktheoretic approach for detection. Alternative approaches are time-consuming
and usually based on potentially biased patient interview data. The method
is designed to work for various indirectly transmitted infectious diseases,
but has as well the potential to detect the origin of various other propagtion patterns, such as the spread of technical innovations in agriculture or
delays in a railway system.
Acknowledgments: This research has been financially supported by grants
from the German Science Foundation (DFG), Research Training Group
1644 ’Scaling problems in Statistics’.
References
Anderson, J. (1979). A Theoretical Foundation for the Gravity Equation.
American Economic Review, 69, 106 – 116.
Brezger, A., Fahrmeir, L. and Hennerfeind, A. (2007). Adaptive Gaussian
Markov random fields with applications in human brain mapping.
Journal of the Royal Statistical Society: Series C (Applied Statistics),
56, 327 – 345.
Frank, C., et al. (2011). Epidemic Profile of Shiga-Toxin Producing Escherichia coli O104:H4 Outbreak in Germany. New England Journal
of Medicine, 365(19), 1771 – 1780.
Manitz, J., Kneib, T., Schlather, M., and Brockmann, D. (2013). Networkbased Source Detection for Foodborne Disease Epidemics Applied to
the German 2011 E. coli outbreak. Working paper.
McCullagh, P. and Nelder, J.A. (1989). Generalized linear models. London:
Chapman & Hall.
O’Brien, S.J., Gillespie, I.A., Sivanesan, M.A., Elson, R., Hughes, C., and
Adak, G.K. (2006). Publication bias in foodborne outbreaks of infectious intestinal disease and its implications for evidence-based food
policy. England and Wales 19922003. Epidemiology and Infection,
134, 667 – 674.