* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Probing noise in gene expression and protein production
Magnesium transporter wikipedia , lookup
Signal transduction wikipedia , lookup
Protein (nutrient) wikipedia , lookup
Protein phosphorylation wikipedia , lookup
Nuclear magnetic resonance spectroscopy of proteins wikipedia , lookup
Protein moonlighting wikipedia , lookup
Proteolysis wikipedia , lookup
PHYSICAL REVIEW E 80, 031916 共2009兲 Probing noise in gene expression and protein production Sandro Azaele,1 Jayanth R. Banavar,2 and Amos Maritan3,4 1 Department of Civil and Environmental Engineering, E-Quad, Princeton University, Princeton, New Jersey 08544, USA Department of Physics, The Pennsylvania State University, 104 Davey Laboratory, University Park, Pennsylvania 16802, USA 3 Dipartimento di Fisica “Galileo Galilei,” Università di Padova, via Marzolo 8, I-35131 Padova, Italy 4 INFN, via Marzolo 8, I-35131 Padova, Italy 共Received 30 January 2009; revised manuscript received 27 July 2009; published 24 September 2009兲 2 We derive exact solutions of simplified models for the temporal evolution of the protein concentration within a cell population arbitrarily far from the stationary state. We show that monitoring the dynamics can assist in modeling and understanding the nature of the noise in gene expression. We analyze the dispersion of the process, i.e., the ratio of the variance to the mean at arbitrary time, and introduce a measure, the fractional protein distribution, which can be used to probe the phase of transcription of DNA into mRNA. DOI: 10.1103/PhysRevE.80.031916 PACS number共s兲: 82.39.Rt, 02.50.⫺r, 87.17.Aa II. MODEL I. INTRODUCTION Advances in experimental techniques, which enable the direct observation of gene expression in individual cells, have demonstrated the importance of stochasticity in gene expression, the translation into proteins of the information encoded within DNA 关1–5兴. Such variability can lead to deleterious effects in cell function and cause diseases 关6兴. On the positive side, stochasticity in gene expression confers on cells the ability to be responsive to unexpected stresses and may augment growth rates of bacterial cells compared to homogeneous populations 关7兴. Disentangling the various contributions to production fluctuations is complicated by the recent finding that different stochastic processes yield the same response in the variance in protein abundance at stationarity 关8兴. A population of isogenic cells growing under the same environmental conditions can exhibit protein abundances that vary greatly from cell to cell. The sources of variability have been identified at multiple levels 关9–13兴, with transcription and translation playing a major role under certain circumstances 关14–16兴. The low concentration of reactants potentially has two important consequences: the first is that fluctuations around the mean can be large; the second is that the nature of the stochastic noise should be taken into account in some detail because one may not simply invoke the central limit theorem 关17兴, which leads to the universal and ubiquitous Gaussian noise. Thus, two genes expressed at the same average abundance can produce protein populations with different Fano factors F共t兲 = var关x共t兲兴 / 具x共t兲典, where var关x共t兲兴 and 具x共t兲典 are the variance and the mean of the protein concentration x共t兲, respectively 关18兴. We show here that two distinct models, one taking into account the detailed nature of the noise and the other following from an application of the central limit theorem, yield exactly the same stationary solution for the distribution of proteins in isogenic cells under the same environmental conditions. The applicability of the central limit theorem arises from the fact that the stochasticity in gene expression results from the large number of available components which entangle a lot of different mechanisms within a cell. The exact dynamical solution of these two simplified models demonstrates the value of monitoring the dynamics for understanding the nature of the noise in a cell. 1539-3755/2009/80共3兲/031916共8兲 We make the simplified assumption that the kinetics of gene expression can be described approximately by four rate constants: k1 and k2 are the transcription and translation rates, respectively, and ␥1 and ␥2 are the degradation rates for mRNA and proteins, respectively. It has been found experimentally that proteins are produced in bursts 关5,18–20兴 with an exponential distribution of the protein concentration produced in a given event. We will assume that transcription pulses are Poisson events and that the probability distribution that in a single event I ⬎ 0 proteins are produced, w共I兲, is ␥ approximated by w共I兲 = k21 e−共␥1/k2兲I, where k2 / ␥1 is the translation efficiency, i.e., the mean number of proteins produced in a given burst. Here we consider a simple model for the production of proteins without memory and aging of molecules. Using the specific burst distribution given above allows one to obtain the shape of the protein distribution even far from stationarity. Under these assumptions, the stochastic equation that governs the single-variable dynamics of gene expression can be written as ẋ共t兲 = ␦ − ␥2x共t兲 + ⌳共t兲. 共1兲 This pseudoequation describes the real-time stochastic evolution of gene expression through a deterministic part and a stochastic term ⌳共t兲, which will be defined later on. Here x is a continuous variable that represents the protein concentration within a cell. For the sake of generality, we have added the constant ␦, which can be incorporated into the average noise. However, although ␦ ⬎ 0 can account for important effects in ecological systems 关25兴, we will show in the following that it does not play a significant role under the burstlike production of proteins. In order to understand the nature of the noise for the gene expression case, let us consider the random variable, Ik, that is a measure of the number of proteins in the kth transcription event, where k = 1 , 2 , . . . , n. A key quantity of interest is n共t兲 Ik ⬅ ⌳共t兲⌬t, where n共t兲, the number of events in the time 兺k=1 interval 共t , t + ⌬t兲, is a random variable independent of both x and the Iks. As in the experiment, let us postulate that: 共i兲 the Iks are independent and identically distributed with exponential distribution; and 共ii兲 the probability of n events occurring during the time interval ⌬t is given by the Poisson distribu- 031916-1 ©2009 The American Physical Society PHYSICAL REVIEW E 80, 031916 共2009兲 AZAELE, BANAVAR, AND MARITAN tion qn共⌬t兲 = 共k1⌬t兲nexp共−k1⌬t兲 / n!. The distribution of ⌳共t兲 that we use in Eq. 共1兲 can be explicitly calculated 共see Appendix A兲 and leads to the following expression for the cun ␦共ti − t1兲 for n mulants: 具具⌳共t1兲 ¯ ⌳共tn兲典典 = n ! k1共k2 / ␥1兲n兿i=2 ⱖ 2 and 具⌳共t兲典 = k1k2 / ␥1, independent of time. Because the cumulants are delta functions, the noise is still white 共events are uncorrelated if they occur at different times兲; however the noise is no longer Gaussian because cumulants with n greater than two are nonzero. The master equation that describes this burstlike process is 关17兴 p共x,t兲 = − 关共␦ − ␥2x兲p共x,t兲兴 + k1 t x 冕 w共x − y兲p共y,t兲dy 0 共2兲 where p共x , t兲 ⬅ p共x , t 兩 x0 , 0兲 is the conditional probability that the protein concentration has a value x at time t given that it ␥ has a value x0 at time 0; and w共x兲 = k21 e−共␥1/k2兲x. Equation 共2兲 can be easily understood as follows. The first term in the right-hand side is simply related to the deterministic motion given by the first two terms in Eq. 共1兲 and it is independent of the nature of the noise term. The second and third terms are related to the probability per unit time to jump from a population of size y to a population of size x, w共x − y兲, given that k1 is the transition rate for a transcription event. The stationary solution of this model was already known 关21,22兴 with ␦ = 0. For arbitrary ␦ ⬎ 0, the stationary solution is 冉 冊 ␥1 k2 k1/␥2 冉 冊 ⌰共x − ␦/␥2兲 ␦ x− ⌫共k1/␥2兲 ␥2 number of cells 300 200 100 0 0 100 200 300 400 500 600 number of proteins fluorescence units 700 x − k1 p共x,t兲, ps共x兲 = 400 k1/␥2−1 e−共␥1/k2兲共x−␦/␥2兲 , 共3兲 where ⌰共x兲 is the step function equal to 1 when x ⬎ 0 and zero otherwise. This distinctive feature is a sharp signature of the nature of the noise even in the stationary solution but is present only when ␦ ⫽ 0. However, as shown in the fit to the stationary solution in Fig. 1, the singularity, if it exists, is easily masked by other noise effects, leading to a rounding effect. III. ALTERNATIVE MODEL Although experiments on gene expression 关5,18兴 are consistent with a burstlike protein production, steady-state distributions of protein abundances are equally compatible with alternative explanations. In fact, because mRNA is unstable compared to protein lifetime 共␥1 Ⰷ ␥2兲, one can assume that transcripts give rise to a constant flux of proteins f and subsequently any protein degrades at a constant rate ␥2. Because of the great amount of available molecules, one can apply the central limit theorem and suppose that the amplitude of fluctuations is simply proportional to 冑x. Within this framework there is no burstlike production; nevertheless the stationary solutions that one obtains for a burstlike process, including that of the extended autoregulation model 关23,24兴, are also obtained in models with appropriately chosen random multiplicative Gaussian noise 共see Appendix C兲. Within this sce- FIG. 1. The stationary distribution of proteins in a prokaryotic cell population taken from Ref. 关18兴 fitted to Eq. 共3兲 with ␦ = 0 or ␦ / ␥2 = 60.3 共dashed兲. The best fit parameters are ␥1 / k2 = 0.038, k1 / ␥2 = 12.88 共2 ⯝ 6100兲, and ␥1 / k2 = 0.030, k1 / ␥2 = 8.33 共2 ⯝ 8700兲, respectively. From the experimental data it is hard to distinguish between the steady-state distributions predicted by Eq. 共3兲 with ␦ = 0 and ␦ ⬎ 0. nario the stochastic evolution of the protein concentration x共t兲 is governed by the equation ẋ共t兲 = f − ␥2x共t兲 + 冑Dx共t兲共t兲, 共4兲 where 共t兲 is a Gaussian white noise with autocorrelation 具共t兲共t⬘兲典 = 2␦共t − t⬘兲. Note that the same equation could be obtained on setting 具⌳共t兲典 = f and 具具⌳共t兲⌳共t⬘兲典典 ⬅ 具⌳共t兲⌳共t⬘兲典 − 具⌳共t兲典具⌳共t⬘兲典 = 2Dx共t兲␦共t − t⬘兲 in Eq. 共1兲 with all higher-order cumulants being identically zero. Note that the square root of the multiplicative noise in Eq. 共4兲 is not introduced ad hoc. It has its roots in the central limit theorem and can also be justified on the basis of other general considerations about the discrete nature of the process. In fact, on temporal scales much larger than the mRNA lifetime, the protein production can be suitably described by a birth and death process whose rates are proportional to the number of available molecules. Accordingly, the fluctuations of this discrete Markov process can be well described by the multiplicative noise term present in Eq. 共4兲 in the continuum limit, i.e., when a large number of molecules is present. In addition, because the noise goes to zero when x共t兲 = 0, it also prevents the random variable x共t兲 from becoming negative. We point out that in ecology Eq. 共4兲 is useful for studying the evolution of tropical forests 关25兴, where the detailed nature of the stochastic noise is not important because of the relatively large numbers of trees of a given species. In the field of finance, Eq. 共4兲 has been used to study the evolution of interest rates 共the Cox-Ingersoll-Ross model 关26兴兲, where analogous considerations on fluctuations can be made. On defining f ⬅ k1k2 / ␥1 and D ⬅ ␥2k2 / ␥1, Eq. 共4兲 yields the same stationary state as in Eq. 共2兲 with ␦ = 0, i.e., Equation 共3兲. The mean protein concentration at stationarity is k1k2 / ␥1␥2 and the Fano factor at stationarity is k2 / ␥1, relations that are consistent with previous findings 关18兴. In order to take into account the effects of feedback in a system undergoing autoregulation, one can introduce the physically transparent modification f → Dc共x兲, where c is a response function which can be modeled as having two distinct limit- 031916-2 PROBING NOISE IN GENE EXPRESSION AND PROTEIN … px,t0,0 t 5 min px,t0,0 0.035 0.03 0.025 0.02 0.015 0.01 0.005 PHYSICAL REVIEW E 80, 031916 共2009兲 t 20 min 0.01 0.008 0.006 0.004 0.002 100 200 300 400 500 600 700 px,t0,0 0.007 0.006 0.005 0.004 0.003 0.002 0.001 0.5 50 100 150 200 250 100 200 300 400 500 600 700 x 20 10 50 x t 120 min 100 200 300 400 500 600 700 px,t0,0 0.004 0.003 0.003 0.002 0.002 x t 240 min 0.001 0.001 100 200 300 400 500 600 700 x 100 200 300 400 500 600 700 x FIG. 2. Protein distribution dynamics for different types of noise and with the same initial condition, i.e., x0 = 0 proteins at t = 0. The dashed curve is for the multiplicative Gaussian noise, i.e., Eq. 共4兲 with f ⬅ k1k2 / ␥1 and D ⬅ ␥2k2 / ␥1 共see Appendix C兲; whereas the solid curve is for the non-Gaussian noise, i.e., for Eq. 共5兲. In both cases the parameters are ␦ = 0, ␥2 / D = ␥1 / k2 = 0.038, f / D = k1 / ␥2 −1 = 12.88 and we have set ␥−1 2 = 40 min, ␥1 = 2 min. ing values at zero and at infinity with the latter being smaller than the former. Even in this situation, we obtain the same stationary distribution with bistability as in Ref. 关22兴. Despite this much more realistic analysis, the final stationary protein distribution is experimentally indistinguishable from Eq. 共3兲 with ␦ = 0. Thus, a theoretical modeling of the stationary state of protein production provides little insight into the microscopic nature of the noise that leads to stationarity. IV. TEMPORAL EVOLUTION These results raise the question whether the agreement between the stationary solutions of the theoretical models and experiments are in fact a direct probe of the nature of the microscopic noise and whether the asymmetric stationary solutions derive from a careful consideration of the bursty nature of the noise. In order to circumvent the indistinguishability of steady states, one can look into empirical protein abundances far from stationarity, for which we provide analytical formulas. Thus we turn now to a study of the dynamics of Eq. 共2兲, which is a powerful probe of the noise effects. We have derived 共see Appendix B兲 the solution at arbitrary time 共Fig. 2兲, p共x,t兲 = e−k1t␦共x − t兲 + ⌰共x − t兲 冋 ⫻exp − ⫻ 1F 1 冉 ␥1 ␥ t e 2 共x − t兲 k2 tmin 30 t 60 min 0.005 0.004 0.003 0.002 0.001 0.004 1.0 40 px,t0,0 100 200 300 400 500 600 700 px,t0,0 1.5 50 x t 40 min Ft Ft 册 k 1␥ 1 ␥ t 共e 2 − 1兲e−k1t k 2␥ 2 冊 k1 ␥1 + 1,2; 共e␥2t − 1兲共x − t兲 , ␥2 k2 共5兲 100 150 200 250 tmin FIG. 3. Fano factor dynamics with the same initial condition, i.e., x0 = 0 proteins at t = 0. The solid curves are for the non-Gaussian noise case, i.e., F共t兲 = var关x共t兲兴 / 具x共t兲典 is obtained from Eq. 共5兲 with ␦ = 0; the dashed curves are for the multiplicative Gaussian noise, i.e., F共t兲 is calculated with the analytical solution of Eq. 共4兲. The parameters 共k1 = 0.01, k2 = 15 min−1兲 in the main figure correspond to an infrequent production of large bursts, whereas the ones 共k1 = 0.3, k2 = 0.5 min−1兲 used in the inset give rise to small bursts −1 produced frequently. In both cases we have set ␥−1 1 = 2 and ␥2 = 40 min. where 1F1共a , b ; x兲 is the confluent hypergeometric function 关27兴 and 共t兲 ⬅ x0e−␥2t + ␥␦2 共1 − e−␥2t兲 is the solution of the deterministic part of the equation, i.e., without the noise. Interestingly, one obtains a distribution of proteins with a cutoff along the interval 关0 , t兲 at any time whenever ␦ ⬎ 0. The stochasticity plays a major role when the mean number of bursts per cell cycle is very small, i.e., k1 / ␥2 Ⰶ 1. In this case, the solution in Eq. 共5兲 can be approximated by replacing 1F1共1 , 2 ; z兲 with 共ez − 1兲 / z 共see Ref. 关27兴 and Appendix B兲. On using Eq. 共5兲, one can calculate the evolution of the Fano factor, F共t兲, starting from an arbitrary initial amount of proteins. This time evolution is highly sensitive to the nature of the stochasticity. Figure 3 vividly shows the distinct dependence of the initial dynamics of F共t兲 on the nature of the noise. Note that, at stationarity, the distinct types of stochasticity are indistinguishable. Interestingly, within the temporal transient, the fluctuations deviate from Poisson behavior, whereas, at stationarity, both models predict a Poissonian Fano factor for genes with high transcription and low translation rates and limt→⬁ F共t兲 = k2 / ␥1 ⯝ 1. V. FRACTIONAL PROTEIN DISTRIBUTION Another measurable quantity that directly probes the protein distribution and its temporal evolution is the fractional protein distribution 共FPD兲, P共 , t兲, i.e., the probability that at time t the ratio x共t兲 / x共0兲 is equal to , where x共t兲 and x共0兲 are the protein concentrations at time t ⬎ 0 and t = 0, respectively. Unlike the stationary distribution in Eq. 共3兲 with ␦ = 0, the FPD can exhibit a distinctive signature of the nature of the burstlike noise even under stationary conditions: it has an interval in which it identically vanishes and this, in principle, can be experimentally observed. This quantity can be defined at arbitrary times 共see Appendix D兲: if the initial distribution is the steady state given by Eq. 共3兲 with ␦ = 0, then at t ⬎ 0 the FPD is 031916-3 PHYSICAL REVIEW E 80, 031916 共2009兲 AZAELE, BANAVAR, AND MARITAN P共,t兲 = e−k1t␦共 − e−␥2t兲 + ⫻ 2F 1 冉 冉 冊 k1 ␥2 e 共e␥2t − 1兲⌰共 − e−␥2t兲 关1 + e␥2t共 − e−␥2t兲兴k1/␥2+1 2 −k1t 冊 共e␥2t − 1兲共 − e−␥2t兲 k1 k1 , + 1, + 1,2; ␥2 ␥2 1 + e ␥2t共 − e −␥2t兲 共6兲 where 2F1共a , b , c ; x兲 is the standard hypergeometric function 关27兴. Thus, according to Eq. 共6兲, under the burst process hypothesis we predict that 共i兲 the FPD vanishes between 0 and e−␥2t even though the system is at stationarity, an effect which ought to be detectable for time scales less than or of the order of 1 / ␥2, 共ii兲 the FPD depends on k1 and ␥2 only, 共iii兲 at very large time separation there is only one free parameter, the ratio k1 / ␥2, and the FPDs predicted by the Gaussian and non-Gaussian noises become the same 共see Appendix D兲. The analogous time-dependent solutions for the Gaussian white noise can be compared with Eqs. 共5兲 and 共6兲 共see Appendix D兲. The closer the system is to its steady state, the more difficult it is to distinguish among the effects of gestation, senescence and burstlike production. Thus an experimental protocol capable of analyzing the cell population and its time evolution with different initial conditions would be helpful to disentangle the nature of stochastic noise. At early times, the evolution of the distribution is strongly affected by the specific mechanisms involved in the dynamics. At this stage, different distributions of interarrival times between events or burst sizes produce nonstationary distributions that are very different, and the distinctive effects of noise, deterministic driving forces, or coupling of degrees of freedom can be elucidated. Different conditions at initial times propagate into the early temporal evolution in strongly different ways according to the different effects of involved mechanisms, but inexorably lead to the same distribution for large time separation. Let x1 , x2 , . . . , xn be n-independent one-burst processes with only positive increments. Let us suppose that they are identically distributed, so they have the same jump transitions, say a probability density function w共x兲 with x ⱖ 0. Let us further assume that the burst processes occur in time according to a Poisson distribution qn共t兲 = 共k1t兲nexp共−k1t兲 / n!, where k1 is the transcription rate. Because any burst produces xi proteins according to the distribution w共x兲, we are interested in the distribution of the n xi, i.e., the total amount of prorandom variable 共t兲 = 兺i=1 teins produced from t = 0 through t ⬎ 0, where we are assuming that n bursts have been occurred in a time t. The characteristic function for the process 共t兲 is ⬁ 具e iz共t兲 典 = 兺 具eiz共t兲兩n events by t典prob关n events by t兴 n=0 ⬁ = 兺 关g共z兲兴nqn共t兲, where we have used the independence of any one-burst process and we have used the definition g共z兲 ⬅ 具eizx j典 = 冕 ⬁ eizxw共x兲dx, 共A2兲 0 for any j = 1 , . . . , n. Because we are using the Poisson distribution qn共t兲, one obtains 具eiz共t兲典 = ek1t共g共z兲−1兲 . 共A3兲 Thus the characteristic function of 共t兲 in Eq. 共A1兲 defines the following integral equation for the distribution p共x , t兲 = 具␦共x − 共t兲兲典: 冕 ⬁ 0 VI. CONCLUSION 共A1兲 n=0 ⬁ eizx p共x,t兲dx = 兺 关g共z兲兴nqn共t兲. 共A4兲 n=0 By direct substitution one can verify that a solution is In summary, we have shown that fits to experimental data of the stationary solution do not discriminate between different types of noise. However, the temporal evolution of the probability distribution of protein or fractional protein concentration under stationary conditions as well as the evolution of the Fano factor can be used as a powerful probe of the noise effects of protein production. We have shown that the full time dependence of the analytical solution of the model proposed in 关22兴 with burstlike protein production events presents a singularity. This singularity is absent in the corresponding model with Gaussian multiplicative noise. p共x,t兲 = q0共t兲␦共x兲 + q1共t兲w共x兲 + q2共t兲 冕 x w共x − y兲w共y兲dx 0 ⬁ + ¯ = 兺 qn共t兲w共x兲 〫 w共x兲 〫 ¯ 〫 w共x兲 , n=0 n-fold convolution 共A5兲 where ␦共x兲 is a Dirac delta and the symbol “〫” stands for a convolution of w共x兲’s. Now we use the particular jump distribution w共x兲 = e−x 共 ⬅ ␥1 / k2兲. Hence ACKNOWLEDGMENT g共z兲 = This work was supported by PRIN 2007. , − iz 共A6兲 and the calculations for the nth convolution 共n ⱖ 1兲 result in APPENDIX A: DEFINITION OF THE NON-GAUSSIAN WHITE NOISE In this section we wish to exploit the burst hypothesis of protein production in gene expression in order to derive its probability distribution. n xn−1 −x e . 共n − 1兲! Thus we obtain with qn共t兲 = 共k1t兲nexp共−k1t兲 / n! 031916-4 共A7兲 PROBING NOISE IN GENE EXPRESSION AND PROTEIN … PHYSICAL REVIEW E 80, 031916 共2009兲 冉 冊 冕 ⬁ p共x,t兲 = e−k1t␦共x兲 + 共k1tx兲n e−k1t−x 兺 x n=1 共n − 1兲 ! n! j共x,t兲 = ␦ − t = e−k1t␦共x兲 + e−k1t−x I0共2冑k1tx兲 t x = e−k1t␦共x兲 + e−k1t−x 冑k1txI1共2冑k1tx兲, x + 共A8兲 where I共z兲 is the modified Bessel function of the first kind 关27兴, whose definition is ⬁ I共x兲 = 兺 n=0 +2n 共x/2兲 . n ! ⌫共 + n + 1兲 By exploiting this definition, one can see that in Eq. 共A8兲 there is no divergence at x = 0. We can also calculate all the moments 具n共t兲典 = n! k1te−k1t 1F1共n + 1,2;k1t兲 + e−k1t␦n,0 , n 共A9兲 where 1F1共a , b ; x兲 is the confluent hypergeometric function 关27兴 and n = 0 , 1 , 2 , . . .. Now we can consider the fundamental stochastic differential equation which defines the model in the main text, 冉 冊 x共t兲 dt + d共t兲, dx共t兲 = ␦ − 共A10兲 where ⬅ ␥−1 2 and 共t兲 is the noise whose probability distribution is given by Eq. 共A8兲. Finally, in order to recover Eq. 共1兲 in the main text, one can formally write d共t兲 = ⌳共t兲dt. The master equation which governs the process defined by Eq. 共A10兲 is Eq. 共2兲 in the main text. 冋冉 冊 册 冕 ␦− 1 − p共x,t兲, 1 x p共x,t兲 + x x z 0 0 w共z − y兲p共y,t兲dzdy x 共B2兲 p共y,t兲dy, 0 j共0,t兲 = ␦ lim p共x,t兲, where limx→0+ xp共x , t兲 = 0 due to the integrability of p共x , t兲 at x = 0. By Laplace transforming Eq. 共B1兲 with respect to x, one obtains 关共s兲 − 1兴共s,t兲 共s,t兲 s 共s,t兲 − bs共s,t兲 + = j共0,t兲 − , t s 共B4兲 共s兲 ⬅ 兰⬁0 w共x兲e−sxdx where is supposed to be finite. We are interested in the time-dependent reflecting solution of Eq. 共B1兲, so j共0 , t兲 = 0 and the characteristic equations for Eq. 共B4兲 get dt = ds d = , s 共共s兲 − 1 − ␦s兲 se−t/ = s0 , 冋冕 共s兲 = 0 exp 册 共B6兲 共兲 − 1 d − ␦共s − s0兲 , 共B7兲 s0 s where s0 , 0 are arbitrary constants. If we choose an arbitrary well-behaved function 共x兲 such that 共s0兲 = 0, one can write down the general solution of Eq. 共B4兲, 冋冕 册 共兲 − 1 d se−t/ s − ␦共s − se−t/兲 . 共B8兲 Since the initial condition is p共x , 0兲 = ␦共x − x0兲, we have 共s , 0兲 = e−sx0, thereby one can determine the function 共x兲, finally achieving ln 共s,t兲 = − 共␦ + x0e−t/ − ␦e−t/兲s + 共B1兲 −1 where ⬅ k−1 1 , ⬅ ␥2 , ␦ ⱖ 0 , w共x兲 is a probability density function and p共x , t兲 ⬅ p共x , t 兩 x0 , 0兲 is the conditional probability that the protein concentration has a value x at time t given that it has a value x0 at time 0. Note that Eq. 共B1兲 exhibits two temporal scales, and , which are related to transcription and protein dilution, respectively. The probability flux determined by Eq. 共B1兲 is 共B5兲 which can be easily solved. The solutions may be written as follows: w共x − y兲p共y,t兲dy 0 共B3兲 x→0+ 共s,t兲 = 共se−t/兲exp The main goal of this section is to obtain the fundamental solution of the integrodifferential equation that describes the evolution of the protein concentration within a population of isogenic cells under the burst hypothesis 关Eq. 共2兲 in the main text兴, 冕冕 where we assume that limx→⬁ j共x , t兲 = 0 for any t ⬎ 0. The flux at x = 0 is simply APPENDIX B: THE SOLUTION OF THE MASTER EQUATION p共x,t兲 =− t x 1 1 x p共x,t兲 + − 冕 s se−t/ 共兲 − 1 d . 共B9兲 By using the translation theorem which holds for Laplace transforms 关28兴, we may study only the function 冋冕 ⍀共s,t兲 = exp s se−t/ 册 共兲 − 1 d . 共B10兲 We are interested in solving Eq. 共B1兲 with w共x兲 = e−x, where ⬅ ␥1 / k2 is the translation efficiency as defined in the 031916-5 PHYSICAL REVIEW E 80, 031916 共2009兲 AZAELE, BANAVAR, AND MARITAN main text. So 共s兲 = / 共 + s兲 and ⍀共s , t兲 becomes 冉 se−t/ + ⍀共s,t兲 = s+ 冊 / ⌽共x兲 ⬅ 共B11兲 . It turns out that one can analytically calculate the inverse Laplace transform of ⍀共s , t兲 共see below兲, and then one can finally achieve the fundamental solution of Eq. 共B1兲, which is p共x,t兩x0,0兲 = e −t/ 冉 冊 ⫻ 1F1 + 1,2;共et/ − 1兲共x − t兲 , −t/ 共B12兲 In Fig. 1 in the main text we show ps共x兲 for different values of ␦ and we compare it with experimental data obtained in a prokaryotic cell population: the agreement is excellent. When the mean number of bursts per cell cycle is very small, one obtains / Ⰶ 1 and we can expand Eq. 共B12兲 in powers of / . In this case, the solution in Eq. 共B12兲 reads 冉冊 共B16兲 z̄ 2i z⬘+i⬁ z⬘−i⬁ 1− 1 z −c e共xz̄/a兲zdz, 1 z −c ⬁ =兺 n=0 共c兲n 1 , n! zn 共B18兲 when 兩z兩 ⬎ 1 and 共c兲n ⬅ c共c + 1兲 ¯ 共c + n − 1兲 , 共c兲0 ⬅ 1. The series in Eq. 共B18兲 is absolutely and uniformly convergent if 兩z兩 ⬎ 1; thus it can be integrated term by term ⬁ 兺 n=0 共c兲n n! 冕 z⬘+i⬁ z⬘−i⬁ 冉冊 兺 冉 冊 冕 e共xz̄/a兲z xz̄ n dz = 2i␦ z a ⬁ + n=1 共c兲n xz̄ n! a n−1 ⬘+i⬁ ⬘−i⬁ e d , n 共B19兲 ␦共x兲 = 1 2i 冕 z⬘+i⬁ z⬘−i⬁ exzdz. In Eq. 共B19兲 the integral may be evaluated along a simple closed path that encircles the origin within which the integrand has a pole of order n at z = 0. Thus, by applying the residue theorem, one obtains 冖 2i e . n d = 共n − 1兲! Therefore Eq. 共B19兲 gets ⬁ 1 共c兲 兺 n 2i n=0 n! 冕 z⬘+i⬁ z⬘−i⬁ 冉冊 共B20兲 ⬁ e共xz̄/a兲z 共c兲n 共xz̄/a兲n−1 xz̄ . +兺 n dz = ␦ z a n=1 n! 共n − 1兲! 共B21兲 ⌰共x − t兲 共exp关− 共x − t兲 x − t − t/兴 − exp关− e 共x − t兲 − t/兴兲 + O„共/兲 …, 2 共B15兲 We may use the definition and the properties of the confluent hypergeometric function 关27兴 to rewrite this latter relation as follows: ⬁ 兺 n=1 where we have used 共see Ref. 关27兴兲 1F1共1,2;z兲 = c 冉 冊 共B13兲 / ⌰共x − ␦兲共x − ␦兲/−1e−共x−␦兲 . 共B14兲 ⌫共/兲 t/ 冊 冉冊 冕 冉 冊 e−x/a a a b ⌽共x兲 ⬅ where ⬘ ⬎ 1 and we used when x Ⰷ 1 and a , b ⫽ 0 , −1 , −2 , . . .. Thus the normalized solution at stationarity is p共x,t兩x0,0兲 = e−t/␦共x − t兲 + k−i⬁ as + c sx e ds, bs + where x ⬎ 0, k ⬎ 0, 0 ⬍ a ⬍ b, ⬎ 0, and c ⬎ 0. On using the variable z ⬅ 共as + 兲 / z̄, with z̄ ⬅ 共1 − a / b兲 ⬎ 0, one obtains 1− where t ⬅ x0e + ␦共1 − e 兲, 1F1共a , b ; x兲 is the confluent hypergeometric function 关27兴, ⌰共x兲 is the step function which is equal to one for x ⱖ 0 and zero otherwise, and ␦共x兲 is a Dirac delta. It is worth noting that Eq. 共B12兲 exhibits a cutoff along the interval 关0 , t兲 at any time whenever ␦ ⬎ 0, thus the probability is generally continuous but not differentiable 共at x = t兲. One may also achieve the steady-state solution of Eq. 共B1兲 by using the leading term of the asymptotic representation of the confluent hypergeometric function 关27兴, i.e., ps共x兲 = k+i⬁ where z⬘ ⬎ 1. We can also write −t/ ⌫共b兲 x −共b−a兲 , ex 1F1共a,b;x兲 ⯝ ⌫共a兲 冕 冉 共B17兲 ␦共x − t兲 + ⌰共x − t兲 e−t/共et/ − 1兲 ⫻exp关− et/共x − t兲兴 1 2i ez − 1 . z 共c兲n 共xz̄/a兲n−1 = F1共c,1;xz̄/a兲 n! 共n − 1兲! 共xz̄/a兲 1 = c 1F1共c + 1,2;xz̄/a兲, and finally reaching the equation Laplace transform ⌽共x兲 = In this section we calculate the inverse Laplace transform of Eq. 共B11兲. In order to get it we study the function 031916-6 冉 冊 冋冉 冊 共B22兲 册 xz̄ e−x/a a c z̄ ␦ + c 1F1共c + 1,2;xz̄/a兲 , a b a 共B23兲 PROBING NOISE IN GENE EXPRESSION AND PROTEIN … We can obtain the solution in Eq. 共B12兲 on setting a = e−t/, b = 1, c = / and by using the translation theorem which holds for the Laplace transforms. APPENDIX C: THE GAUSSIAN WHITE NOISE CASE Let us consider the fundamental equation of the main text for the number of proteins at time t, x共t兲, within an isogenic cell population, ẋ共t兲 = ␦ − x共t兲/ + ⌳共t兲, where ⬅ ␥−1 2 共C1兲 and PHYSICAL REVIEW E 80, 031916 共2009兲 APPENDIX D: THE FRACTIONAL PROTEIN DISTRIBUTION The fractional protein distribution 共FPD兲, PFPD共 , t兲, is defined as the probability that at time t the ratio x共t兲 / x共0兲 is equal to , where x共t兲 and x共0兲 are the protein concentrations within a population of isogenic cells at time t ⬎ 0 and t = 0, respectively. The system can be initially prepared according to any kind of probability density function, pinit共x0兲, nevertheless because we are interested in FPD at stationarity, we are using Eqs. 共5兲 and 共6兲 of the main text, i.e., the timedependent reflecting solution and the stationary probability distribution with ␦ = 0, respectively. Thus, by definition the FPD is 具⌳共t兲典 = f , PFPD共,t兲 = 具␦共 − x/x0兲典 = 共C2兲 Because we suppose that all the other cumulants are negligible, Eq. 共C2兲 defines a multiplicative Gaussian white noise and on using the Itô prescription 关17兴, one can prove that Eq. 共C1兲 with Eq. 共C2兲 is equivalent to the following Fokker-Planck equation: ṗ = x关共x/ − f − ␦兲p兴 + D2x 共xp兲. pstat共x兲 = 共D兲−f/D⌫共f/D兲−1x f/D−1e−x/D , 共C4兲 where ⌫共x兲 is the gamma function 关27兴. This solution is equivalent 共with ␦ = 0兲 to Eq. 共3兲 of the main text, which has been derived on using a non-Gaussian white noise that embodies the burst hypothesis. One can also obtain the time-dependent solution of Eq. 共C3兲 with reflecting boundary conditions and with initial population equal to x0. One can find all the details of the derivation in 关29兴, here we write down only the final expression p共x,t兩x0,0兲 = 冉 冊 1 D f/D 冤 x f/D−1e−x/D 冋冉 冊 1 D 2 x0xe−t/ 册 1/2−f/2D 1 − e−t/ 冥 冤 冥 1 2 冑x0xe−t/ 共x + x0兲e−t/ D D ⫻ exp − I f/D−1 . 1 − e−t/ 1 − e−t/ dxpstat共x0兲p共x,t兩x0,0兲␦共 − x/x0兲, 0 共D1兲 where ⬎ 0, t ⬎ 0, and ␦共x兲 is a Dirac delta. Notice that PFPD共 , 0兲 = ␦共 − 1兲 and lim PFPD共,t兲 = t→+⬁ 冕 ⬁ dx0x0 pstat共x0兲pstat共x0兲 0 = ⌫共2兲 −1 , ⌫2共兲 共 + 1兲2 共D2兲 where  = k1 / ␥2 and ⌫共兲 is the standard gamma function 关27兴. Note that Eq. 共D2兲 has a peak for ⬎ 0 only when  ⬎ 1, whereas when 0 ⬍  ⬍ 1 there is an integrable singularity at = 0. Furthermore, because the Gaussian and nonGaussian noises have the same steady state, they both have the same FPD when t → +⬁. When substituting Eqs. 共3兲 and 共5兲 of the main text in Eq. 共D1兲 and performing some simple manipulations, one gets PFPD共,t兲 = e−t/␦共 − e−t/兲 + ⌰共 − e−t/兲 ⫻共et/ − 1兲 ⫻ 1F 1 冉 冕 ⬁ /+1 −t/ e ⌫共/兲 dxx/ exp关− x共1 + et/共 − e−t/兲兲兴 0 冊 + 1,2;共et/ − 1兲共 − e−t/兲x , 共D3兲 −1 where ⬅ k−1 1 , ⬅ ␥2 , ⬅ ␥1 / k2. On exploiting the formula 7.621.4 in 关30兴 for the integral, one can come up with Eq. 共6兲 of the main text, which is independent on . When the mean number of bursts per cell cycle is very small, one obtains / Ⰶ 1 and we can expand Eq. 共6兲 of the main text in powers of / . In this case, the solution reads PFPD共,t兲 = e−t/␦共 − e−t/兲 + − 共C5兲 In Fig. 2 of the main text we have compared the behavior of this equation with that in Eq. 共5兲 of the main text. Unlike this latter, Eq. 共C5兲 has no any cutoff neither at finite times nor at stationarity. ⬁ dx0 0 共C3兲 In this case the constant ␦ only shifts the average number of proteins, so we can assume that ␦ = 0 without lack of generality. In Eq. 共C3兲 p ⬅ p共x , t 兩 x0 , 0兲 is the conditional probability that the protein concentration has a value x at time t given p共x , t 兩 x0 , 0兲dx is the that it has a value x0 at time 0, i.e., 兰n+⌬n n fraction of cells with protein population between n and n + ⌬n; furthermore, f ⬅ k1k2 / ␥1 and D ⬅ ␥2k2 / ␥1. Setting ṗ = 0, one obtains the stationary solution of Eq. 共C3兲, 冕 冕 ⬁ 具具⌳共t兲⌳共t⬘兲典典 ⬅ 具⌳共t兲⌳共t⬘兲典 − 具⌳共t兲典具⌳共t⬘兲典 = 2Dx共t兲␦共t − t⬘兲. 冉 ⫻ ln 冉冊 冊 2 ⌰共 − e−t/兲e−t/ − e−t/ 1 + − e−t/ + O关共/兲3兴, 1 + et/共 − e−t/兲 where we have used 共see Ref. 关27兴兲 031916-7 共D4兲 PHYSICAL REVIEW E 80, 031916 共2009兲 AZAELE, BANAVAR, AND MARITAN 2F1共1,1,2;z兲 = − Along the same lines one can obtain the FPD for the Gaussian white noise. Thus, substituting the time-dependent reflecting solution in Eq. 共C5兲 and the steady state in Eq. 共C4兲, one comes up with the final expression PFPD共,t兲 = 2 f/D−1 ⌫ 冑 冢 冉 冊 冉冊 冉 冊冣 f/D+1 冉 42 共 + 1兲2et/ − 4 2 1.5 1 0.5 0.5 关6兴 关7兴 关8兴 关9兴 关10兴 关11兴 关12兴 关13兴 关14兴 关15兴 关16兴 关17兴 1 2 2.5 Ρ 3 0.5 冊 , 共D5兲 M. Kaern et al., Nat. Genet. 6, 451 共2005兲. J. M. Raser and E. K. O’Shea, Science 309, 2010 共2005兲. J. Paulsson, Nature 共London兲 427, 415 共2004兲. I. Golding, J. Paulsson, S. M. Zawilski, and E. C. Cox, Cell 123, 1025 共2005兲. L. Cai, N. Friedman, and X. S. Xie, Nature 共London兲 440, 358 共2006兲. J. A. Magee, S. A. Abdulkadir, and J. Milbrandt, Cancer Cells 3, 273 共2003兲. M. Thattai and A. van Oudenaarden, Genetics 167, 523 共2004兲. J. M. Pedraza and J. Paulsson, Science 319, 339 共2008兲. M. B. Elowitz, A. J. Levine, E. D. Siggia, and P. S. Swain, Science 297, 1183 共2002兲. J. M. Raser and E. K. O’Shea, Science 304, 1811 共2004兲. A. Becskei, B. B. Kaufmann, and A. van Oudenaarden, Nat. Genet. 37, 937 共2005兲. N. Rosenfeld et al., Science 307, 1962 共2005兲. J. M. Pedraza and A. van Oudenaarden, Science 307, 1965 共2005兲. J. R. S. Newman et al., Nature 共London兲 441, 840 共2006兲. A. Bar-Even et al., Nat. Genet. 38, 636 共2006兲. H. H. McAdams and A. Arkin, Proc. Natl. Acad. Sci. U.S.A. 94, 814 共1997兲. N. G. van Kampen, Stochastic Processes in Physics and 1.5 1 2 2.5 3 Ρ 0.5 2 2.5 3 2 2.5 3 2 2.5 3 Ρ 1 1.5 Ρ t 200 min FPD 1.2 1 0.8 0.6 0.4 0.2 1.5 t 30 min FPD 1.5 1.25 1 0.75 0.5 0.25 t 60 min FPD where f ⬅ k1k2 / ␥1 , ⬅ ␥−1 2 and D ⬅ ␥2k2 / ␥1. Note that in this case PFPD共 , t兲 is much simpler than Eq. 共6兲 in the main text and depends only on f / D = k1 / ␥2 and . In Fig. 4 we have compared the FPD for the non-Gaussian white noise 关Eq. 共6兲 in the main text兴 and Eq. 共D5兲, which have the same steady state. 关5兴 1.5 t 20 min 1 0.8 0.6 0.4 0.2 0.5 关1兴 关2兴 关3兴 关4兴 1 FPD 0.5 f/D+1/2 t 10 min FPD 2.5 6 5 4 3 2 1 1.75 1.5 1.25 1 0.75 0.5 0.25 f 1 + D 2 共 + 1兲 共et/兲 f/2D f 1 − e−t/ ⌫ D t sinh 2 ⫻ t 1 min FPD ln共1 − z兲 . z 1 1.5 2 2.5 3 Ρ 0.5 1 1.5 Ρ FIG. 4. Fractional protein distribution 共FPD兲. The dashed curve is for the Gaussian noise case, i.e., Eq. 共D5兲 with f ⬅ k1k2 / ␥1 and D ⬅ ␥2k2 / ␥1; whereas the solid curve is for the non-Gaussian noise case, i.e., Eq. 共6兲 in the main text. The arrow indicates the cutoff point. Note, however, that extrinsic noise could tend to smooth out the discontinuity. In both cases k1 / ␥2 = 12.88 and we have set ␥−1 2 = 40 min. Chemistry 共Elsevier, New York, 2004兲. 关18兴 E. M. Ozbudak, M. Thattai, I. Kurtser, A. D. Grossman, and A. van Oudenaarden, Nat. Genet. 31, 69 共2002兲. 关19兴 M. Thattai and A. van Oudenaarden, Proc. Natl. Acad. Sci. U.S.A. 98, 8614 共2001兲. 关20兴 A. M. Kierzek, J. Zaim, and P. Zielenkiewicz, J. Biol. Chem. 276, 8165 共2001兲. 关21兴 J. Paulsson and M. Ehrenberg, Phys. Rev. Lett. 84, 5447 共2000兲. 关22兴 N. Friedman, L. Cai, and X. S. Xie, Phys. Rev. Lett. 97, 168302 共2006兲. 关23兴 A. Becskei and L. Serrano, Nature 共London兲 405, 590 共2000兲. 关24兴 F. J. Isaacs, J. Hasty, C. R. Cantor, and J. J. Collins, Proc. Natl. Acad. Sci. U.S.A. 100, 7714 共2003兲. 关25兴 S. Azaele, S. Pigolotti, J. R. Banavar, and A. Maritan, Nature 共London兲 444, 926 共2006兲. 关26兴 J. Cox, J. Ingersoll, and S. Ross, Econometrica 53, 385 共1985兲. 关27兴 N. N. Lebedev, Special Functions and their Applications 共Dover, New York, 1972兲. 关28兴 G. B. Arfken and H. J. Weber, Mathematical Methods for Physicists, 5th ed. 共Harcourt Academic Press, Burlington, MA, 2001兲. 关29兴 S. H. Lehnigk, J. Math. Phys. 19, 1267 共1978兲. 关30兴 I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series and Products 共Academic Press, New York, 2000兲. 031916-8