Download comparing institutional influence

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Linear regression wikipedia , lookup

Regression analysis wikipedia , lookup

Time series wikipedia , lookup

Interaction (statistics) wikipedia , lookup

Data assimilation wikipedia , lookup

Coefficient of determination wikipedia , lookup

Transcript
Mitchell/Deane Page 1 of 25
COMPARING INSTITUTIONAL INFLUENCE:
THE RELATIVE EFFECTIVENESS OF THREE ENVIRONMENTAL AGREEMENTS
Ronald B. Mitchell1
Department of Political Science
University of Oregon
[email protected]
http:/www.uoregon.edu/~rmitchel/
Glenn Deane
Department of Sociology
University at Albany, State University of New York
[email protected]
Draft of Saturday, March 8, 2008 not for citation.
Criticism and suggestions -- whether constructive or otherwise -- welcome.
1
This paper extends the argument made in an article in Global Environmental Politics 2:4 (November 2002). The
material in this article is based upon work supported by the National Science Foundation under Grant No. 0318374
entitled “Analysis of the effects of environmental treaties” September 2003–August 2006. Any opinions, findings,
and conclusions or recommendations expressed in this material are those of the author and do not necessarily reflect
the views of the National Science Foundation. The paper has benefited from comments from Judith Kelley and
Layna Moseley. Early drafts of the paper benefited from generous support from a Sabbatical Fellowship in the
Humanities and Social Sciences from the American Philosophical Society and a 2002 Summer Research Award
from the University of Oregon.
Mitchell/Deane Page 2 of 25
Abstract
International relations scholars have engaged the debate over whether and when international institutions influence
state behavior for over a quarter of a century. This paper argues for shifting the debate from the sterile evaluation of
whether individual institutions matter to the more policy-relevant question of "which institution worked better." We
develop and demonstrate an approach for selecting institutions and evaluating their influence in ways that allow
comparative assessment of whether, how much, and what type of influence those institutions had. In particular, we
demonstrate the value of including multiple institutional variables that correspond to different claims about
institutional influence. Our empirical analysis, using three protocols under the Convention on Long-Range
Transboundary Air Pollution, brings to bear new variables and new data for 53 countries and 23 years that
demonstrate that our approach can distinguish whether institutions influence state behavior, how much influence
they had, the pathways by which they had influence, and the types of country on which they had influence.
Mitchell/Deane Page 3 of 25
Introduction
Which treaty worked better? Although the international relations has engaged the question of institutional influence
for over a quarter century, efforts to compare the influence of different international institutions have been
remarkably rare. While scholars have debated the question of "do regimes matter" {Haas, 1989 #1258}, diplomats
have gotten on with the business of institutional design, adopting certain design elements while eschewing others
with little systematic comparative input from the scholarly community. This article identifies a next step for research
into the influence of international institutions, namely, comparing their influence. Such a comparison involves three
analytic tasks: a) convincingly demonstrating the causal influence, if any, of each institution being compared, b)
identifying the magnitude of each institution's influence in a comparable metric, and c) providing a means to
evaluate whether the conditions under which each institution operated were comparable. The first task requires
seriously and carefully addressing numerous theoretical and methodological challenges that obstruct efforts to
accurately attribute causal influence to an international institution, including spurious correlations and endogeneity.
The second task requires comparing institutions that face nominally comparable political problems and estimating
their influence in ways that use plausibly comparable metrics of their degree of influence. The third of these tasks
requires explicitly evaluating the difficulty of the problems being addressed.
The present analysis takes up these three tasks as follows. First, it seeks to convincingly demonstrate the causal
influence of multiple institutions through quantitative analyses that include a range of non-institutional variables,
explicitly address problems of endogeneity, and remove both the "inertial" and the "country-specific" components of
the available data before evaluating the influence of international institutions. Second, the analysis takes three
protocols regulating pollution emissions under a single environmental treaty and measures their influence in terms of
annual proportional changes in those emissions, thereby allowing direct comparison of the influence of institutional
variables. Third, the analysis uses the inertial trends in the different datasets as well as the influence of other
included variables as proxies by which to compare whether the problems the agreements faced were relatively
comparable.
Analysis of the influence of international environmental institutions has now reached a stage at which quantitative
methods can both complement and extend the valuable insights generated by qualitative research methods {Mitchell,
2002 #4725}. Quantitative analysis allows reexamination of questions of institutional influence already addressed
by qualitative methods as well as initial examination of questions that cannot be, or usually are not, answered by
such methods. Careful quantitative modeling offers a range of different, and arguably superior, ways of
distinguishing institutional influence from random variation, of identifying the magnitude of institutional influence
relative to other factors, and even of accounting systematically for the social processes of institutional influence.
When applied to multiple institutions, quantitative techniques provide opportunities to quantify the magnitude of a
given institution's influence relative to other institutions.
This paper argues that analysis of comparable institutions within a comparable statistical model -- in which the
dependent variable (DV) is measured in identical "annual percentage change" units and the same right-hand side
variables are included -- allows direct comparison of institutional influence. It also demonstrates the value of
including multiple indicators of institutional influence that reflect different arguments regarding how and who
institutions influence. In particular, we argue for examining the influence of institutions on members separately from
examining whether institutions may influence non-members as well as members and whether, among member states,
institutions influence laggards differently than leaders. Finally, it illustrates these arguments by evaluating three
protocols under the Convention on Long-Range Transboundary Air Pollution, demonstrating variance in both their
degree and type of influence.
Theoretical setup
Do international institutions have influence? That is, do they cause states to behave differently than they would
otherwise? Under what conditions do they wield influence? What states do they influence? By what institutional
mechanisms do institutions influence state behavior? An increasing number of scholars have engaged these and
related questions with increasing sophistication {Goldstein, 2007 #5351; Kelley, 2007 #5391; Morrow, 2007 #5390;
Simmons, 2005 #5173; von Stein, 2005 #5172; Neumayer, 2005 #5208; Hathaway, 2002 #5140; Simmons, 2000
#4434}. Although conceptually straightforward, the question poses several significant analytic challenges. This
article develops a general model for analyzing the effects of international institutions quantitatively in ways that
build on recent concerns and insights about mis-estimating their influence. It highlights several analytic tasks
necessary to engage different claims that have been made about institutional influence. In particular, it highlights the
often-overlooked need to avoid both Type I and Type II errors, i.e., simultaneously to avoid claims of institutional
influence where it does not exist and to avoid claims of the lack of institutional influence where it does exist.
Mitchell/Deane Page 4 of 25
The initial debate between realists and institutionalists (or neo-realists and neo-institutionalists) over whether
institutions ever influence state behavior has expanded, over time, to include constructivists and a broader range of
questions, including "under what conditions do institutions matter?", "what features cause institutions to matter?",
"how do we know if (and show that) institutions matter?", "by what causal pathways do institutions matter?", and
"did this institution matter?" The debate has been engaged at a theoretical or "strategic" level, with some claiming
that the very nature of international institutions precludes them from wielding any independent influence on state
behavior {Waltz, 1979 #3185; Mearsheimer, 1995 #2020; Strange, 1983 #2979} while others have developed a
strong theoretical foundation for believing that states will, under certain circumstances, create international
institutions and will adjust their behavior to conform with the dictates of those institutions in ways they would not
otherwise {Keohane, 1984 #1624; Haggard, 1987 #1276; Martin, 2001 #4530}. The debate has also been engaged at
an empirical or "tactical" level, with some scholars presenting -- and others contesting -- evidence and arguments of
international institutions, at least certain institutions under certain circumstances, wielding influence.
There is no debate over whether states form international institutions -- states clearly do. And there is little debate
about whether states comply with international institutions -- states also clearly do. Indeed, realists that argue that
international institutions do not influence states generally expect and accept that states comply with institutional
dictates much of the time. The debate is over causation, i.e., it is over whether state behavior conforms to
institutional dictates because of institutions or for other reasons. Indeed, the debate is simply a new version of the
older question "why do states behave as they do," asked within a context that allows for the possibility that
international institutions are part of the answer. One line of counter-institutional argument involves a claim of
"spurious correlation," i.e., that whenever institutions appear to have influenced behavior, it is actually the influence
of exogenous non-institutional factors. The other, more recently engaged, line of counter-institutional argument
involves a claim of "institutional endogeneity," i.e., that, because international institutions are voluntary "contracts"
among states, their membership lists include only those states that expect their behavior will conform with
institutional dictates, but for non-institutional reasons. If institutional endogeneity is complete or "perfect," then
institutions wield no influence in a counterfactual sense, and institutions serve only to "screen" states rather than to
constrain their behavior {Downs, 1996 #897; Simmons, 2000 #4434; Keohane, 2003 #5171; Simmons, 2005 #5173;
von Stein, 2005 #5172; Mitchell, forthcoming #5194; Werner, 2005 #5373}.
Definitions
We focus here on the influence of those international institutions that derive from treaties, conventions, protocols,
and other formal agreements "negotiated among international actors, that prescribe, proscribe, and/or authorize
behavior" {Koremenos, 2001 #4555, 762}. Although international institutions can take many forms, the formal and
public nature of international legal agreements facilitates analysis by removing analytic ambiguity about the
commitments that states have made to each other.
To clarify the central concern of the debate over institutional influence, we distinguish between behavioral
conformance and behavioral adjustment. Much of the debate to date has been framed in terms of behavioral
"conformance," i.e., the degree to which a state's behavior conforms to institutional rules. Although the term
compliance is often used, we prefer conformance for several reasons. First, "compliance" dichotomizes behaviors in
ways that ignores institutionally-induced behavior that falls short of compliance, does not distinguish coincidental
and institutionally-induced compliance, and conflates institutional influences that produce "bare minimum"
compliance with those that induce behaviors that far exceed legal minimums. Second, there are many international
institutions whose rules are sufficiently vague that "compliance" cannot be determined but that should not, therefore,
be definitionally precluded from being influential or effective {for an elaboration of this discussion, see \Mitchell,
2007 #5307}. Thus, "conformance" involves the comparison of observed behaviors to institutional standards,
recognizing that that comparison should be continuous rather than dichotomous and that the institutional standards
may not always make such a comparison straight-forward.
But, as already noted, most important aspects of the debate are not about behavioral conformance but about
behavioral adjustment. We use the phrase "behavioral adjustment" to capture the causal aspect that is central to the
debate, namely, whether states adjust their behaviors in response to international institutions {Chayes, 1991 #650;
Simmons, 1994 #4117}. States can adjust either by changing their behavior in ways they would not have done
otherwise or by refraining from changing their behavior when, absent institutional influences, they would have. Our
goal in distinguishing between conformance and adjustment is for the latter term to refer to those behaviors that are
responses to international institutions regardless of the extent to which such behaviors conform with institutional
dictates.
Finally, since it is a central component of the debate, it proves useful to reiterate Sprinz and Vaahtoranta'
classification of states in terms of their interest in, and support for, international environmental regulation. Sprinz
Mitchell/Deane Page 5 of 25
and Vaahtoranta argue that state positions on international environmental policy can be predicted based on the
interplay between a state's ecological vulnerability to an environmental problem and the abatement costs it would
have to incur in any collective effort to address that problem {Sprinz, 1994 #2912}. The two variables of their
framework distinguish pushers (high vulnerability and low costs) and draggers (low vulnerability and high costs) but
also intermediates (high vulnerability and high costs) and bystanders (low vulnerability and low costs). We adopt the
more alliterative terminology in which pushers are "leaders" and all other states are "laggards."
Possible mechanisms and locations of institutional influence
Accurately assessing institutional influence requires knowing both where and how to look for it. That, in turn,
requires delineating extant claims about institutional influence. We believe there are essentially four major claims
about the influence of international institutions: they do not have such influence, they influence members but not
non-members, they influence leader members more than laggard members, and they influence both members and
non-members.
Institutions have no influence
Realist theory argues that international institutions have no independent influence on behavior. In the terms just
delineated, although there may be institutional conformance, there will be no adjustment. States join or reject
institutional membership based on calculations of which choice better aligns with their short-term, institutionindependent interests. After institutional formation, the behaviors of member states will tend to conform with
institutional dictates but will diverge from those dictates if and when they became "inconvenient." The behaviors of
non-member states will tend not to conform with institutional dictates but will do so if and when exogenous forces
make doing so in their interests. In this view, leader states promote, join, and comply with treaties while laggard
states oppose, do not join, and ignore the dictates of treaties. These dynamics produce a correlation between
conformance and membership but one that is spurious. States that have exogenous preferences for certain behaviors
(i.e., "leaders") create and join institutions that promote those behaviors and then follow their dictates; states that do
not share those preferences (i.e., "laggards") reject membership in those institutions and do not follow their dictates.
But for neither set of states, however, does post-agreement behavior differ from what it would have been otherwise:
members change their behavior only in ways that they would have had there been no institution and non-members
do not change their behavior.
Institutions influence members but not non-members
The dominant view of institutional influence is that they influence members but not non-members. States create
institutions to improve on the suboptimal outcomes created by independent decision-making. Institutions can create
conditions that make states more willing to take the risks involved in contingent, interdependent behaviors that can
produce Pareto-improving outcomes. States join or reject membership based on calculations of whether institutional
features reduce the risks of defection by other members sufficiently to warrant incurring the costs of their own
adjustment, in essence, supplementing the instrumental calculations predicted by realists with consideration of
additional longer-term and more interdependent considerations. Member states will not merely conform but may
adjust, i.e., they may adopt behaviors they would otherwise eschew to the extent they see others as doing likewise.
This line of reasoning has been adopted by most research on international institutions, with institutional influence
often estimated by comparing the behavior of members to otherwise-similar non-members {Goldstein, 2007 #5351}.
Although scholars disagree over exactly which institutional mechanisms lead states to adjust, they share an
underlying assumption that the mechanisms either only operate on -- or, at least, only have influence on -- member
states. Careful research has highlighted that the membership/non-membership distinction may not accurately
categorize which states are subject to institutional mechanisms and which are not: many institutions allow members
states to opt out of certain obligations {Rosendorff, 2001 #3844} and others "create rights and obligations for nonmembers" {Goldstein, 2007 #5351, 38}. But these claims largely argue simply for more accurate definitions and
categorizations of whether a state was legally subject to institutional provisions or not.
Institutions influence laggard members more than leader members
Other scholarship, however, suggests that using the membership/non-membership distinction risks much deeper
mis-estimation of institutional influence. Theories of institutional formation suggest that membership is composed
of both leaders and laggards {Sprinz, 1994 #2912}. Laggards may become members due to pressure from more
powerful states {Zürn, 1997 #4090; Martin, 1992 #1963} or because they have mixed motives that make their
interests in taking institutionally-required action contingent on the behavior of others. In either case, however, there
is a theoretical expectation that institutional influence on laggard states should be different, and most likely greater,
than on leader states. This perspective suggests that three types of states join international institutions: a)
unilateralist "leaders" that want other states to adopt the behaviors that they already plan to engage in or have
already undertaken for non-institutional, exogenous reasons; b) "contingent" states who view it as in their interest to
Mitchell/Deane Page 6 of 25
adjust their behavior if other states do and who accept that adjustments by others are contingent on their own
adjustments; and c) "recalcitrant" or "free rider" states who join because they view themselves as benefiting from
behavioral adjustment by other states or because they receive explicit side-payments but who seek to avoid the costs
that their own adjustment would involve. Indeed, a dominant theme in the literature is that institutions are responses
to Prisoners' Dilemma or Tragedy of the Commons problems in which the countries that negotiate and join an
institution are all "contingent" states that would act independently -- and suboptimally -- were it not for the
institution. To the extent this is true, members will adjust but non-members will not and, so, the member/nonmember distinction captures institutional influence adequately. But a closer reading of theory suggests a different
interpretation, namely, that leader members should be distinguished from laggard (i.e., non-leader) members with an
expectation that we should expect institutional influence only among the latter. The non-contingent leaders will
conform with institutional dictates but due to the exogenous forces that make them leaders. But contingent and
recalcitrant states who become members and conform their behavior to institutional dictates provide evidence that
adjustment is occurring. It is these non-leaders that we would expect, absent the agreement, to have behaviors
different than the institution dictates.
Institutions influence members and non-members alike
Finally, there are reasons to believe that both that institutions may influence the behavior of both members and nonmembers. As constructivists argue, states that have already accepted a norm may create institutions to strengthen or
expand its influence among states that have not yet accepted it. Yet, if institutions influence behavior through
normative -- rather than instrumental -- processes, then their influence need not stop at the member/non-member
divide. To be sure, some norms are defined in ways that mean they only apply in contexts of reciprocity and such
reciprocity may be limited to members of an institution. Thus, to the extent that granting most-favored nation status
to other states is based in normative, rather than instrumental, calculations, it would be likely to operate almost
exclusively among institutional members who have shown their willingness to reciprocate. But many norms are
defined in more universal and non-contingent terms. Thus, human rights and prisoner of war agreements promulgate
relatively non-contingent norms, that states are expected to follow even in dealings with states that do not follow
them and, to a lesser extent, even if a significant fraction of states fail to follow it. To the extent an institution
strengthens a norm, states may adjust their behavior to conform to the norm, even if not the legal specificities, of an
institution and such adjustments may not be limited to member states. In short, the expectation of conformance may
"detach" from institutional membership, creating pressures that may lead even non-member states to adjust.
Institutions may also foster technological, economic, or even transnational professional changes that make
adjustment easier and more attractive or non-adjustment more difficult and more costly, regardless of a country's
membership status. Coordination problems that are truly global may need only a portion of relevant states to accept
a particular standard to "tip the balance" in favor of that standard vs. alternatives. Thus, [Air Traffic Control doesn't
need all parties but all parties will abide by rules.] The 1987 Montreal Protocol's efforts to protect the stratospheric
ozone layer induced considerable research into alternatives to chlorofluorocarbons (CFCs) which identified
substitute chemicals that were, in at least some cases, cheaper than CFCs with the effect that most states, regardless
of their membership status, began shifting away from CFC use relatively rapidly. Although such adjustment clearly
resulted from "DuPont's decision to phase out CFCs and develop substitutes [which] changed the market conditions
and forced other CFC manufacturers to follow suit," evidence strongly suggests that Dupont would not have made
those decisions in the absence the Montreal Protocol {Haas, 1992 #1263, 221}. In a process that contrasts with
Gruber's "go it alone" power theory {Gruber, 2000 #4482}, institutions may be able to entrain social processes such
that behaviors that previously were available and attractive become difficult or costly to engage in, even for states
that would, otherwise, have preferred not to adopt those behaviors.
One analytic goal of this paper is to model institutional influence in ways that allow us to identify institutional
influence by any of these mechanisms as well as to distinguish among them. As developed below, we include
institutional variables to reflect these different possible mechanisms and locations of institutional influence.
The influence of international environmental institutions
To assess the possibility of comparing the influence of international institutions within a quantitative framework, we
examine three environmental protocols under the Convention on Long-Range Transboundary Air Pollution
(LRTAP). States negotiated the LRTAP Convention in 1979 and have negotiated eight additional protocols. We
focus on two that address sulfur emissions and one that addresses nitrogen oxides, the 1985 Protocol On The
Reduction Of Sulfur Emissions Or Their Transboundary Fluxes By At Least 30 Per Cent, the 1988 Protocol
Concerning The Control Of Emissions of Nitrogen Oxides Or Their Transboundary Fluxes, and the 1994 Protocol
On Further Reduction Of Sulfur Emissions. We refer to these as the first Sulfur Protocol, the Nitrogen Protocol, and
the second Sulfur Protocol, respectively.
Mitchell/Deane Page 7 of 25
These agreements have several analytic virtues. First, they foster comparison across protocols by controlling various
factors that are challenging to control statistically but can be controlled for through case selection. Analyzing these
protocols controls for many issue and institutional characteristics since the three protocols address different aspects
of the same environmental problem within the same overarching institutional framework. These cases also control
for potential member states, regional political background conditions and temporal changes, and level and trends in
economic development. In short, they pass at least an initial hurdle of addressing similar problems in a similar
setting.
Second, environmental treaties present interesting cases for evaluating institutional influence because the "tit for tat"
reciprocity considered central in most theoretical claims about -- and empirical evaluations of -- institutional
influence is unlikely to be the mechanism of institutional influence {Axelrod, 1984 #306; Downs, 1996 #897;
Morrow, 2007 #5390}. Morrow notes that international agreements "do change what states do by setting common
standards that strengthen reciprocal enforcement" {Morrow, 2007 #5390, 570}, but in environmental agreements (as
in human rights agreements) states are unlikely to adopt a reciprocity involving contingent pollution or contingent
wildlife protection because of both normative and bureaucratic constraints.
Third, scholars have increasingly applied quantitative methods to assess the influence of non-environmental
international institutions, including those related to international political economy {Goldstein, 2007 #5351;
Simmons, 2000 #4434; von Stein, 2005 #5172} and to human rights agreements {Hathaway, 2002 #5140;
Neumayer, 2005 #5208}. Quantitative techniques have been used only infrequently, however, to assess individual
environmental institutions {Auer, 2001 #5376; Neumayer, 2002 #5145; Helm, 2000 #4775; Murdoch, 1997 #4444;
Murdoch, 1997 #4770; Murdoch, 1997 #4747; Murdoch, 2003 #5255; Schofer, 2005 #5377; Ringquist, 2005
#5243}. This article seeks to build on the small body of work that has used quantitative techniques to assess the
influence of individual international environmental agreements in such a way that the quantitative assessments of
different agreements can be meaningfully compared {e.g., see \Neumayer, 2002 #5348; Roberts, 2004 #5349}.
Fourth, the agreements chosen illustrate what appears to be a more general problem of analyses of international
institutions, namely, that qualitative analyses of various international institutions provide compelling process
tracing, counterfactuals, and narratives of institutional influence that quantitative analyses often fail to support. That
pattern is evident specifically with respect to these agreements. These agreements have received both quantitative
and qualitative attention, with most quantitative analyses finding the institutions non-influential and the qualitative
analyses finding them influential. We strive here to conduct a quantitative analysis that holds the possibility of
assessing whether these apparently contradictory findings are actually consistent. Indeed, by building a quantitative
model that reflects qualitative insights while carefully addressing endogeneity, we believe we can contribute to this
debate. Qualitative research has found the LRTAP regime (or one or more of these protocols) has led at least some
states to change their behavior in ways that have improved air quality {Levy, 1993 #1818; Wettestad, 2002 #4714;
Wettestad, 2002 #5238}. By contrast, quantitative studies have contended that LRTAP protocols have "made no
difference" {Ringquist, 2005 #5243, 99}, merely codifying emission reductions members would have made "even in
the absence of the treaty" {Murdoch, 1997 #4770, 157; see also \Murdoch, 1997 #4444; Murdoch, 2003 #5255;
Finus, 2003 #5388}. By contrast, other quantitative studies of the same agreements have concluded that the 1988
Sofia Protocol led to reductions that were 2% greater than they would have been otherwise {Bratberg, 2005 #5387;
for a contrasting conclusion, however, see \Helm, 2000 #4775}. The most visible disconnect between the analytic
approaches fits with the theoretical claims made above. The major finding of the qualitative literature is that many
members conform but only relatively few adjust, i.e., many members reduce but only some reduce because of
institutional influence. By contrast, the quantitative literature finds that the average institutional affect, across all
members and after accounting for other factors, does not distinguish members as a group from non-members as a
group. Although quantitative analyses have not explicitly accounted for the difference between leaders and nonleaders, the qualitative analyses, by necessity, have built their findings on careful analysis of relatively few countries
and without the advantages of more systematic evaluation of variables that is fostered in quantitative analysis.
Finally, the extensive involvement of the International Institute for Applied Systems Analysis (IIASA) in the
operations of the LRTAP regime have produced data on the dependent variable (DV) and several independent
variables (IVs) that has been created and/or carefully reviewed by experts, making it of far higher quality than
corresponding data for other agreements. Given that analyses of data of low quality can fail to identify real
correlations while also identifying spurious ones, analyses of LRTAP data are more likely than in many realms to be
credible.
The analytic tasks
To make a convincing comparison of the influence of two or more international institutions, i.e., to compare the
relative effectiveness of two or more regimes, requires attention to several analytic tasks. The first involves
Mitchell/Deane Page 8 of 25
specifying the model so that it accurately addresses the questions of theoretical interest. The second involves
specifying the model in ways that, as much as possible, move toward supporting valid causal claims by addressing
alternative explanations and endogeneity. The third involves estimating the magnitude of the influence of the
institutions being compared in units and with models that allow those estimates to be meaningfully and convincingly
compared. This includes demonstrating that the institutions operated in reasonably-similar circumstances that posed
reasonably-similar obstacles to progress in resolving the problem.
Specifying the model to address institutional influence
The foregoing discussion of possible mechanisms and locations of institutional influence highlights the need to
include three different institutional variables. The first is a membership variable. This is the standard approach in
which, if the average behavior of members differs sufficiently from that of otherwise-similar non-members, it is
taken as evidence of institutional influence. This variable is coded as 0 for non-members and 1 for members.
Although an important variable to include, this approach fails to model an important feature of institutional theory,
properly. A regression that compares all members to all non-members identifies institutional influence as the
average across both leader and non-leader member states. Yet, membership ranks often include, and may be
dominated by, leader states that theory predicts will not be influenced by the institutions (i.e., will conform but not
adjust). In such cases, institutional influences on non-leader states that join an institution may be missed because that
institutional influence is "watered down" by leader states that are members but are not influenced by the institution.
Therefore, we include a second institutional variable in the form of a non-leader-membership interaction term. This
variable, which we do not believe has been used in prior quantitative analyses of international institutions, evaluates
whether the behavior of non-leader members differs from the behavior of otherwise-similar leader members. This
variable is coded as 0 for non-members, 0 for leader members, and 1 for non-leader members for the period after
they become members. That interaction term also (when combined with the membership term) evaluates whether the
behavior of non-leader members differs from the behavior of otherwise-similar non-members. Finally, to address
the possibility of institutional influence on non-members, we include an institutional existence variable. This
variable is coded as 0 for all countries before an agreement enters into force and 1 thereafter. Inclusion of this
variable allows for identification of whether an institution influences states through processes that are not dependent
on the usually instrumental impacts connected to institutional membership. Although the simplified model here
sheds little light on what those processes might be, the influence of institutions through pathways involving norms
and ideas or through economic or technological changes can be captured through such a variable.
Addressing alternative explanations and endogeneity
We develop our analytic model with the goal of avoiding the Type I error of finding institutional influence where it
does not exist. Besides including a range of generic and issue-specific independent variables identified by prior
researchers and placing analytic uncertainty on the side of institutional non-influence rather than institutional
influence, we address the problem of endogeneity directly.
Our analytic approach was influenced significantly by the desire to address the problem of endogeneity, i.e., the fact
that countries are not randomly assigned to receive the "treatment" of protocol membership but "self select" into the
"treatment" group. States may decide whether to become members of an agreement based on their assessment of
how they are likely to behave rather than deciding how to behave based on whether they are members or not. In
short, the endogeneity claim is that countries become members of international institutions at the point in time that,
for exogenous reasons, they are ready to change their behavior to conform with institutional dictates. Therefore,
before/after and member/non-member comparisons may exhibit significant differences even if the institution has no
independent influence on behavior. An increasing number of scholars have recognized that, because states selfselect into institutional membership, behavioral differences between members and non-members may arise because
international institutions "screen" rather than constrain {Simmons, 2000 #4434; Keohane, 2003 #5171; von Stein,
2005 #5172; Simmons, 2005 #5173; Kelley, 2007 #5391; Mitchell, forthcoming #5194}. Indeed, the problem has
been directly addressed in relation to the influence of LRTAP-related protocols {Murdoch, 2003 #5255; Ringquist,
2005 #5243}. Intuitively, endogeneity involves the notion that states that becomes members of an international
institution are, even before they become members, different than states that do not become members. In this view, to
the extent that members and non-members behave differently after an agreement takes effect, these post-protocol
differences are due to pre-existing differences between members and non-members rather than to any influences of
the protocol itself.
Endogeneity can be addressed in various ways. The first, and preferred, method is to analyze two-stage models that
estimate the likelihood that a state will join an institution and use that likelihood as inputs to models of subsequent
behavior. Second, the analyst can and should include variables considered to drive both behavior and membership so
Mitchell/Deane Page 9 of 25
that the influence of institutional variables is evaluated after having accounted for other differences between
members and non-members. Third, the analyst can use matching techniques to compare members to non-members
that have been carefully selected to be "otherwise-similar" {Barabas, 2006 #5569}.
We address endogeneity in three ways. First, we include proxies for a broad range of independent variables, as
identified in prior analyses, to ensure that institutional variables are not biased by those factors being omitted
variables in the analysis. Second, we do not directly instrument our variables in a two-stage approach that includes
the modeling of negotiating positions on these protocols. But we do include issue-specific variables that correspond
to the "ecological vulnerability" and "abatement costs" that have been found to be important drivers of state
positions in international environmental negotiations {Sprinz, 1994 #2912}. Third, we include a variable to capture
whether states are leaders or non-leaders that is identified based on evaluations by other scholars of the negotiating
positions states took rather than on whether states became members or not. Including this variable allows us to
distinguish leader members from non-leader members in line with the theoretical implications delineated above.
Estimating the magnitude of institutional influence in comparable terms
If the foregoing steps are necessary to convincingly evaluating any international institutions influence, there are
additional constraints imposed by our desire to compare institutional effects. To rank agreements in terms of their
effectiveness, rather than to simply categorize them dichotomously as effective or not, requires models that meet at
least three additional criteria.
First, the models must be analytically similar. We adopt the view that, indeed, the models should include the exact
same explanatory variables and vary only in the dependent variable being regressed. Thus, as noted below, we
model both sulfur dioxide and nitrogen oxide emissions as functions of the same independent variables.
Second, if we want to answer the question with which this article began, we must create models that generate
coefficients for institutional variables which are meaningfully comparable across regressions. That, in turn, requires
that the dependent variable be measured in the same units. Annual percentage change (APC) scores are an important
means of allowing comparisons of institutional influence that do not depend, as various proposed alternative
methods do, on qualitative and often subjective assessments of institutional influence {Mitchell, 2002 #4725; Hovi,
2003 #5014; Hovi, 2003 #5017; Young, 2003 #5018; Underdal, 2004 #4774; Miles, 2002 #4230}.
Finally, meaningful and convincing comparison of the effects of different institutions requires selecting institutions
that appear to have faced relatively similar problems that were equally malign (or benign) in their
character{Underdal, 2002 #4221}, and then, in addition, demonstrating that the evidence confirms that the problems
were relatively equal in their susceptibility to resolution. In short, the institutions being compared must be shown to
have operated in circumstances that posed reasonably-similar obstacles to progress in resolving the problem.
Analytic and data improvements
Our approach builds on, but differs from, previous quantitative studies of LRTAP protocols in using significantly
enhanced models, data, and estimation techniques. First, we develop and analyze a panel (cross-sectional timeseries) dataset of relevant variables that is significantly larger in countries, years, and variables than previous
analyses. Our dataset includes annual data on emissions from 1980-2003 and corresponding annual data for a core
subset of independent variables for 53 countries and for all modeled independent variables for 36 countries.
Ringquist and Kostadinova's 2005 article uses data for only 19 countries that ends in 1994 and the most recent work
by Murdoch, Sandler, and colleagues (2003) uses data for only 25 countries that ends in 1990. Considerable
improvement has been made in agreement-specific variables, with annual national data included for ecological
vulnerability (measured as exceedances of critical loads), abatement costs (measured as costs of a 5% emission
reduction as a share of GDP), and self deposition and deposition import (measured as share of emissions deposited
domestically and share of deposition imported from other countries). Although data on all three variables have been
used previously, no previous studies have included annual data for these variables.
Second, past analyses either have used aggregate levels of emissions or percentage changes from a single base year
{see \Murdoch, 1997 #4444; Murdoch, 1997 #4770; Murdoch, 2003 #5255; Ringquist, 2005 #5243}. But the APC
scores that we adopt because of their value for comparing institutional effects also have the virtue of helping
transform panel data that, by its nature, contains considerable country-specific and time-specific trends, thereby
moving it closer to meeting the fundamental assumption of, and necessary condition that, observations in a statistical
model be independent and identically distributed (IID). Data on country emissions over time are, to be sure, not IID
in their "raw" form. To move country-year panel data significantly toward IID, one can remove trends over time and
trends due to country-specific factors. Modeling annual percentage changes "squeezes out" a significant amount of
the "embedded" correlation that arises in panel data due to both time-trends and country characteristics.
Mitchell/Deane Page 10 of 25
Third, and following previous work, we seek to identify and isolate institutional influence by modeling emissions as
a function of both institutional and non-institutional factors. We include proxies for all non-institutional (i.e., nonprotocol) independent variables used in previous analyses, but add proxies for economic performance, ecological
vulnerability, abatement costs, and strategic interaction that have either not been included in previous analyses or
have been included in only limited fashion. We initially model only non-institutional factors and then assess whether
institutional variables are both statistically significant (i.e., as measured by their t-statistics) and add to the
explanatory power of the model. To take advantage of the availability of emissions data for several non-European
countries not included in previous analyses, we run a baseline model that includes all 53 countries for the full time
period, using only those variables for which independent variable data is available for all countries.
Finally, a central aspect of the analysis here is comparing the influence of three different protocols, the first sulfur
protocol, the second sulfur protocol, and the nitrogen protocol. Conducting a quantitative analysis of multiple
agreements is intended to allow us to identify the relative influence of different agreements by generating a common
metric of effects within a model that explicitly accounts for other factors that may be driving the behaviors the
agreement tries to influence. This approach not only allows us to assess whether the influence of an institution is
larger or smaller than the influence of non-institutional factors in a given case as well as how that institutional
influence compares with the influence of other institutions. Looking across three institutions also allows us to
evaluate whether non-institutional factors play similar or different roles in different environmental contexts, e.g.,
evaluating whether economic forces make some problems more intractable than others {Underdal, 2002 #4221}.
This paper does not make full progress toward several more ambitious goals regarding institutional comparisons {as
delineated in \Mitchell, 2002 #4725} but represents an initial attempt to pursue them with some degree of
quantitative rigor.
Data and Variables
We model emissions as a function of broad-scale demographic, economic, and political forces; of more direct
factors including ecological vulnerability, abatement costs, emission reduction strategies; and of institutional forces.
Issues related to the treatment of missing data are discussed in Appendix II.
The dependent variable: emissions
Our dependent variables are annual national emissions of SOx and NOx (designated sox and nox). As noted, we
model emissions in terms of annual percentage changes, operationalized here by taking the natural log of emissions.
The dataset includes emissions data for 53 countries, combining commonly-used IIASA-RAINS emissions data for
23 years for 43 European states with at least 11 years of emissions data for 10 additional states (OECD emissions
data for two LRTAP members [Canada and the United States] and four non-members [Australia, Japan, Korea, and
New Zealand] and national data for four post-Soviet Asian republics [Kyrgyzstan, Turkmenistan, Tajikistan, and
Uzbekistan]). This more than triples the number of country-year observations for emissions used in recent analyses
{Ringquist, 2005 #5243}.
Generic independent-variables: metrics of demographic, economic, and political influences
Variation in emissions of most pollutants are likely to be influenced by both the population and level of economic
activity of a country. We include annual population data (designated pop) from the World Bank's World
Development Indicators dataset {World Bank, 2006 #5277}.
Growth in a country's economic activity also should produce growth in its pollution levels. We use Gross Domestic
Product (designated gdp) as our proxy for economic activity, using figures reported in 1990 US dollars and
purchasing power parity to allow meaningful comparison across countries and over time {Maddison, 2006 #5269;
Groningen Growth and Development Centre, 2006 #5268}.
Both the literatures on environmental Kuznets curves and on post-materialism engage, in different ways, the notion
that economic development has environmental benefits, i.e., that "richer" means "greener" {Grossman, 1995 #4562;
Harbaugh, 2000 #4563; Inglehart, 1990 #1493; Inglehart, 1995 #1492} To evaluate these arguments, we include per
capita income data, based on dividing GDP figures by population figures (designated gdp_capita).
We also evaluate the influence of trade openness (designated trade_gdp), including the standard metric of total trade
(exports plus imports) as a percentage of GDP, using annual UN Statistical Data for all three components as measure
in current US dollars {United Nations Department of Economic and Social Affairs, 2006 #5332}.2
2
To address the lack of UN Statistical Data for Soviet exports and imports, data on GDP, exports, and imports for
the Soviet Union was extracted from the annual CIA World Factbook {Central Intelligence Agency - United States,
1981-1992 #5363}.
Mitchell/Deane Page 11 of 25
To capture the influence of governmental structure, we follow standard practice and use annual Freedom House
sources of 0 for Not Free, 1 for Partially Free, and 2 for Free (designated free) {Freedom House, 2006 #4773}.3
Issue-specific independent variables
Leaders and laggards
For both NOx and SOx negotiations, we sought to distinguish leaders from non-leaders. For SOx, we combined
evidence of actual state negotiating positions from Sprinz and Vaahtoranta {, 1994 #2912, 100-101} with Levy's list
of countries that, as of 1989 (after the protocol was signed), had declared reduction targets greater than the 30%
required by the protocol {, 1993 #1818, Table 3.8, 118}.4 For NOx, we identified leaders as those countries that, in
addition to signing the NOx protocol (which required freezing emissions) signed an optional 30% reduction pledge
{Levy, 1993 #1818, Table 3.2, 97}. Since non-leaders are the countries most likely to show institutional influence,
we inverted the coding scheme and designate it as "laggard," coding leader countries as 0 and all other countries as
1. The codings are listed in Appendix I. Although a significant improvement over prior failures to address leaders
and laggards directly, these proxies for both the SOx and NOx cases are, nonetheless, likely to be rather "noisy", i.e.,
to lead to coding some leaders as laggards and vice versa.
Metrics of ecological vulnerability
As Sprinz and Vaahtoranta note, ecological vulnerability and abatement costs (essentially corresponding to the costs
and benefits of environmental protection) play an important role in determining a state’s positions in international
environmental negotiations and, presumably, their behavior in response to those agreements {Sprinz, 1994 #2912}.
Both Sprinz and Vaahtoranta as well as Murdoch and Sandler have undertaken analyses that include countryspecific cost estimates for SOx reductions as well as estimates of vulnerability to SOx emissions from other
countries {Murdoch, 1997 #4444}. The analysis here incorporates far more refined data that estimates the costs in
each year for each country for different pollutants.
We incorporate two different proxies to capture ecological vulnerability. The first proxy is forest cover as a
percentage of national land area (designated forest) from the World Bank's World Development Indicators dataset
for 1990, 2000, and 2003, with annual data being interpolated or extrapolated from available data points {World
Bank, 2006 #5277}. We incorporate forest cover in our model largely to provide comparability with previous
studies {Murdoch, 1997 #4444; Murdoch, 1997 #4770; Murdoch, 2003 #5255; Ringquist, 2005 #5243}. However, it
is a poor proxy, both because it has only weak construct validity (i.e., it does poorly at measuring that "variation in
the extent of the problem" in which we are interested) and because empirical data is so sparse.
To address these concerns, we developed a new dataset corresponding to ecological vulnerability as understood by
those negotiating these agreements. The RAINS research team at IIASA assesses environmental damage by
estimating the extent to which actual depositions exceed a "no damage" state for all ecosystems within a country
{Amann, 1998 #5364, 5} Since different ecosystems are differentially susceptible to acidification, these
"exceedances" must be "accumulated" across different ecosystems. Since ecosystems vary in size across countries,
these estimates are normalized by land area {Amann, 1998 #5364, 5}. The RAINS model estimates these average
accumulative exceedances (AAEs) in "averaged acid equivalents per hectare and year" or "aeq/ha/year") for all
countries in the LRTAP region, based on a given set of national emissions. For each year, national AAE figures
(designated aae_avg) were generated by entering actual emission levels for all countries in that year into the RAINS
model {International Institute for Applied Systems Analysis, 2006 #5334}.5
Metrics of abatement costs
Since emission levels are also, presumably, influenced by the estimated costs of reducing from those levels, for each
country-year observation we estimate the cost of a 5% reduction from then-current levels as a percentage of GDP.
The optimal dataset would consist of those costs that policy-makers estimated, at the time, that they would need to
3
That dataset does not provide scores for 1982, but the scores for all countries in the dataset are identical for 1981
and 1983 and so those scores are filled in with the 1983 scores.
4
An alternative rule for coding "leaders" would be to identify those countries that supported the "30% club" but, as
of the end of 1984, that group included all but 1 of the countries that -- six months later -- signed the protocol. The
list also a) codes Germany as a leader as a single country, based on West Germany being a leader and bringing East
Germany along after reunification; b) codes Russia, Belarus, and Ukraine as leaders since it was those three Soviet
republics that actually signed the protocol; and c) codes the Czech Republic and Slovakia as leaders since
Czechoslovakia was a leader at the time.
5
Chris Heyes of the International Institute for Applied Systems Analysis provided invaluable assistance in
developing and interpreting these estimates.
Mitchell/Deane Page 12 of 25
incur to make a small reduction from then-current emission levels. Since such estimates of emission reduction costs
were neither generated nor recorded for most countries nor in most years, we follow previous authors in relying on
estimates of costs generated by IIASA's RAINS team. At regular intervals, the RAINS team has estimated
abatement costs for several retrospective and prospective points in time {International Institute for Applied Systems
Analysis, 2007 #5372}. Thus, in 2000, they estimated cost curves for 1990, 1995, 2000, and 2010. Efforts to
combine data from different estimates demonstrated that variation in the data generated by such an approach
reflected changes in estimation techniques more than changes in underlying costs. Therefore, we have used the 1990
cost curves generated by IIASA in 2000 since, of cost curves available to the authors, they represented the estimates
closest in time (and were estimated closest in time) to the midpoint of the data series.
Using these 1990 cost curves, we estimated the costs for each country of making a 5% reduction from then-current
levels. For example, Belgium produced 400 units of SOx in 1985; a 5% reduction equates to a 20 unit reduction
leaving 380 units remaining; IIASA's 1990 cost curves estimate the marginal costs for Belgium to reduce to 380
units as 366.2 Euros per ton of SO2 removed. Costs for some countries in some years could not be estimated
because actual emission levels were above or below those included in the cost curves. Data for missing years was
filled in with data from the next subsequent year for which data was available (or, for missing years at the end of the
time period, from the last prior year for which data was available). Although RAINS estimates of marginal costs
increase monotonically, marginal abatement cost estimates derived by the foregoing method do not, with years with
higher emission levels than the previous year likely to have lower marginal costs. Yet, the logic of marginal costs
suggests that countries would choose the lowest marginal cost option to reduce emissions and that subsequent efforts
to reduce emissions would be more expensive. Therefore, annual estimates were revised to increase monotonically
by replacing cost estimates in a given year with the previous year's cost estimate wherever that prior year cost
estimate was higher. To estimate abatement costs (designated sox_expense_percent and nox_expense_percent) as a
share of GDP, we multiplied these estimates of marginal costs for a 5% reduction by the quantity of emissions
represented by a 5% reduction and divided the product by GDP.
Metrics of preferred emission reduction strategies
Although ecological vulnerability and abatement costs indicate whether and how strongly a country will want an
environmental problem addressed, other scholars have suggested the need to include other variables to identify what
policy they will prefer for addressing the problem {Murdoch, 1997 #4770; Murdoch, 2003 #5255}. For a country
that is vulnerable to an environmental problem, its preference ranking of unilateral or multilateral action will depend
on how transnational its own environmental problem is. In the acid precipitation case, a country's preference for
international cooperation depends on how much of its problem involves "self deposition," i.e., deposition of
emissions from its own emitters, and how much involves "deposition import," i.e., deposition of emissions from
emitters in other countries. We measure self deposition (designated sox_self_depo and nox_self_depo) as that
fraction of a country's emissions (deposited in the European region) 6 deposited on its own territory. This proxies a
country's incentive to restrict domestic production of emissions as a strategy to lessen depositions on its territory.
Thus, a country with a 100% self deposition rate receives all the benefits of any domestic emission reductions it can
induce in the form of reduced deposition in its own territory; a country with a 0% self-deposition rate receives no
direct benefits from any emission reductions it makes.
We measure deposition import (designated sox_depo_import and nox_depo_import) as that fraction of total
deposition on a country's territory that originates in other countries. This proxies a country's incentives to engage in
international cooperation as a strategy to reduce depositions on its territory. Thus, a country with a 100%
"deposition import" level receives all depositions from other countries and has strong incentives to find strategies
that induce other countries to reduce their emissions; a country with a 0% "deposition import" level receives no
depositions from other countries and has no incentives to engage in international efforts to get other countries to
reduce their emissions.7
6
EMEP and other sources {Murdoch, 1997 #4444} use total emissions to calculate the corresponding figures. We
use only emissions deposited in the EMEP-modeled region (some being deposited outside that region), because
those data can be assumed as strictly comparable to the national and aggregate deposition data in the model, since
total emissions are from separately reported sources.
7
To clarify that self deposition and deposition import are not complements, consider a country that emits 100 tons
of SOx of which 80 tons lands on its own territory (20 traveling to neighboring countries) and which receives a total
of 240 tons of SOx on its territory. Then, its self deposition rate would be 80% (80/100) while its deposition import
rate would be 66.7% (160/240).
Mitchell/Deane Page 13 of 25
In what constitutes a comprehensive new dataset, annual data have been derived from a series of source-receptor
matrices (also called "blame matrices," since they identify which countries are to blame for any given countries'
deposition levels) published periodically by the Norwegian Meteorological Institute (NMI) (data for 1985 and 19871992 {Sandnes, 1993 #5251}; 1996 {EMEP, 2001 #5352}, 1997 {EMEP, 1999 #5353}, 1998 {EMEP, 2000
#5354}, 2000 {EMEP, 2003 #5355}, and 2003 {EMEP, 2005 #5356}). For this variable, as with forests, available
data was used to interpolate or extrapolate missing data.
Institutional variables
As noted above, we incorporate three different variables to estimate the influence, if any, of the protocols being
analyzed.
Membership: The first, and most usual, metric involves institutional membership using the date an agreement enters
into force for individual countries (designated sox1_ceif, sox2_ceif, nox_ceif, where ceif stands for country entry
into force). Such an approach compares how states behave as members of an institution with how those same states
or comparable states behave when they are not members of the institution. Membership data are coded as 1 in those
years in which a country is in force for a country and 0 in all other years.8
Institutional existence: We also include a second metric for institutional existence as measured by an agreement's
general entry into force date (designated sox1_teif, sox2_teif, nox_teif, where peif stands for protocol entry into
force). Institutional existence variables compare the behavior of all countries (whether members or not) before and
after an agreement takes effect. This, less typical, approach assumes that institutions have the potential to influence
all states, regardless of membership status. By including both membership and institutional existence variables, we
can distinguish three possibilities: a) institutions influence all states equally regardless of their formal membership,
b) institutions influence members but have no influence on non-members, and c) institutions influence both
members and non-members but to different degrees. Data on institutional membership and existence comes from the
LRTAP Secretariat {United Nations Economic Commission for Europe, 2007 #5077}.
Laggard-membership interaction: As noted previously, theoretical considerations lead us to be interested in
assessing whether institutional influence is most evident among non-leader member states. To evaluate this
hypothesis, we include a laggard-membership interaction term that is simply the product of our laggard and
membership variables.
Modeling methodology
Our model's structural form also requires attention. First, we use APC for our DV and include a one-period lag of
the DV as a right-hand side (RHS) variable. The use of APC (implemented here by taking the natural log of
emissions) largely "de-nationalizes" the DV of country-specific influences while inclusion of a lagged DV "detrends" the DV of time-specific influences. Estimating APC rather than levels or first differences of emissions
already removes much of the time-related and country-related trends.9 Including a lagged DV is a way of modeling
the intuition that the best estimate of the APC in emissions in one period is the APC of emissions in the previous
period. In essence, including the lagged DV thereby removes "inertia" that is due to the range of included and
omitted variables that influenced the APC of emissions in the prior period. Of particular note, these strategies reduce
the variance that RHS variables can even potentially explain (since such variance is captured by the lagged DV).
While making finding correlations with RHS variables more difficult, the strategy has the advantage of reducing the
risk of falsely attributing variation in the DV to RHS variables.
Second, a major influence on our choice of how to model emissions is the fact that our institutional variables of
interest as well as several other of our RHS variables (e.g., average accumulated exceedances, self deposition, and
deposition imports) are either known or suspected of not being independent or exogenous of emission levels. To
address these challenges, we adopt the Arellano-Bond dynamic panel data estimator rather than the more traditional
method of using panel-corrected standard errors (PCSE) {Arellano, 1991 #5380; Beck, 1995 #4731}. RHS variables
fail to be exogenous to the extent that their values in one period correlate with the model's error term in current or
8
Since international agreements almost always enter into force only after a certain number of states have formally
accepted them (through ratification or other procedures), all countries that contribute to an agreement entering into
force will exhibit identical membership dates.
9
Thus, in our dataset, for any given countries, emission levels at time period t are very likely to correlate with
emission levels at time period t-1. By contrast, changes in emissions levels at time period t are far less likely to
correlate with changes in emission levels at time period t-1. Using APC further reduces the likelihood of correlation
between emissions in time period t and in time period t-1.
Mitchell/Deane Page 14 of 25
prior time periods, with the former referred to as endogenous, and the latter as predetermined, variables. Arellano
and Bond {, 1991 #5380} developed a method that identifies the number of periods needed to cleanse a RHS
variable of such correlations. Lagging RHS variables the appropriate number of periods creates valid instruments
that are independent of the dependent variable and therefore meet the requirements for inclusion in a model. The
Arellano-Bond method can identify variables as endogenous or predetermined and can identify the number of lags
needed to generate valid instrument. The Arellano-Bond approach also handles missing observations in the interior
of panels more elegantly than does a PCSE approach. Arellano and Bond also developed tests that ensure the
validity of the model's specification particularly with respect to serial correlation of the errors, using both a direct
test of serial correlation and Sargan tests (similar to Hausman tests) of over-identifying restrictions {Arellano, 1991
#5380, 281-283}.
Third, Arellano-Bond procedures, like any involving dynamic panel data estimators, require identification of the
appropriate lag structures for RHS variables. When attempting to identify causal influences, especially when
modeling a DV in which endogeneity is a significant possibility, it makes sense initially to identify the appropriate
lag structures for RHS variables based on theoretical considerations. Selecting the lag period (if any) of RHS
variables based on the highest correlations with the DV would bias the model toward finding those variables to be
influential, even if that "influence" runs counter to theoretical expectations about when and where such influence
should be evident. On the other hand, the selection of lag structure must take empirical considerations into account,
since that is the method by which endogeneity is accounted for. We have adopted a "middle ground" in which all
RHS variables are included with both contemporaneous and one-period lags with the exception of RHS variables
related to GDP (i.e., gdp, gdp_capita, and trade_gdp) that are included with contemporaneous and two-period lags.
This model passes all appropriate specification tests for both the NOx and SOx emissions cases.
The empirical model
We seek to take advantage of available emissions data, within the constraints imposed by data availability of our
RHS variables. We generate a baseline model of SOx and NOx emissions (as APC) using only the lag of the DV.
We then model emissions for all 53 countries on lagged APC of emissions, population, GDP, GDP per capita, trade
openness, political status, forest cover, and laggard (i.e., lagged DV, gdp, gdp_capita, trade_gdp, free, forest,
laggard). We then run the same model with institutional variables -- membership, institutional existence, and
laggard-membership interaction terms -- included (sox1_ceif, sox1_teif, sox1_ceif*laggard, sox2_ceif, sox2_teif,
sox2_ceif*laggard or nox_ceif, nox_teif, nox_ceif*laggard). We follow this with a model that includes the issuespecific variables of average accumulative exceedances, abatement costs, self deposition, nor deposition import (i.e.,
aae_avg, sox_expense_percent, sox_self_depo, sox_depo_import, nox_expense_percent, nox_self_depo,
nox_depo_import). Adding these RHS variables reduces the number of countries to 36 because of the lack of
corresponding data on some RHS variables. We then re-run the model with institutional variables included.
The delineation of the models is summarized here.
Model 1: Lagged DV only
sox = f(sox_t-1); 53 countries
nox = f(nox_t-1); 53 countries
Model 2: Lagged DV plus control variables available for 54 countries
sox = f(sox_t-1, gdp, pop, gdp_capita, trade_gdp, free, forest, laggard); 53 countries
nox = f(nox_t-1, gdp, pop, gdp_capita, trade_gdp, free, forest, laggard); 53 countries
Model 3: Lagged DV plus control variables available for 54 countries plus institutional variables
sox = Model 2 + (sox1_ceif, sox1_teif, sox2_ceif, sox2_teif, sox1_ceif*laggard, sox2_ceif*laggard); 53 countries
nox = Model 2 + (nox_ceif, nox_teif, nox_ceif*laggard); 53 countries
Model 4: Lagged DV plus control variables available for 36 countries
sox = Model 2 + (aae_avg, sox_expense_percent, sox_self_depo, sox_depo_import); 36 countries
nox = Model 2 + (aae_avg, nox_expense_percent, nox_self_depo, nox_depo_import); 36 countries
Model 5: Lagged DV plus control variables available for 36 countries plus institutional variables
sox = Model 4 + (sox1_ceif, sox1_teif, sox2_ceif, sox2_teif, sox1_ceif*laggard, sox2_ceif*laggard); 36 countries
nox = Model 4 + (nox_ceif, nox_teif, nox_ceif*laggard); 36 countries
We predict the sign and strength of coefficients as follows:
 changes in the dependent variable will correlate positively and strongly with prior-period changes in
emissions (sox_t-1). Despite efforts to make observations as independent as possible, we expect the lagged
dependent variable to retain considerable explanatory power.
Mitchell/Deane Page 15 of 25

changes in economic activity (gdp) will correlate strongly and positively with changes in emissions levels.
Increases in economic activity are expected to produce relatively direct and immediate increases in
emissions.
 changes in population (pop) will correlate positively but weakly, if at all, with changes in emissions.
Population seems likely to be an alternate, and less compelling, proxy for levels of economic activity that
should be captured by gdp.
 changes in income/level of development (gdp_capita) will correlate positively and relatively strongly with
changes in emissions. Countries with higher average incomes will expend those resources in ways that lead
to increased pollution and that these influences will be mitigated only slightly by an increasing demand for
environmental protection (a la the environmental Kuznets curve literature).
 changes in trade openness (trade_gdp) will correlate weakly, if at all, with changes in emissions. The
extensive "free trade and the environment" literature has generated considerable debate but few robust
empirical findings about whether trade openness increases or decreases environmental degradation. The
relationship may depend on the types of trade and forms of environmental degradation involved. Trade may
increase emissions if imports are dominated by items (e.g., cars) whose operation increases emissions;
similarly, lower prices may increase total consumption. Trade may decrease emissions if imports replace
highly polluting domestic production; by lowering prices; it may also generate competitive pressures in
domestic sectors that improve the economic and energy efficiency of production. The various vectors of
influence between trade and environmental protection make it likely that trade's net effect on emissions is
weak and also make it difficult to estimate the sign of that net effect.
 political status (free) will correlate positively but weakly with changes in emissions. Political status is
largely invariant over time and therefore is not modeled in change terms. Since countries considered more
free receive higher Freedom House scores, and since freer countries are assumed to have both more vocal
citizens and more responsive governments, emissions should increase at a slower rate in freer countries
than otherwise similar countries that are less free.
 changes in ecological vulnerability as proxied by forest cover (forest) will correlate positively but only
weakly with changes in emissions. Declining forest cover should lead publics to mobilize and push their
governments (even in non-free countries) to take action to reduce this damage, including through reducing
emissions of the responsible acid precipitants. Both data quality and the inherent nature of forest cover,
however, leads us to expect that changes in forest cover will correlate weakly if at all with emissions.
 changes in ecological vulnerability as proxied by average accumulated exceedance (aae_avg) will correlate
positively but strongly with changes in emissions. High levels of exceedances should produce significant
pressure to address the problem. If exceedances get yet higher, that pressure should increase while, if
exceedances decline, we should expect that pressure to abate.
 changes in abatement costs (sox_expense_percent and nox_expense_percent) will correlate positively but
weakly with changes in emissions. Although marginal costs of abatement are monotonically increasing, our
analysis uses total abatement costs and thus can increase or decrease. We should expect emission levels to
decline as abatement (for a 5% reduction as a share of GDP) become cheaper, and to increase as abatement
becomes more expensive. Although we expect abatement costs to have a relatively strong effect, data
quality leads us to expect the observed correlation to be weak, at best.
 changes in self deposition (sox_self_depo and nox_self_depo) will correlate negatively but weakly with
changes in emissions. Increases in self deposition should increase pressure for unilateral action to reduce
emissions. Here we expect only a weak correlation because of the endogeneity of the variable -- increases
in emissions generate increased self deposition which counteracts the political vector just delineated.
 changes in deposition import (sox_depo_import and nox_depo_import) will correlate positively but weakly
with changes in emissions. Increases in deposition import should increase pressure for multilateral action to
reduce emissions and reduce unilateral efforts.
 laggard (laggard) will correlate positively and strongly with changes in emissions. The laggard variable is
invariant over time and therefore is not modeled in change terms. We expect emissions in leader countries
to decline more than, or increase at a slower rate than, those in otherwise similar laggard countries.
As denoted in the literature review above, the results from the three membership variables will address the debate
among realists, institutionalists, and constructivists with respect to institutional influence.
 membership (sox1_ceif, sox2_ceif, and nox_ceif) will correlate negatively and strongly with changes in
emissions. Membership is largely invariant over time and therefore is not modeled in change terms. We
Mitchell/Deane Page 16 of 25
expect emissions in member countries to decline more than, or increase at a slower rate than, those in
otherwise similar non-member countries.
 institutional existence (sox1_teif, sox2_teif, and nox_teif,) will correlate negatively and weakly with
changes in emissions. Institutional existence is largely invariant over time and therefore is not modeled in
change terms. We expect emissions in all countries to decline more than, or increase at a slower rate than,
emissions during otherwise similar periods of time.
 membership-laggard (laggard) will correlate negatively and strongly with changes in emissions. We believe
that laggard states that are members will show some differences from otherwise similar leader members.
We believe that they will show even greater differences from otherwise similar laggard non-members.
Interpreting results
Interpretation of results from a dynamic panel data estimation regression deserves comment. Because many of the
included variables (gdp, pop, gdp_capita, trade_gdp, forest, aae_avg, sox/nox_expense_percent, sox/nox_self_depo,
sox/nox_depo_import) are represented by two terms in the equation (first differences and one- or two-period lagged
differences), the overall effect of the variable and its statistical significance can only be identified through postestimation techniques that combine the effects of both terms.
In addition, coefficients are usually interpreted differently in light of inclusion of the lagged DV. The traditional
approach, which we adopt here, to interpreting the coefficients of RHS variables in such models is to identify "longrun" effects of the variables by correcting for the inertia in the time series. This entails using the lagged difference of
the DV as the "coefficient of adjustment," dividing the coefficients for all variants of each RHS variable in question
by the coefficient of the lagged DV and evaluating statistical significance by using the likelihood that that ratio is
equal to 0 by means of a chi-square statistic.10
Findings
Results from the regressions provide considerable evidence of institutional effects across all three protocols, make
sense of the divergence in conclusions of prior quantitative and qualitative analyses of the protocols studied here,
and represent a "proof of concept" that institutional influence across institutions can be compared convincingly.
Institutional variables: the independent variables of interest
The most interesting and consistent finding with respect to institutional influences is that membership does not
predict emissions in any of our model specifications in which laggard-membership terms are included. Institutional
membership is not statistically significant for any of the three protocols in the more refined country set (36
countries) or variable sets (limited or extended set of control variables) included in the model. We only present
Model 5 results here for clarity of presentation but, across all model specifications, other variants of institutional
variables are statistically significant. In light of the strength of the correlations in most cases despite the noisiness of
data on the laggard variable (see above), this strongly supports the argument that understanding institutional
influence requires modeling different types of potential institutional influence and the value of distinguishing
between leader and laggard members. In several instances, analyses that did not include a membership-laggard
interaction term would have either failed to find, or would have mis-estimated institutional effects, even though
those effects emerge when those terms are included.
Both our analytic priors and the empirical results suggest that Model 5 presents the most accurate results
(corresponding results from Model 3 are presented in Appendix III). In the Model 5 runs, institutional variables for
two of the protocols are statistically significant: the first SOx Protocol's entry into force (sox1_teif) and the NOx
Protocol's membership-laggard interaction term relative to non-members ((nox_ceif*laggard)+nox_ceif). Table 1
presents these results and suggest that institutions may be influential by different pathways, with different states, and
to different degrees.
Table 1: (Model 5 results for all variables are reported in Appendix IV)
Model 5
NOx
36 countries, all variables
Protocol
-1.5%
Effect of protocol entry into force
10
1st SOx
Protocol
-5.4%
2nd SOx
Protocol
5.4%
Thus, in the SOx Model 5, the "long run" coefficient for trade of -167.49 (-16749%) is generated by summing the
coefficients for the difference of trade and the two-period lagged difference of trade included in the equation (0.1434 and -64.3463, respectively) and dividing them by one minus the coefficient of the lagged DV (10.7699=0.2301), i.e., the coefficient of adjustment.
Mitchell/Deane Page 17 of 25
("xxx_teif" from model w/ interaction term)
Effect of leader/laggard variable on emissions
("laggard" from model w/ no interaction term)
Effect of membership across members and non-members
("xxx_ceif" from model w/ no interaction term)
Effect of membership on leaders
("xxx_ceif" from model w/ interaction term)
Effect of membership on laggards
("xxx_ceif+(xxx_ceif*laggard)" from model w/ interaction term)
* - Yellow highlighting identifies variables significant at the .05 level.
0.7%
-1.3%
-1.3%
-1.8%
-1.2%
2.6%
-0.4%
-0.3%
4.7%
-2.5%
-4.2%
-5.7%
Let us consider the interpretation of these results for each protocol in turn.
 Entry into force of the NOx Protocol generally (nox_teif) does not appear to have influenced state behavior.
To assess whether institutional membership influenced behavior, we ran our model with and without the
membership/laggard interaction term. The model without the interaction term identifies the overall effects
of "laggardness" and of "membership," assuming there is no interaction between the variables. For NOx,
the coefficient on laggard is not statistically significant: the average emissions of leaders cannot be
distinguished from those of laggards.11 The coefficient on membership also is not statistically significant:
the average emissions of members cannot be distinguished from those of non-members. Adding the
membership/laggard interaction term to that model allows us to assess the theoretical question, noted
above, of whether the protocol had different influences on leaders and laggards. Model results nicely fit
theoretical expectations: the coefficient of membership for leaders is not statistically significant but for
laggards it is negative and statistically significant, i.e., among leaders, members cannot be distinguished
from non-members, but among laggards, members can be distinguished by their significantly lower
emissions than non-members. Intuitively, this suggests the NOx protocol's effect was to lead laggard states
to "catch up" with leader states in ways that they would not have done otherwise, reducing their emissions
relative to corresponding non-members by an average of 2.5% per year.
 The first SOx Protocol appears to have led all 36 states included in the model -- both the 20 members and
the 16 non-members -- to reduce their emissions an average of 5% per year. Countries that joined did not
behave differently than corresponding non-members, either on average or based on whether they were
leaders or laggards but they did behave differently after the protocol entered into force than before.
 After taking the influence of the first SOx Protocol into account, the second SOx Protocol appears not to
have influenced behavior at all. No statistically significant differences are evident with respect to either the
second SOx Protocol's entry into force or country membership in that Protocol (whether evaluated
generally or by leader/laggard subgroups).
Since the motivating question for this article was "which worked better," a comparison of these results is also
warranted. Using Model 5 results, the most direct and convincing comparison is that between the first and second
SOx Protocols. The exact same DV is used (SOx emissions as APC) and the coefficients are estimated within the
same regression equation. The effect of the first SOx Protocol appears to have been to lead all European states,
regardless of membership, to reduce emissions by about 5% per year even after taking general time trends and the
influence of other RHS variables into account. By contrast, the second SOx Protocol appears not to have had any
influence. The NOx Protocol seems to have had about half the influence of the first SOx Protocol but only on
laggard members, causing them to reduce emissions by only 2.5% per year. In this case, the first SOx Protocol
appears to have had a larger influence on more states than the NOx Protocol but the analysis points to new questions
of institutional influence such as whether an institution that had a large influence on only laggard member states
should be considered more or less influential than one which had a smaller influence on a large group of states,
perhaps including non-member states.
Our model also provides a proxy by which to verify our claim that the problems were relatively similar in their
"level of difficulty" or malignity by examining the coefficient of the lagged DV. In Model 5 (i.e., among the 36
European states most directly affected by European acid precipitants), the coefficient of the lagged DV is almost
11
When relevant coefficients are examined in the model with an interaction term, the average emissions of leaders
cannot be distinguished from those of laggards either among members or among non-members.
Mitchell/Deane Page 18 of 25
identical, picking up 77% of the variance in the DV in the NOx case and 79% of the variance in the SOx case. These
figures tell us about the "speed" with which the time series for each country reaches equilibrium state, and that, once
all variables are included as in Model 5, that the trajectories of emissions of both pollutants is both similar and quite
slow to reach equilibrium levels. Another, more intuitive, interpretation of this result is simply that the "inertia" in
both SOx and NOx emissions that we have not captured by the variables we include in the model is both large and
relatively similar, i.e., for both pollutants the included RHS variables had the opportunity to explain only about onethird of emission variance with the rest already accounted for by other factors. The value of our analysis lies not in
any belief that these numbers reflect the "true" influence of these agreements but rather in demonstrating that the
effects of international institutions can be convincingly and meaningfully compared if the institutions are selected to
ensure they address relatively-similar problems, if the DVs employed are measured in similar units, and if the same
RHS variables are included in the model.
Other RHS variables
Although our interest here was in the institutional variables, the other variables included in the model as controls
deserve some attention. Two findings were particularly contrary to our original expectations. The first was that
economic activity (measured as GDP) was not statistically significant for either pollutant in any of the models
evaluated. We had strong theoretical priors that economic activity would drive emissions and strong confidence in
the quality of our data on both emissions and GDP. The absence of such a correlation has several possible
explanations: a) no correlation exists between GDP and emissions after accounting for other factors (which seems
unlikely), b) that including the lagged DV absorbs the correlation which would otherwise be captured by GDP, or c)
that the correlation is between the levels of GDP and emissions and that correlation was eliminated by transforming
emissions into APC form. We are inclined to believe that the third of these is the actual reason, and investigation of
that question will be the focus of future work.
The second unexpected finding is the frequency with which income (GDP per capita), trade (as a percent of GDP),
and population are statistically significant correlates of emissions, particularly for NOx. Contrary to the
environmental Kuznets curve literature, whenever income has a statistically significant correlation with emissions,
its sign is positive. Given that all the countries in the sample are relatively developed, they should be on the
downward sloping side of the famous $8,000 per person level of income at which income is alleged to correlate with
greater environmental protection {Grossman, 1995 #4562; Harbaugh, 2000 #4563}.
With respect to trade, as noted above, the debate is unresolved in terms of whether trade would tend to increase or
decrease emissions. Our results lend support to the notion that the net effect of trade is downward pressure on
emissions with the sign in all cases being negative, and with statistically significant correlations for most models and
pollutants. An economic interpretation would see this as evidence that trade leads to more efficient production of
goods, with corresponding benefits to the environment. A political interpretation would see this as evidence that
economic interdependence has environmental benefits in reducing transboundary pollution.
Population also has statistically significant correlations with emissions, at least in the 36-country runs. Here too,
however, the sign of the correlation does not fit comfortably with existing theory -- in the cases in which
population's correlation with emissions is statistically significant, it is strongly so (all below the .02 level of
significance) and has a negative sign. Although we can generate theories for why increases in population produce
decreases in emissions, such theories would appear to have a "post hoc" character that calls their plausibility into
question.
Our dual proxies for ecological vulnerability prove consistently significant. Our variable for forest cover (forest)
proves significant in all the SOx models and some of the NOx models. Given that the forest data was interpolated
from only three years of data, this correlation may be an artifact of common trending in the forest data.
The alternative proxy we introduced to improve on ecological vulnerability was average accumulative exceedances
(aae_avg) for all countries. That variable is consistent in both Model 4 (without institutional variables) and 5 (with
institutional variables) for both SOx and NOx but, contrary to expectation, the correlation with emissions is
consistently positive. This variable was introduced, following Sprinz and Vaahtoranta {, 1994 #2912}, as a proxy
for political pressure to make reductions. The most likely explanation of the strongly positive correlation found here
is that this correlation may be capturing the opposing direction of causality, i.e., that higher emissions in a country
lead to greater environmental damage in that country.
Evidence regarding abatement costs is mixed. In the NOx case, abatement costs are found to be significant (at least
at the .10 level) with positive signs. This fits with expectation -- as abatement costs increase, so would average
emission levels or, put differently, lower abatement costs lead to lower emission rates. The same findings are not
found with respect to SOx, however, either bringing the correlation into question or raising concerns about the
degree to which the data is sufficiently sound to find such a correlation.
Mitchell/Deane Page 19 of 25
Finally, strategic interaction variables (self_depo and depo_import) are consistently negative and usually statistically
significant, but with the correlation being stronger for SOx then NOx. Both self-deposition and deposition import
have negative correlations with lower emission levels. It is tempting to interpret this as saying that increases in
environmental damage, whether generated at home or abroad, lead to lower emission levels. But the findings just
presented regarding ecological vulnerability (aae_avg) suggest otherwise.
Alternative model specification
Appendix IV shows the results of initial efforts to evaluate the stability of our model specification. There, we
compare results from Model 5, just presented, with results from a model with the same countries but a more limited
set of variables as well as a model with that more limited set of variables run with all 53 countries for which such
data is available. A comparison of the results in a step-wise way clarifies some of the foregoing analysis.
Comparing Model 5 to a model that includes the same countries but a more limited set of variables (i.e., without the
pollutant-specific variables of aae_avg, xxx _expense_percent, xxx_self_depo, and xxx_depo_import) in which the
signs of institutional variables are generally the same but in which institutional influence appears quite different. In
that model, as already noted, membership alone again does not appear to have any influence for any of the protocols,
once other institutional variables are included. The NOx Protocol appears not to have any influence, the first SOx
protocol appears to have had both an "institutional existence" impact on all states regardless of membership as well
as a "laggard-membership" impact in which laggard members reduce emissions at a rate significantly different than
leader members. And, in this model, the second SOx protocol appears to have an influence through a laggardmembership mechanism as well, although it is significant at the 10% rather than 5% level. The differences between
these models is due simply to inclusion of additional variables in Model 5 and almost all those variables are
statistically significant when included. This suggests that the Model 5 results are more accurate, with the results of
Model 3 illustrating a spurious correlation due to the variables that are left out of Model 3 but included in Model 5.
Comparing this 36-country Model 3 with the same model that includes all 53-countries for which RHS variables is
available highlights additional sensitivities. In the 53-country Model 3, the NOx Protocol and both the SOx
Protocols appear to have laggard-membership influences on emissions. For all variables that are statistically
significant in both cases, the signs and the order of magnitude of the coefficients are the same, although the level of
statistical significance changes for many variables. The additional 17 countries included in this model introduce
significantly more variation across most variables than that among the original 36 countries, since most of those
countries are non-European countries. Efforts are underway to better understand the variation in results across
models and to decipher the sources of variation among the models and thereby improve model specification.
Conclusion
This paper has investigated the comparative influence of three international institutions as a means of illustrating the
importance of specifying quantitative models of institutional influence in ways that reflect the extant debate about
the different mechanisms and locations of institutional influence. Three major conclusions can be drawn from this
study.
First, we have developed and demonstrated an approach that allows modeling multiple institutions in sufficiently
comparable ways that comparison of the coefficients for institutional variables becomes plausible. A strategy that
combines selection of institutions to hold various factors constant, using annual percentage changes as the dependent
variable in all cases, including common right-hand side variables, and using statistical indicators of the "inertia" in
the DV provides a means to address the usual critique that institutions are "simply not comparable."
Second, we have shown the importance of including three different institutional variables to arbitrate between
arguments that institutions have no influence, that their influence operates on members and non-members alike, that
they operate on members more than non-members, and that they operate on laggard members more than leader
members. The differences in outcomes between models that do or do not include these variables and the statistical
significance of both the institutional existence and laggard-membership interaction terms confirm that their inclusion
is important to accurately identifying whether, and what type of, institutional influence is involved. Traditional
modeling that includes membership as the sole indicator of institutional influence is likely to miss or mis-estimate
institutional influence.
Third, the analysis has shown that the specific institutions investigated here vary not only in how much influence
they have but in the causal pathways by which they influence state behavior. Our results suggest that the first SOx
Protocol had a large influence on states through some mechanism (whether normative, technological, or economic)
that was not contingent on a state's membership. The NOx Protocol appears to have had less influence, only
influencing laggard member states and influencing those states to a smaller degree (in percentage terms) than the
first SOx Protocol. By contrast, the second SOx Protocol appears not to have had any significant influence on states,
regardless of either their membership or leader/laggard status. These conclusions are tentative and subject to
Mitchell/Deane Page 20 of 25
revision based on efforts to evaluate the sensitivity of the findings to different model specifications and country
inclusions and their proper interpretation requires making sense of some that run counter to existing theory. Yet,
overall, the results suggest that our approach allows more nuanced and accurate insights into institutional influence
than have been available through other approaches. Once confirmed these findings may help clarify the
inconsistencies that exist between previous quantitative and qualitative studies of these institutions.
Mitchell/Deane Page 21 of 25
Appendix I
Coding of Laggard variables
SOx "Leaders"
We identified leader states by combining evidence from Sprinz and Vaahtoranta regarding state negotiating
positions {, 1994 #2912, 100-101} with the list of SOx Protocol signatories that, as of 1989, had declared reduction
targets greater than the 30% required by the protocol from Levy {, 1993 #1818, Table 3.8, 118}. The following
countries were identified as "leaders" and coded as 0, with all other countries coded as 1, to create the laggard
variable for SOx.
Austria (Levy 1993; Sprinz and Vaahtoranta 1994)
Belarus (Sprinz and Vaahtoranta 1994)
Belgium (Levy 1993; Sprinz and Vaahtoranta 1994)
Canada (Sprinz and Vaahtoranta 1994)
Czech Republic (Sprinz and Vaahtoranta 1994)
Denmark (Levy 1993; Sprinz and Vaahtoranta 1994)
Finland (Levy 1993; Sprinz and Vaahtoranta 1994)
France (Levy 1993; Sprinz and Vaahtoranta 1994)
Germany (Levy 1993; Sprinz and Vaahtoranta 1994)
Hungary (Sprinz and Vaahtoranta 1994)
Italy (Sprinz and Vaahtoranta 1994)
Luxembourg (Levy 1993)
Netherlands (Levy 1993; Sprinz and Vaahtoranta 1994)
Norway (Levy 1993; Sprinz and Vaahtoranta 1994)
Russia (Sprinz and Vaahtoranta 1994)
Slovakia (Sprinz and Vaahtoranta 1994)
Sweden (Levy 1993; Sprinz and Vaahtoranta 1994)
Switzerland (Levy 1993; Sprinz and Vaahtoranta 1994)
Ukraine (Sprinz and Vaahtoranta 1994)
NOx "Leaders"
We identified NOx leaders as countries listed by Levy as having signed the NOx protocol and also made a further
30% reduction pledge {Levy, 1993 #1818, Table 3.2, 97}. The following countries were identified as "leaders" and
coded as 0, with all other countries coded as 1, to create the laggard variable for NOx.
Austria
Belgium
Denmark
Finland
France
Germany
Italy
Liechtenstein
Netherlands
Norway
Sweden
Switzerland
Mitchell/Deane Page 22 of 25
Appendix II
Treatment of missing data
For emissions, population, GDP (and GDP per capita), trade openness, political status, average accumulated
exceedances, membership, and institutional existence, data coverage for all countries was sufficiently extensive that
missing data was not interpolated or otherwise generated, and was left to be addressed through statistical techniques.
To generate annual data for forest cover based on data availability only for 1990, 2000, and 2005, we interpolated
missing values via quadratic polynomial regression for each country. In addition, we used Belgian and Czech
Republic forest cover to estimate Luxembourg and Slovakian forest cover, respectively, since data on the latter
countries were missing. We generated self deposition and deposition import data for missing years (data availability
only for 1985, 1987-1992, 1996, 1997, 1998, 2000, and 2003) using the same method as for forest cover, but we
retained observed values for the years with LRTAP reported data (with only three observed values for forest cover,
the quadratic fit always passed through the observed data points). We extrapolated forecasted values for years 2004
and 2005. For various countries, abatement costs (sox_expense_percent and nox_expense_percent) were not
available because emission levels in those years did not fall within the levels available on the cost curves. Missing
data was filled in from the nearest subsequent year for which corresponding country data was available, with
missing data at the end of the period filled in from the last year for which corresponding country data was available.
To develop a complete dataset that took advantage of all available emissions data required considerable
manipulation of data for states related to Czechoslovakia, Germany, the Soviet Union, and Yugoslavia. For the full
time period under study, emissions data was available on all the successor states to Czechoslovakia, the Soviet
Union, and Yugoslavia (although in the Soviet case, from separate sources) and on Germany but not on the former
West and East Germany. For ease of exposition, we refer to Czechoslovakia, Germany, the Soviet Union, and
Yugoslavia as "aggregate" states, and their successor or predecessor states as "constituent" states. Given the
different treatment of aggregate and constituent states by various data sources, generating independent variable data
that corresponds to the dependent variable proved challenging. For political status, self deposition, and other
deposition, the constituent states to Czechoslovakia, Yugoslavia, and the Soviet Union were filled in with the
corresponding data for those states in the pre-dissolution period; aggregate German data was filled in with the
average of West and East German data for the pre-reunification period. For GDP, GDP per capita, and trade
openness, aggregate German data is available, but data for successor states to Czechoslovakia, Yugoslavia, and the
Soviet Union were generated by determining each constituent states' percentage of the aggregate state's GDP over
the 1990-1992 period and then applying that percentage to the (known) aggregate state's GDP during the period
prior to 1990. For abatement costs, constituent state data is available for Czechoslovakia, Yugoslavia, and the Soviet
Union but aggregate German data was filled in with the average of West and East German data for the prereunification period.
Mitchell/Deane Page 23 of 25
Appendix III
Model 3 results
Model 3:
53 countries, "core" variables
Effect of protocol entry into force
("xxx_teif" from model w/ interaction term)
Effect of leader/laggard variable on emissions
("laggard" from model w/ no interaction term)
Effect of membership across members and non-members
("xxx_ceif" from model w/ no interaction term)
Effect of membership on leaders
("xxx_ceif" from model w/ interaction term)
Effect of membership on laggards
("xxx_ceif+(xxx_ceif*laggard)" from model w/ interaction term)
* - Yellow highlighting identifies variables significant at the .05 level.
NOx
Protocol
-1.3%
1st SOx
Protocol
3.2%
2nd SOx
Protocol
4.1%
2.1%
-0.6%
-0.6%
-2.4%
-7.1%
-5.4%
0.7%
-4.3%
-1.7%
-2.7%
-9.8%
-19.1%
Mitchell/Deane Page 24 of 25
Appendix IV: Model results
Long Run Effects
Model 5
(36 ctry)
NOx
Model 5
(36 ctry)
1st SOx
Model 5
(36 ctry)
2nd SOx
Model 3
(53 ctry)
NOx
Model 3
(53 ctry)
1st SOx
Model 3
(53 ctry)
2nd SOx
Model 3
(36 ctry)
NOx
Model 3
(36 ctry)
1st SOx
Model 3
(36 ctry)
2nd SOx
forest
2.9%
7.7%
3.5%
12.3%
10.4%
11.3%
pop
-0.8%
-1.2%
0.0%
0.4%
-1.4%
-2.1%
gdp
0.0%
0.1%
-0.0%
-0.1%
-0.0%
0.1%
trade_gdp
-28027.0% -16750.0%
-42976.4% -92152.6%
-2709.2% -55435.0%
gdp_capita
3906.0% -816.60%
4442.2% 11108.9%
3876.7% 1376.6%
free
0.1%
-1.4%
0.4%
-4.9%
-1.0%
0.3%
laggard
1.8%
1.3%
4.1%
3.5%
1.0%
5.4%
aae_avg
0.0%
0.1%
xxx_expense_percent
16.8%
-47.0%
xxx_self_depo
-91.9%
-154.0%
xxx_depo_import
-108.0%
-235.4%
xxx_teif
-1.5%
-5.4%
5.4%
-1.3%
3.2%
4.1%
-1.8%
-7.7%
5.2%
xxx_ceif
-0.4%
-0.3%
4.7%
0.7%
-4.3%
-1.7%
1.0%
-0.5%
6.7%
xxx_ceif*laggard
-2.0%
-3.9%
-10.5%
-3.4%
-5.5%
-17.4%
-2.7%
-11.8%
-19.2%
(xxx_ceif*laggard)+xxx_ceif -2.45%
-4.2%
-5.7%
-2.7%
-9.8%
-19.1%
-1.7%
-12.3%
-12.5%
Coefficients with statistical significance greater than 5% are designated with yellow highlighting, those with statistical significance between 5 and 10% are
designated with orange highlighting. xxx_ represents nox_ for the NOx columns and sox_ for the SOx columns.
Mitchell/Deane Page 25 of 25
References