Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Chapter 16 Qualitative and Limited Dependent Variable Models Walter R. Paczkowski Rutgers University Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 1 Chapter Contents 16.1 Models with Binary Dependent Variables 16.2 The Logit Model for Binary Choice 16.3 Multinomial Logit 16.4 Conditional Logit 16.5 Ordered Choice Models 16.6 Models for Count Data 16.7 Limited Dependent Variables Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 2 In this chapter, we: – Examine models that are used to describe choice behavior, and which do not have the usual continuous dependent variable – Introduce a class of models with dependent variables that are limited • They are continuous, but that their range of values is constrained in some way, and their values not completely observable • Alternatives to least squares estimation are needed since the least squares estimator is biased and inconsistent Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 3 16.1 Models with Binary Dependent Variables Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 4 16.1 Models with Binary Dependent Variables Many of the choices that individuals and firms make are ‘‘either–or’’ in nature – Such choices can be represented by a binary (indicator) variable that takes the value 1 if one outcome is chosen and the value 0 otherwise – The binary variable describing a choice is the dependent variable rather than an independent variable Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 5 16.1 Models with Binary Dependent Variables Examples: – Models of why some individuals take a second or third job, and engage in ‘‘moonlighting’’ – Models of why some legislators in the U.S. House of Representatives vote for a particular bill and others do not – Models explaining why some loan applications are accepted and others are not at a large metropolitan bank – Models explaining why some individuals vote for increased spending in a school board election and others vote against – Models explaining why some female college students decide to study engineering and others do not Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 6 16.1 Models with Binary Dependent Variables We represent an individual’s choice by the indicator variable: Eq. 16.1 Principles of Econometrics, 4th Edition 1 individual drives to work y 0 individual takes bus to work Chapter 16: Qualitative and Limited Dependent Variable Models Page 7 16.1 Models with Binary Dependent Variables If the probability that an individual drives to work is p, then P[y = 1] = p – The probability that a person uses public transportation is P[y = 0] = 1 – p – The probability function for such a binary random variable is: f ( y) p y (1 p)1 y , Eq. 16.2 y 0,1 with E y p, var y p 1 p Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 8 16.1 Models with Binary Dependent Variables For our analysis, define the explanatory variable as: x = (commuting time by bus - commuting time by car) Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 9 16.1 Models with Binary Dependent Variables 16.1.1 The Linear Probability Model We could model the indicator variable y using the linear model, however, there are several problems: – It implies marginal effects of changes in continuous explanatory variables are constant, which cannot be the case for a probability model • This feature also can result in predicted probabilities outside the [0, 1] interval – The linear probability model error term is heteroskedastic, so that a better estimator is generalized least squares Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 10 16.1 Models with Binary Dependent Variables 16.1.1 The Linear Probability Model In regression analysis we break the dependent variable into fixed and random parts – If we do this for the indicator variable y, we have: y E ( y) e p e Eq. 16.3 – Assuming that the relationship is linear: Eq. 16.4 Principles of Econometrics, 4th Edition E ( y ) p 1 2 x Chapter 16: Qualitative and Limited Dependent Variable Models Page 11 16.1 Models with Binary Dependent Variables 16.1.1 The Linear Probability Model The linear regression model for explaining the choice variable y is called the linear probability model: Eq. 16.5 Principles of Econometrics, 4th Edition y E ( y ) e 1 2 x e Chapter 16: Qualitative and Limited Dependent Variable Models Page 12 16.1 Models with Binary Dependent Variables 16.1.1 The Linear Probability Model The probability density functions for y and e are: Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 13 16.1 Models with Binary Dependent Variables 16.1.1 The Linear Probability Model Using these values it can be shown that the variance of the error term e is: var e 1 2 x 1 1 2 x – The estimated variance of the error term is: Eq. 16.6 Principles of Econometrics, 4th Edition ˆ i2 var ei b1 b2 xi 1 b1 b2 xi Chapter 16: Qualitative and Limited Dependent Variable Models Page 14 16.1 Models with Binary Dependent Variables 16.1.1 The Linear Probability Model We can transform the data as: yi* yi ˆ i xi* xi ˆ i – And estimate the model: ˆ i1 2 xi* ei* yi* 1 by least squares to produce the feasible generalized least squares estimates – Both least squares and feasible generalized least squares are consistent estimators of the regression parameters Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 15 16.1 Models with Binary Dependent Variables 16.1.1 The Linear Probability Model If we estimate the parameters of Eq. 16.5 by least squares, we obtain the fitted model explaining the systematic portion of y – This systematic portion is p p̂ b1 b2 x Eq. 16.7 – By substituting alternative values of x,we can easily obtain values that are less than zero or greater than one Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 16 16.1 Models with Binary Dependent Variables 16.1.1 The Linear Probability Model The underlying feature that causes these problems is that the linear probability model implicitly assumes that increases in x have a constant effect on the probability of choosing to drive Eq. 16.8 Principles of Econometrics, 4th Edition dp 2 dx Chapter 16: Qualitative and Limited Dependent Variable Models Page 17 16.1 Models with Binary Dependent Variables 16.1.2 The Probit Model To keep the choice probability p within the interval [0, 1], a nonlinear S-shaped relationship between x and p can be used Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 18 16.1 Models with Binary Dependent Variables FIGURE 16.1 (a) Standard normal cumulative distribution function (b) Standard normal probability density function 16.1.2 The Probit Model Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 19 16.1 Models with Binary Dependent Variables 16.1.2 The Probit Model A functional relationship that is used to represent such a curve is the probit function – The probit function is related to the standard normal probability distribution: 1 .5 z 2 ( z ) e 2 – The probit function is: Eq. 16.9 Principles of Econometrics, 4th Edition ( z ) P[ Z z ] z 1 .5u 2 e du 2 Chapter 16: Qualitative and Limited Dependent Variable Models Page 20 16.1 Models with Binary Dependent Variables 16.1.2 The Probit Model The probit statistical model expresses the probability p that y takes the value 1 to be: p P[ Z 1 2 x] (1 2 x) Eq. 16.10 – The probit model is said to be nonlinear Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 21 16.1 Models with Binary Dependent Variables 16.1.3 Interpretation of the Probit Model We can examine the marginal effect of a one-unit change in x on the probability that y = 1 by considering the derivative: Eq. 16.11 Principles of Econometrics, 4th Edition dp d (t ) dt (1 2 x)2 dx dt dx Chapter 16: Qualitative and Limited Dependent Variable Models Page 22 16.1 Models with Binary Dependent Variables 16.1.3 Interpretation of the Probit Model Eq. 16.11 has the following implications: 1. Since Φ(β1+ β2x) is a probability density function, its value is always positive 2. As x changes, the value of the function Φ(β1+ β2x) changes 3. if β1+ β2x is large, then the probability that the individual chooses to drive is very large and close to one • Similarly if β1+ β2x is small Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 23 16.1 Models with Binary Dependent Variables 16.1.3 Interpretation of the Probit Model We estimate the probability p to be: pˆ (1 2 x) Eq. 16.12 – By comparing to a threshold value, like 0.5, we can predict choice using the rule: 1 yˆ 0 Principles of Econometrics, 4th Edition pˆ 0.5 pˆ 0.5 Chapter 16: Qualitative and Limited Dependent Variable Models Page 24 16.1 Models with Binary Dependent Variables 16.1.4 Maximum Likelihood Estimation of the Probit Model The probability function for y is combined with the probit model to obtain: Eq. 16.13 f ( yi ) [(1 2 xi )]yi [1 (1 2 xi )]1 yi , yi 0,1 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 25 16.1 Models with Binary Dependent Variables 16.1.4 Maximum Likelihood Estimation of the Probit Model If the three individuals are independently drawn, then: f ( y1 , y2 , y3 ) f ( y1 ) f ( y2 ) f ( y3 ) – The probability of observing y1 = 1, y2 = 1, and y3 = 0 is: P[ y1 1, y2 1, y3 0] f (1, 1, 0) f (1) f (1) f (0) Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 26 16.1 Models with Binary Dependent Variables 16.1.4 Maximum Likelihood Estimation of the Probit Model We now have: P[ y1 1, y2 1, y3 0] 1 2 (15) 1 2 (6) Eq. 16.14 L β1 ,β 2 1 1 2 (7) for x1 = 15, x2 = 6, and x3 = 7 – This function, which gives us the probability of observing the sample data, is called the likelihood function • The notation L(β1, β2) indicates that the likelihood function is a function of the unknown parameters, β1and β2 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 27 16.1 Models with Binary Dependent Variables 16.1.4 Maximum Likelihood Estimation of the Probit Model Eq. 16.15 In practice, instead of maximizing Eq. 16.14, we maximize the logarithm of Eq. 16.14, which is called the log-likelihood function: ln L β1 ,β 2 ln 1 2 (15) 1 2 (6) 1 1 2 (7) ln 1 2 (15) ln 1 2 (6) ln 1 1 2 (7) – The maximization of the log-likelihood function is easier than the maximization of Eq. 16.14 – The values that maximize the log-likelihood function also maximize the likelihood function • They are the maximum likelihood estimates Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 28 16.1 Models with Binary Dependent Variables 16.1.4 Maximum Likelihood Estimation of the Probit Model A feature of the maximum likelihood estimation procedure is that while its properties in small samples are not known, we can show that in large samples the maximum likelihood estimator is normally distributed, consistent and best, in the sense that no competing estimator has smaller variance Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 29 16.1 Models with Binary Dependent Variables 16.1.5 A Transportation Example Let DTIME = (BUSTIME-AUTOTIME)÷10, which is the commuting time differential in 10minute increments – The probit model is: P(AUTO = 1) = Φ(β1+ β2DTIME) – The maximum likelihood estimates of the parameters are: 1 2 DTIME 0.0644 0.3000 DTIMEi (se) Principles of Econometrics, 4th Edition (0.3992) (0.1029) Chapter 16: Qualitative and Limited Dependent Variable Models Page 30 16.1 Models with Binary Dependent Variables 16.1.5 A Transportation Example The marginal effect of increasing public transportation time, given that travel via public transportation currently takes 20 minutes longer than auto travel is: dp (1 2 DTIME )2 (0.0644 0.3000 2)(0.3000) dDTIME (0.5355)(0.3000) 0.3456 0.3000 0.1037 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 31 16.1 Models with Binary Dependent Variables 16.1.5 A Transportation Example If an individual is faced with the situation that it takes 30 minutes longer to take public transportation than to drive to work, then the estimated probability that auto transportation will be selected is: pˆ (1 2 DTIME ) (0.0644 0.3000 3) 0.7983 – Since 0.7983 > 0.5, we “predict” the individual will choose to drive Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 32 16.1 Models with Binary Dependent Variables 16.1.6 Further PostEstimation Analysis Rather than evaluate the marginal effect at a specific value, or the mean value, the average marginal effect (AME) is often considered: 1 AME N N i 1 β1 β 2 DTIMEi β 2 – For our problem: AME 0.0484 – The sample standard deviation is: 0.0365 – Its minimum and maximum values are 0.0025 and 0.1153 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 33 16.1 Models with Binary Dependent Variables 16.1.6 Further PostEstimation Analysis Consider the marginal effect: dp (1 2 DTIME )2 g 1 , 2 dDTIME – The marginal effect function is nonlinear Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 34 16.2 The Logit Model for Binary Choice Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 35 16.2 The Logit Model for Binary Choice Probit model estimation is numerically complicated because it is based on the normal distribution – A frequently used alternative to the probit model for binary choice situations is the logit model – These models differ only in the particular Sshaped curve used to constrain probabilities to the [0, 1] interval Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 36 16.2 The Logit Model for Binary Choice Eq. 16.16 Eq. 16.17 If L is a logistic random variable, then its probability density function is: e l (l ) , l 2 l 1 e – The cumulative distribution function for a logistic random variable is: 1 l p[ L l ] 1 el Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 37 16.2 The Logit Model for Binary Choice The probability p that the observed value y takes the value 1 is: Eq. 16.18 p P L 1 2 x 1 2 x Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models 1 1 e1 2 x Page 38 16.2 The Logit Model for Binary Choice The probability that y = 1 is: p 1 1 e 1 2 x exp 1 2 x 1 exp 1 2 x The probability that y = 0 is: 1 1 p 1 exp 1 2 x Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 39 16.2 The Logit Model for Binary Choice The shapes of the logistic and normal probability density functions are somewhat different, and maximum likelihood estimates of β1 and β2 will be different – However, the marginal probabilities and the predicted probabilities differ very little in most cases Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 40 16.2 The Logit Model for Binary Choice 16.2.1 An Empirical Example from Marketing Consider the Coke example with: 1 if Coke is chosen COKE 0 if Pepsi is chosen Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 41 16.2 The Logit Model for Binary Choice 16.2.1 An Empirical Example from Marketing Based on ‘‘scanner’’ data on 1,140 individuals who purchased Coke or Pepsi, the probit and logit models for the choice are: pCOKE E COKE β1 β 2 PRATIO β3 DISP_COKE β 4 DISP_PEPSI pCOKE E COKE γ1 γ 2 PRATIO γ3 DISP_COKE γ 4 DISP_PEPSI Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 42 16.2 The Logit Model for Binary Choice Table 16.1 Coke-Pepsi Choice Models 16.2.1 An Empirical Example from Marketing Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 43 16.2 The Logit Model for Binary Choice 16.2.1 An Empirical Example from Marketing The parameters and their estimates vary across the models and no direct comparison is very useful, but some rules of thumb exist – Roughly: ˆ γ Logit 4β LPM β Probit 2.5βˆ LPM γ Logit 1.6β Probit Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 44 16.2 The Logit Model for Binary Choice 16.2.2 Wald Hypothesis Tests If the null hypothesis is H0: βk = c, then the test statistic using the probit model is: t βk c se β k a t N K – The t-test is based on the Wald principle Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 45 16.2 The Logit Model for Binary Choice 16.2.2 Wald Hypothesis Tests Using the probit model, consider the two hypotheses: Hypothesis (1) H 0 : β3 β 4 , H1 :β3 β 4 Hypothesis (2) H 0 : β3 0, β 4 0, H1 : either β3 or β4 is not zero Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 46 16.2 The Logit Model for Binary Choice 16.2.2 Wald Hypothesis Tests To test hypothesis (1) in a linear model, we would compute: t β DISP _ COKE β DISP _ PEPSI se β DISP _ COKE β DISP _ PEPSI a t1140 41136 – Noting that it is a two-tail hypothesis, we reject the null hypothesis at the α = 0.05 level if t ≥ 1.96 or t ≤ -1.96 – The calculated t-value is t = -2.3247, so we reject the null hypothesis • We conclude that the effects of the Coke and Pepsi displays are not of equal magnitude with opposite sign Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 47 16.2 The Logit Model for Binary Choice 16.2.2 Wald Hypothesis Tests A generalization of the Wald statistic is used to test the joint null hypothesis (2) that neither the Coke nor Pepsi display affects the probability of choosing Coke Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 48 16.2 The Logit Model for Binary Choice 16.2.3 Likelihood Ratio Hypothesis Tests When using maximum likelihood estimators, such as probit and logit, tests based on the likelihood ratio principle are generally preferred – The idea is much like the F-test • One test component is the log-likelihood function value in the unrestricted, full model (ln LU) evaluated at the maximum likelihood estimates • The second ingredient is the log-likelihood function value from the model that is ‘‘restricted’’ by imposing the condition that the null hypothesis is true (ln LR) Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 49 16.2 The Logit Model for Binary Choice 16.2.3 Likelihood Ratio Hypothesis Tests The restricted probit model is obtained by imposing the condition β3 = -β4: pCOKE E COKE β1 β 2 PRATIO β3 DISP _ COKE β 4 DISP _ PEPSI β1 β 2 PRATIO β 4 DISP _ COKE β 4 DISP _ PEPSI β1 β 2 PRATIO _ β 4 DISP _ PEPSI DISP _ COKE – We have LR = -713.6595 – The likelihood ratio test statistic value is: LR 2 ln LU ln LR 2 710.9486 713.6595 5.4218 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 50 16.2 The Logit Model for Binary Choice 16.2.3 Likelihood Ratio Hypothesis Tests To test the null hypothesis (2), use the restricted model E(COKE) = Φ(β1 + β2PRATIO) – The value of the likelihood ratio test statistic is 19.55 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 51 16.3 Multinomial Logit Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 52 16.3 Multinomial Logit We are often faced with choices involving more than two alternatives – These are called multinomial choice situations • If you are shopping for a laundry detergent, which one do you choose? Tide, Cheer, Arm & Hammer, Wisk, and so on • If you enroll in the business school, will you major in economics, marketing, management, finance, or accounting? Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 53 16.3 Multinomial Logit The estimation and interpretation of the models is, in principle, similar to that in logit and probit models – The models go under the names • multinomial logit • conditional logit • multinomial probit Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 54 16.3 Multinomial Logit 16.3.1 Multinomial Logit Choice Probabilities As in the logit and probit models, we will try to explain the probability that the ith person will choose alternative j pij = P individual i chooses alternative j – Assume J = 3 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 55 16.3 Multinomial Logit 16.3.1 Multinomial Logit Choice Probabilities For a single explanatory factor, the choice probabilities are: 1 Eq. 16.19a pi1 Eq. 16.19b exp 12 22 xi pi 2 , j2 1 exp 12 22 xi exp 13 23 xi Eq. 16.19c exp 13 23 xi pi 3 , j 3 1 exp 12 22 xi exp 13 23 xi Principles of Econometrics, 4th Edition 1 exp 12 22 xi exp 13 23 xi Chapter 16: Qualitative and Limited Dependent Variable Models , j 1 Page 56 16.3 Multinomial Logit 16.3.1 Multinomial Logit Choice Probabilities A distinguishing feature of the multinomial logit model in Eq. 16.19 is that there is a single explanatory variable that describes the individual, not the alternatives facing the individual – Such variables are called individual specific – To distinguish the alternatives, we give them different parameter values Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 57 16.3 Multinomial Logit 16.3.2 Maximum Likelihood Estimation Suppose that we observe three individuals, who choose alternatives 1, 2, and 3, respectively – Assuming that their choices are independent, then the probability of observing this outcome is: P y11 1, y22 1, y33 1 p11 p22 p33 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 58 16.3 Multinomial Logit 16.3.2 Maximum Likelihood Estimation Or P y11 1, y22 1, y33 1 1 1 exp 12 22 x1 exp 13 23 x1 exp 12 22 x2 1 exp 12 22 x2 exp 13 23 x2 exp 13 23 x3 1 exp 12 22 x3 exp 13 23 x3 L 12 , 22 , 13 , 23 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 59 16.3 Multinomial Logit 16.3.2 Maximum Likelihood Estimation Maximum likelihood estimation seeks those values of the parameters that maximize the likelihood or, more specifically, the log-likelihood function, which is easier to work with mathematically Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 60 16.3 Multinomial Logit 16.3.3 Post-Estimation Analysis For the value of the explanatory variable x0, we can calculate the predicted probabilities of each outcome being selected – For alternative 1: p01 1 1 exp 12 22 x0 exp 13 23 x0 – Similarly for alternatives 2 and 3 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 61 16.3 Multinomial Logit 16.3.3 Post-Estimation Analysis The βs are not ‘‘slopes’’ – The marginal effect is the effect of a change in x, everything else held constant, on the probability that an individual chooses alternative m = 1, 2, or 3: Eq. 16.20 Principles of Econometrics, 4th Edition pim xi all else constant 3 pim pim 2 m 2 j pij xi j 1 Chapter 16: Qualitative and Limited Dependent Variable Models Page 62 16.3 Multinomial Logit 16.3.3 Post-Estimation Analysis Alternatively, and somewhat more simply, the difference in probabilities can be calculated for two specific values of xi p1 pb1 pa1 1 exp 12 22 xb exp 13 23 xb Principles of Econometrics, 4th Edition 1 1 1 exp 12 22 xa exp 13 23 xa Chapter 16: Qualitative and Limited Dependent Variable Models Page 63 16.3 Multinomial Logit 16.3.3 Post-Estimation Analysis Eq. 16.21 Another useful interpretive device is the probability ratio – It shows how many times more likely category j is to be chosen relative to the first category P yi j pij exp 1 j 2 j xi P yi 1 pi1 j 2,3 – The effect on the probability ratio of changing the value of xi is given by the derivative: Eq. 16.22 pij pi1 Principles of Econometrics, 4th Edition xi 2 j exp 1 j 2 j xi Chapter 16: Qualitative and Limited Dependent Variable Models j 2,3 Page 64 16.3 Multinomial Logit 16.3.3 Post-Estimation Analysis An interesting feature of the probability ratio Eq. 16.21 is that it does not depend on how many alternatives there are in total – There is the implicit assumption in logit models that the probability ratio between any pair of alternatives is independent of irrelevant alternatives (IIA) • This is a strong assumption, and if it is violated, multinomial logit may not be a good modeling choice • It is especially likely to fail if several alternatives are similar Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 65 16.3 Multinomial Logit 16.3.3 Post-Estimation Analysis Tests for the IIA assumption work by dropping one or more of the available options from the choice set and then re-estimating the multinomial model – If the IIA assumption holds, then the estimates should not change very much – A statistical comparison of the two sets of estimates, one set from the model with a full set of alternatives, and the other from the model using a reduced set of alternatives, is carried out using a Hausman contrast test proposed by Hausman and McFadden Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 66 16.3 Multinomial Logit Table 16.2 Maximum Likelihood Estimates of PSE Choice 16.3.4 An Example Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 67 16.3 Multinomial Logit Table 16.3 Effects of Grades on Probability of PSE Choice 16.3.4 An Example Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 68 16.4 Conditional Logit Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 69 16.4 Conditional Logit Variables like PRICE are individual- and alternative-specific because they vary from individual to individual and are different for each choice the consumer might make – This type of information is very different from what we assumed was available in the multinomial logit model, where the explanatory variable xi was individual-specific; it did not change across alternatives Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 70 16.4 Conditional Logit 16.4.1 Conditional Logit Choice Probabilities Consider a model for the probability that individual i chooses alternative j: pij P individual i chooses alternative j The conditional logit model specifies these probabilities as: Eq. 16.23 pij exp 1 j 2 PRICEij exp 11 2 PRICEi1 exp 12 2 PRICEi 2 exp 13 2 PRICEi 3 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 71 16.4 Conditional Logit 16.4.1 Conditional Logit Choice Probabilities Set β13 = 0 – Estimation of the unknown parameters is by maximum likelihood • Suppose that we observe three individuals, who choose alternatives one, two, and three, respectively Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 72 16.4 Conditional Logit 16.4.1 Conditional Logit Choice Probabilities We have: P y11 1, y22 1, y33 1 p11 p22 p33 exp 11 2 PRICE11 exp 11 2 PRICE11 exp 12 2 PRICE12 exp 2 PRICE13 exp 12 2 PRICE22 exp 11 2 PRICE21 exp 12 2 PRICE22 exp 2 PRICE23 exp 2 PRICE33 exp 11 2 PRICE31 exp 12 2 PRICE32 exp 2 PRICE33 L 12 , 22 , 2 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 73 16.4 Conditional Logit 16.4.2 Post-Estimation Analysis The own price effect is: pij Eq. 16.24 PRICEij pij 1 pij 2 The change in probability of alternative j being selected if the price of alternative k changes (k ≠ j) is: Eq. 16.25 Principles of Econometrics, 4th Edition pij PRICEik pij pik 2 Chapter 16: Qualitative and Limited Dependent Variable Models Page 74 16.4 Conditional Logit 16.4.2 Post-Estimation Analysis An important feature of the conditional logit model is that the probability ratio between alternatives j and k is: pij pik exp 1 j 2 PRICEij exp 1k 2 PRICEik exp 1 j 1k 2 PRICEij PRICEik – The probability ratio depends on the difference in prices, but not on the prices themselves Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 75 16.4 Conditional Logit Table 16.4a Conditional Logit Parameter Estimates 16.4.2 Post-Estimation Analysis Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 76 16.4 Conditional Logit Table 16.4b Marginal Effect of Price on Probability of Pepsi Choice 16.4.2 Post-Estimation Analysis Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 77 16.4 Conditional Logit 16.4.2 Post-Estimation Analysis Models that do not require the IIA assumption have been developed, but they are difficult – These include the multinomial probit model, which is based on the normal distribution, and the nested logit and mixed logit models Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 78 16.4 Conditional Logit 16.4.3 An Example The predicted probability of a Pepsi purchase, given that the price of Pepsi is $1.00, the price of 7-Up is $1.25 and the price of Coke is $1.10 is: pˆ i1 exp 11 2 1.00 exp 11 2 1.00 exp 12 2 1.25 exp 2 1.10 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 79 0.4832 16.4 Conditional Logit 16.4.3 An Example The standard error for this predicted probability is 0.0154, which is computed via ‘‘the delta method.’’ – If we raise the price of Pepsi to $1.10, we estimate that the probability of its purchase falls to 0.4263 (se = 0.0135) – If the price of Pepsi stays at $1.00 but we increase the price of Coke by 15 cents, then we estimate that the probability of a consumer selecting Pepsi rises by 0.0445 (se = 0.0033) – These numbers indicate to us the responsiveness of brand choice to changes in prices, much like elasticities Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 80 16.5 Ordered Choice Models Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 81 16.5 Ordered Choice Models The choice options in multinomial and conditional logit models have no natural ordering or arrangement – However, in some cases choices are ordered in a specific way Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 82 16.5 Ordered Choice Models Examples: 1. Results of opinion surveys in which responses can be strongly disagree, disagree, neutral, agree or strongly agree 2. Assignment of grades or work performance ratings 3. Standard and Poor’s rates bonds as AAA, AA, A, BBB and so on 4. Levels of employment are unemployed, parttime, or full-time Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 83 16.5 Ordered Choice Models When modeling these types of outcomes numerical values are assigned to the outcomes, but the numerical values are ordinal, and reflect only the ranking of the outcomes – In the first example, we might assign a dependent variable y the values: 1 2 y 3 4 5 Principles of Econometrics, 4th Edition strongly disagree disagree neutral agree strongly agree Chapter 16: Qualitative and Limited Dependent Variable Models Page 84 16.5 Ordered Choice Models There may be a natural ordering to college choice – We might rank the possibilities as: Eq. 16.26 3 4-year college (the full college experience) y 2 2-year college (a partial college experience) 1 no college – The usual linear regression model is not appropriate for such data, because in regression we would treat the y values as having some numerical meaning when they do not Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 85 16.5 Ordered Choice Models 16.5.1 Ordered Probit Choice Probabilities When faced with a ranking problem, we develop a ‘‘sentiment’’ about how we feel concerning the alternative choices, and the higher the sentiment, the more likely a higher-ranked alternative will be chosen – This sentiment is, of course, unobservable to the econometrician – Unobservable variables that enter decisions are called latent variables Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 86 16.5 Ordered Choice Models 16.5.1 Ordered Probit Choice Probabilities For college choice, a latent variable may be grades: yi* GRADESi ei – This model is not a regression model, because the dependent variable is unobservable • Consequently it is sometimes called an index model Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 87 16.5 Ordered Choice Models FIGURE 16.2 Ordinal choices relative to thresholds 16.5.1 Ordered Probit Choice Probabilities Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 88 16.5 Ordered Choice Models 16.5.1 Ordered Probit Choice Probabilities We can now specify: 3 (4-year college) if y 2 (2-year college) if 1 (no college) if Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models yi* 2 1 yi* 2 yi* 1 Page 89 16.5 Ordered Choice Models 16.5.1 Ordered Probit Choice Probabilities If we assume that the errors have the standard normal distribution, N(0, 1), an assumption that defines the ordered probit model, then we can calculate the following: P yi 1 P yi* 1 P GRADESi ei 1 P ei 1 GRADESi 1 GRADESi Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 90 16.5 Ordered Choice Models 16.5.1 Ordered Probit Choice Probabilities Also: P yi 2 P 1 yi* 2 P 1 GRADESi ei 2 P 1 GRADESi ei 2 GRADESi 2 GRADESi 1 GRADESi Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 91 16.5 Ordered Choice Models 16.5.1 Ordered Probit Choice Probabilities Finally: P yi 3 P yi* 2 P GRADESi ei 2 P ei 2 GRADESi 1 2 GRADESi Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 92 16.5 Ordered Choice Models 16.5.2 Estimation and Interpretation If we observe a random sample of N = 3 individuals, with the first not going to college (y1 = 1), the second attending a two-year college (y2 = 2), and the third attending a four-year college (y3 = 3), then the likelihood function is: L , 1 , 2 P y1 1 P y2 2 P y3 3 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 93 16.5 Ordered Choice Models 16.5.2 Estimation and Interpretation Econometric software includes options for both ordered probit, which depends on the errors being standard normal, and ordered logit, which depends on the assumption that the random errors follow a logistic distribution – Most economists will use the normality assumption – Many other social scientists use the logistic – There is little difference between the results Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 94 16.5 Ordered Choice Models 16.5.2 Estimation and Interpretation The types of questions we can answer with this model are the following: 1. What is the probability that a high school graduate with GRADES = 2.5 (on a 13-point scale, with one being the highest) will attend a two-year college? Pˆ y 2 | GRADES 2.5 2 2.5 1 2.5 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 95 16.5 Ordered Choice Models 16.5.2 Estimation and Interpretation The types of questions (Continued): 2. What is the difference in probability of attending a four-year college for two students, one with GRADES = 2.5 and another with GRADES = 4.5? Pˆ y 2 | GRADES 4.5 Pˆ y 2 | GRADES 2.5 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 96 16.5 Ordered Choice Models 16.5.2 Estimation and Interpretation The types of questions (Continued): 3. If we treat GRADES as a continuous variable, what is the marginal effect on the probability of each outcome, given a one-unit change in GRADES? P y 1 GRADES P y 2 GRADES P y 3 GRADES Principles of Econometrics, 4th Edition 1 GRADES 1 GRADES 2 GRADES 2 GRADES Chapter 16: Qualitative and Limited Dependent Variable Models Page 97 16.5 Ordered Choice Models Table 16.5 Ordered Probit Parameter Estimates 16.5.3 An Example Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 98 16.6 Models for Count Data Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 99 16.6 Models for Count Data When the dependent variable in a regression model is a count of the number of occurrences of an event, the outcome variable is y = 0, 1, 2, 3, … – These numbers are actual counts, and thus different from ordinal numbers Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 100 16.6 Models for Count Data Examples: – The number of trips to a physician a person makes during a year – The number of fishing trips taken by a person during the previous year – The number of children in a household – The number of automobile accidents at a particular intersection during a month – The number of televisions in a household – The number of alcoholic drinks a college student takes in a week Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 101 16.6 Models for Count Data The probability distribution we use as a foundation is the Poisson, not the normal or the logistic – If Y is a Poisson random variable, then its probability function is: Eq. 16.27 e y f y P Y y , y! y 0,1,2, where y ! y y 1 y 2 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models 1 Page 102 16.6 Models for Count Data In a regression model, we try to explain the behavior of E(Y) as a function of some explanatory variables – We do the same here, keeping the value of E(Y) ≥ 0 by defining: E Y exp 1 2 x Eq. 16.28 – This choice defines the Poisson regression model for count data Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 103 16.6 Models for Count Data 16.6.1 Maximum Likelihood Estimation Suppose we randomly select N = 3 individuals from a population and observe that their counts are y1 = 0, y2 = 2, and y3 = 2, indicating 0, 2, and 2 occurrences of the event for these three individuals – The likelihood function is the joint probability function of the observed data is: L 1 , 2 P Y 0 P Y 2 P Y 2 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 104 16.6 Models for Count Data 16.6.1 Maximum Likelihood Estimation The log-likelihood function is: ln L 1 , 2 ln P Y 0 ln P Y 2 ln P Y 2 – Using Eq. 16.28 for λ, the log of the probability function is: e y ln P Y y ln y! y ln ln y ! exp 1 2 x y 1 2 x ln y ! Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 105 16.6 Models for Count Data 16.6.1 Maximum Likelihood Estimation Then the log-likelihood function, given a sample of N observations, becomes: ln L 1 , 2 exp 1 2 xi yi 1 2 xi ln yi ! N i 1 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 106 16.6 Models for Count Data 16.6.2 Interpretation of the Poisson Regression Model Prediction of the conditional mean of y is straightforward: E y0 0 exp 1 2 x0 The probability of a particular number of occurrences can be estimated by inserting the estimated conditional mean into the probability function, as: Pr Y y Principles of Econometrics, 4th Edition exp 0 0y y! , Chapter 16: Qualitative and Limited Dependent Variable Models y 0,1, 2, Page 107 16.6 Models for Count Data 16.6.2 Interpretation of the Poisson Regression Model The marginal effect is: E yi i 2 xi Eq. 16.29 – This can be expressed as a percentage, which can be useful: %E y xi Principles of Econometrics, 4th Edition 100 E yi E yi xi Chapter 16: Qualitative and Limited Dependent Variable Models 1002 % Page 108 16.6 Models for Count Data 16.6.2 Interpretation of the Poisson Regression Model Suppose the conditional mean function contains a indicator variable, how do we calculate its effect? – If E(yi) = λi = exp(β1 + β2xi + δDi), we can examine the conditional expectation when D = 0 and when D = 1 E yi | Di 0 exp 1 2 xi E yi | Di 1 exp 1 2 xi Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 109 16.6 Models for Count Data 16.6.2 Interpretation of the Poisson Regression Model The percentage change in the conditional mean is: exp 1 2 xi exp 1 2 xi 100 % 100 e 1 % exp 1 2 xi Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 110 16.6 Models for Count Data Table 16.6 Poisson Regression Estimates 16.6.3 An Example Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 111 16.7 Limited Dependent Variables Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 112 16.7 Limited Dependent Variables When a model has a discrete dependent variable, the usual regression methods we have studied must be modified – Now we present another case in which standard least squares estimation of a regression model fails Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 113 16.7 Limited Dependent Variables FIGURE 16.3 Histogram of wife’s hours of work in 1975 16.7.1 Censored Data Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 114 16.7 Limited Dependent Variables 16.7.1 Censored Data This is an example of censored data, meaning that a substantial fraction of the observations on the dependent variable take a limit value, which is zero in the case of market hours worked by married women Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 115 16.7 Limited Dependent Variables 16.7.1 Censored Data We previously showed the probability density functions for the dependent variable y, at different x-values, centered on the regression function E y | x 1 2 x Eq. 16.30 – This leads to sample data being scattered along the regression function – Least squares regression works by fitting a line through the center of a data scatter, and in this case such a strategy works fine, because the true regression function also fits through the middle of the data scatter Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 116 16.7 Limited Dependent Variables 16.7.1 Censored Data For our new problem when a substantial number of observations have dependent variable values taking the limit value of zero, the regression function E(y|x) is no longer given by Eq. 16.30 – Instead E(y|x) is a complicated nonlinear function of the regression parameters β1 and β2, the error variance σ2, and x – The least squares estimators of the regression parameters obtained by running a regression of y on x are biased and inconsistent—least squares estimation fails Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 117 16.7 Limited Dependent Variables 16.7.2 A Monte Carlo Experiment In this example we give the parameters the specific values β1 = -9 and β2 = 1 – The observed sample is obtained within the framework of an index or latent variable model: yi* 1 2 xi ei 9 xi ei Eq. 16.31 – We assume: ei ~ N 0, 2 16 yi 0 if yi* 0 yi yi* if yi* 0 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 118 16.7 Limited Dependent Variables FIGURE 16.4 Uncensored sample data and regression function 16.7.2 A Monte Carlo Experiment Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 119 16.7 Limited Dependent Variables 16.7.2 A Monte Carlo Experiment Eq. 16.32a In Figure 16.5 we show the estimated regression function for the 200 observed y-values, which is given by: yˆ 2.1477 0.5161x (se) (.3706) (0.0326) – If we restrict our sample to include only the 100 positive y-values, the fitted regression is: yˆ 3.1399 0.6388 x Eq. 16.32b Principles of Econometrics, 4th Edition (se) (1.2055) (0.0827) Chapter 16: Qualitative and Limited Dependent Variable Models Page 120 16.7 Limited Dependent Variables FIGURE 16.5 Censored sample data, and latent regression function and least squares fitted line 16.7.2 A Monte Carlo Experiment Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 121 16.7 Limited Dependent Variables 16.7.2 A Monte Carlo Experiment We can compute the average values of the estimates, which is the Monte Carlo ‘‘expected value’’: 1 EMC bk NSAM Eq. 16.33 NSAM m1 bk ( m ) where bk(m) is the estimate of βk in the mth Monte Carlo sample Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 122 16.7 Limited Dependent Variables 16.7.3 Maximum Likelihood Estimation If the dependent variable is censored, having a lower limit and/or an upper limit, then the least squares estimators of the regression parameters are biased and inconsistent – We can apply an alternative estimation procedure, which is called Tobit Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 123 16.7 Limited Dependent Variables 16.7.3 Maximum Likelihood Estimation Tobit is a maximum likelihood procedure that recognizes that we have data of two sorts: 1. The limit observations (y = 0) 2. The nonlimit observations (y > 0) – The two types of observations that we observe, the limit observations and those that are positive, are generated by the latent variable y* crossing the zero threshold or not crossing that threshold Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 124 16.7 Limited Dependent Variables 16.7.3 Maximum Likelihood Estimation The (probit) probability that y = 0 is: P yi 0 P[ yi 0] 1 1 2 xi Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 125 16.7 Limited Dependent Variables 16.7.3 Maximum Likelihood Estimation The full likelihood function is the product of the probabilities that the limit observations occur times the probability density functions for all the positive, nonlimit, observations: 1 2 1 2 xi 1 2 2 L 1 , 2 , 1 2 exp 2 yi 1 2 xi yi 0 2 yi 0 – The maximum likelihood estimator is consistent and asymptotically normal, with a known covariance matrix. Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 126 16.7 Limited Dependent Variables 16.7.3 Maximum Likelihood Estimation For artificial data, we estimate: Eq. 16.34 Principles of Econometrics, 4th Edition yi 10.2773 1.0487 xi (se) (1.0970) (0.0790) Chapter 16: Qualitative and Limited Dependent Variable Models Page 127 16.7 Limited Dependent Variables Table 16.7 Censored Data Monte Carlo Results 16.7.3 Maximum Likelihood Estimation Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 128 16.7 Limited Dependent Variables 16.7.4 Tobit Model Interpretation In the Tobit model the parameters β1and β2 are the intercept and slope of the latent variable model Eq. 16.31 – In practice we are interested in the marginal effect of a change in x on either the regression function of the observed data E(y|x) or the regression function conditional on y > 0, E(y|x, y > 0) Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 129 16.7 Limited Dependent Variables 16.7.4 Tobit Model Interpretation The slope of E(y|x) is: Eq. 16.35 Principles of Econometrics, 4th Edition E y | x 1 2 x 2 x Chapter 16: Qualitative and Limited Dependent Variable Models Page 130 16.7 Limited Dependent Variables 16.7.4 Tobit Model Interpretation The marginal effect can be decomposed into two factors called the ‘‘McDonald-Moffit’’ decomposition: E y | x x Prob y 0 E y | x, y 0 x E y | x, y 0 Prob y 0 x – The first factor accounts for the marginal effect of a change in x for the portion of the population whose y-data is observed already – The second factor accounts for changes in the proportion of the population who switch from the y-unobserved category to the y-observed category when x changes Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 131 16.7 Limited Dependent Variables FIGURE 16.6 Censored sample data, and regression functions for observed and positive y-values 16.7.4 Tobit Model Interpretation Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 132 16.7 Limited Dependent Variables 16.7.5 An Example Consider the regression model: Eq. 16.36 HOURS 1 2 EDUC 3 EXPER 4 AGE 5 KIDSL6 e Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 133 16.7 Limited Dependent Variables Table 16.8 Estimates of Labor Supply Function 16.7.5 An Example Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 134 16.7 Limited Dependent Variables 16.7.5 An Example The calculated scale factor is 0.3638 – The marginal effect on observed hours of work of another year of education is: E HOURS EDUC 2 73.29 0.3638 26.34 • Another year of education will increase a wife’s hours of work by about 26 hours, conditional upon the assumed values of the explanatory variables Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 135 16.7 Limited Dependent Variables 16.7.6 Sample Selection If the data are obtained by random sampling, then classic regression methods, such as least squares, work well – However, if the data are obtained by a sampling procedure that is not random, then standard procedures do not work well – Economists regularly face such data problems Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 136 16.7 Limited Dependent Variables 16.7.6 Sample Selection If we wish to study the determinants of the wages of married women, we face a sample selection problem – We only observe data on market wages when the woman chooses to enter the workforce – If we observe only the working women, then our sample is not a random sample • The data we observe are ‘‘selected’’ by a systematic process for which we do not account Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 137 16.7 Limited Dependent Variables 16.7.6 Sample Selection A solution to this problem is a technique called Heckit – This procedure uses two estimation steps: 1. A probit model is first estimated explaining why a woman is in the labor force or not – The selection equation 2. A least squares regression is estimated relating the wage of a working woman to education, experience, and so on, and a variable called the ‘‘inverse Mills ratio,’’ or IMR – The IMR is created from the first step probit estimation and accounts for the fact that the observed sample of working women is not random Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 138 16.7 Limited Dependent Variables 16.7.6a The Econometric Model The selection equation – It is expressed in terms of a latent variable z*I that depends on one or more explanatory variables wi, and is given by: zi* 1 2 wi ui Eq. 16.37 i 1, , N – The latent variable is not observed, but we do observe the indicator variable: Eq. 16.38 Principles of Econometrics, 4th Edition * 1 z i 0 zi 0 otherwise Chapter 16: Qualitative and Limited Dependent Variable Models Page 139 16.7 Limited Dependent Variables 16.7.6a The Econometric Model The second equation is the linear model of interest: yi 1 2 xi ei Eq. 16.39 i 1, , n, N n – A selectivity problem arises when yi is observed only when zi = 1 and if the errors of the two equations are correlated • In such a situation the usual least squares estimators of β1and β2 are biased and inconsistent Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 140 16.7 Limited Dependent Variables 16.7.6a The Econometric Model Consistent estimators are based on the conditional regression function: Eq. 16.40 E yi | zi* 0 1 2 xi i , i 1, ,n where the additional variable λi is the ‘‘inverse Mills ratio”: Eq. 16.41 Principles of Econometrics, 4th Edition 1 2 wi i 1 2 wi Chapter 16: Qualitative and Limited Dependent Variable Models Page 141 16.7 Limited Dependent Variables 16.7.6a The Econometric Model Consistent estimators are based on the conditional regression function: Eq. 16.40 E yi | zi* 0 1 2 xi i , i 1, ,n where the additional variable λi is the ‘‘inverse Mills ratio”: Eq. 16.41 Principles of Econometrics, 4th Edition 1 2 wi i 1 2 wi Chapter 16: Qualitative and Limited Dependent Variable Models Page 142 16.7 Limited Dependent Variables 16.7.6a The Econometric Model The parameters γ1 and γ2 can be estimated using a probit model, based on the observed binary outcome zi so that the estimated IMR: i 1 2 wi 1 2 wi – Therefore: Eq. 16.42 Principles of Econometrics, 4th Edition yi 1 2 xi i vi , i 1, Chapter 16: Qualitative and Limited Dependent Variable Models ,n Page 143 16.7 Limited Dependent Variables 16.7.6b Heckit Example: Wages of Married Women Eq. 16.43 An estimated model is: ln WAGE 0.4002 0.1095EDUC 0.0157 EXPER (t) ( 2.10) (7.73) R2 0.1484 (3.90) The Heckit procedure starts by estimating a probit model: P LFP 1 1.1923 0.0206 AGE 0.0838 EDUC 0.3139 KIDS 1.3939 MTR (t ) ( 2.93) (3.61) ( 2.54) ( 2.26) The inverse Mills ratio is: IMR Principles of Econometrics, 4th Edition 1.1923 0.0206 AGE 0.0838EDUC 0.3139KIDS 1.3939MTR 1.1923 0.0206 AGE 0.0838EDUC 0.3139KIDS 1.3939MTR Chapter 16: Qualitative and Limited Dependent Variable Models Page 144 16.7 Limited Dependent Variables 16.7.6b Heckit Example: Wages of Married Women The final combined model is: ln WAGE 0.8105 0.0585EDUC 0.0163EXPER 0.8664 IMR Eq. 16.44 (t ) (t -adj) Principles of Econometrics, 4th Edition (1.64) (1.33) (2.45) (1.97) (4.08) (3.88) Chapter 16: Qualitative and Limited Dependent Variable Models ( 2.65) ( 2.17) Page 145 16.7 Limited Dependent Variables 16.7.6b Heckit Example: Wages of Married Women In most instances it is preferable to estimate the full model, both the selection equation and the equation of interest, jointly by maximum likelihood – The maximum likelihood estimated wage equation is: ln WAGE 0.6686 0.0658 EDUC 0.0118 EXPER (t ) (2.84) (3.96) (2.87) – The standard errors based on the full information maximum likelihood procedure are smaller than those yielded by the two-step estimation method Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 146 Key Words Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 147 Keywords binary choice models censored data conditional logit count data models feasible generalized least squares Heckit identification problem independence of irrelevant alternatives (IIA) Principles of Econometrics, 4th Edition index models individual and alternative specific variables individual specific variables latent variables likelihood function limited dependent variables linear probability model Chapter 16: Qualitative and Limited Dependent Variable Models logistic random variable logit log-likelihood function marginal effect maximum likelihood estimation multinomial choice models multinomial logit Page 148 Keywords odds ratio ordered choice models ordered probit ordinal variables Principles of Econometrics, 4th Edition Poisson random variable Poisson regression model probit Chapter 16: Qualitative and Limited Dependent Variable Models selection bias Tobit model truncated data Page 149 Appendices Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 150 16A Probit Marginal Effects: Details 16A.1 Standard Error of Marginal Effect at a Given Point Consider the probit model p = Φ(β1 + β2x) – The marginal effect at x = x0 is: dp dx β1 β 2 x0 β 2 g β1 ,β 2 x x0 – The estimator of the marginal effect, based on maximum likelihood, is: g β1 ,β 2 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 151 16A Probit Marginal Effects: Details 16A.1 Standard Error of Marginal Effect at a Given Point The variance is: g β1 ,β 2 g β1 ,β 2 var g β1 ,β 2 var β1 var β 2 β1 β 2 g β1 ,β 2 g β1 ,β 2 2 cov β1 ,β 2 β1 β 2 Eq. 16A.1 2 2 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 152 16A Probit Marginal Effects: Details 16A.1 Standard Error of Marginal Effect at a Given Point To implement the delta method we require the derivative: g β1 ,β 2 β1 β1 β 2 x0 β 2 β1 β1 β 2 x0 β 2 β 2 β1 β 2 x0 β1 β1 β1 β 2 x0 β1 β 2 x0 β 2 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 153 16A Probit Marginal Effects: Details 16A.1 Standard Error of Marginal Effect at a Given Point To obtain the final result, we used β 2 β1 0 and: β1 β 2 x0 β1 1 12 β1 β2 x0 2 e β1 2 1 12 β1 β2 x0 2 1 e 2 β β x 1 2 0 2 2 β1 β 2 x0 β1 β 2 x0 We then obtain the key derivative: g β1 ,β 2 β 2 Principles of Econometrics, 4th Edition β1 β 2 x0 1 β1 β 2 x0 β 2 x0 Chapter 16: Qualitative and Limited Dependent Variable Models Page 154 16A Probit Marginal Effects: Details 16A.1 Standard Error of Marginal Effect at a Given Point Using the transportation data, we get: var β 1 cov β1 ,β 2 Principles of Econometrics, 4th Edition cov β1 ,β 2 0.1593956 0.0003261 0.0003261 0.0105817 var β 2 Chapter 16: Qualitative and Limited Dependent Variable Models Page 155 16A Probit Marginal Effects: Details 16A.1 Standard Error of Marginal Effect at a Given Point For DTIME = 2 (x0 = 2), the calculated values of the derivatives are: g β1 ,β 2 β1 0.055531 and g β1 ,β 2 β 2 0.2345835 The estimated variance and standard error of the marginal effect are: var g β1 ,β 2 0.0010653 and se g β1 ,β 2 0.0326394 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 156 16A Probit Marginal Effects: Details 16A.2 Standard Error of Average Marginal Effect The average marginal effect of this continuous variable is: 1 N AME i 1 β1 β2 DTIMEi β2 g2 β1 ,β2 N We require the derivatives: g 2 β1 ,β 2 β1 1 N β β DTIME β 2 i 2 i 1 1 β1 N 1 N i 1 β1 β 2 DTIMEi β 2 N β1 1 N g β1 ,β 2 i 1 N β1 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 157 16A Probit Marginal Effects: Details 16A.2 Standard Error of Average Marginal Effect Similarly: g 2 β1 ,β 2 β 2 β 2 1 N β β DTIME β i 2 N i 1 1 2 1 N i 1 β1 β 2 DTIMEi β 2 N β 2 1 N g β1 ,β 2 i 1 N β 2 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 158 16A Probit Marginal Effects: Details 16A.2 Standard Error of Average Marginal Effect For the transportation data: g 2 β1 ,β 2 β1 0.00185 and g 2 β1 ,β 2 β 2 0.032366 The estimated variance and standard error of the average marginal effect are: var g 2 β1 ,β 2 0.0000117 and se g 2 β1 ,β 2 0.003416 Principles of Econometrics, 4th Edition Chapter 16: Qualitative and Limited Dependent Variable Models Page 159