Download Unbalanced Panel Data … and Stata - Pdx

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Unbalanced Panel Data
… and Stata
Kuan-Pin Lin
Portland State University and
WISE, Xiamen University
Panel Data Anslysis

Panel Data Definition

Unbalanced Panel
yit , xit (t  1, 2,..., Ti ; i  1,..., N )



Balanced Panel: Ti  T , i
Short Panel: T  , N  
Long Panel: T  , N  

Panel Data Models
Fixed Effects Model
 Random Effects Model
yit  xit' β  ui  vt  eit

i  1, 2,..., N ; t  1, 2,..., Ti

y i  Xi β  ui iTi  vTi  ei
Panel Data Analysis

Benefits




More Data Variability
Less Colinearity among Variables
Control for Unobserved Heterogeneity
yit  xit   ui  vt  eit
Limitations


Time Serial and Cross Sectional Correlation
Missing Data


Randomly and Non-randomly
Sample Selection Bias
Unbalanced Panel Data


Incomplete panels are more likely to be the
norm in typical economic empirical studies
Problems with non-response and
measurement errors


Gaps in Time Series
Holes in Cross Sections
Panel Data Analysis Using Stata

Declare panel data and variables


Panel data analysis: xt commands





xtset
xtdes
xtsum
xtdata
xtline
Panel data regression

xtreg
Returns to Schooling
Koops and Tobias (2004)



Study the relationship between wages and education,
ability, and family characteristics.
Data is available in two parts. The first file contains the
panel of 17,919 observations on the Person ID and 4
time-varying variables. The second file contains time
invariant variables for the individual or the 2,178
households. See the article for details on the empirical
model and data construction (data achieve and demo
program 1 2).
Data source: U.S. National Longitudinal Survey of Youth
(NLSY), U.S. Dept. of Labor, Bureau of Labor Statistics.
Returns to Schooling
Koops and Tobias (2004)

Part 1:






Column 1 = Person id (ranging from 1 to 2178),
Column 2 = Education,
Column 3 = Log of hourly wage,
Column 4 = Potential experience,
Column 5 = Time trend.
Part 2:





Column 1 = Time invariant ability,
Column 2 = Mother's education,
Column 3 = Father's education,
Column 4 = Dummy variable for residence in a broken home,
Column 5 = Number of siblings.
Housing Prices in China
Wang (2011)



Study the effects on housing consumption and prices
from the privatization of state-owned housing reform
began in 1994 (data file and demo program 1 2)
Based on standard regression analysis, the removal of
price distortions allowed households to increase their
consumption of housing and led to an increase in
equilibrium housing prices.
Using CHNS, data is obtained for 31677 individuals from
about 2900 households in counties and cities of 9
provinces over 6 years (1989, 1991, 1993,1997, 2000,
2004).
References




B.H. Baltagi, and S.H. Song, “Unbalanced Panel Data: A
Survey,” Statistical Papers 47, 493-523, 2006.
B.H. Baltagi, Chapter 9: Unbalanced Panel Data Models,
Econometric Analysis of Panel Data, 4th ed., John Wiley,
New York, 2008.
G. Koops, and J.L. Tobias, “Learning About
Heterogeneity in Returns to Schooling”, Journal of
Applied Econometrics 19, 827-849, 2004.
S-Y. Wang, “State Misallocation and Housing Prices:
Theory and Evidence from China”, American Economic
Review 101, 2081-2107, 2011.