* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Non-Owner-Occupancy Misrepresentation and Loan Default
Present value wikipedia , lookup
Federal takeover of Fannie Mae and Freddie Mac wikipedia , lookup
Financialization wikipedia , lookup
United States housing bubble wikipedia , lookup
Payday loan wikipedia , lookup
Moral hazard wikipedia , lookup
Credit card interest wikipedia , lookup
Peer-to-peer lending wikipedia , lookup
Credit rationing wikipedia , lookup
Yield spread premium wikipedia , lookup
Interest rate ceiling wikipedia , lookup
Continuous-repayment mortgage wikipedia , lookup
Adjustable-rate mortgage wikipedia , lookup
History of pawnbroking wikipedia , lookup
Securitization wikipedia , lookup
Syndicated loan wikipedia , lookup
20130224-IA Internet Appendix for “Asset Quality Misrepresentation by Financial Intermediaries: Evidence from RMBS Market”1 TOMASZ PISKORSKI, AMIT SERU, and JAMES WITKIN This Internet Appendix (IA) contains a more detailed discussion on data construction and several robustness tests not included in the main text of the paper. It also sketches out a simple example that is discussed in the paper. 1 Citation format: Piskorski, Tomasz, Amit Seru, and James Witkin, Internet Appendix for “Asset Quality Misrepresentation by Financial Intermediaries: Evidence from RMBS Market,” Journal of Finance, doi: 10.1111/jofi.12271. :. Please note: Wiley-Blackwell is not responsible for the content or functionality of any supporting information supplied by the authors. Any errors (other than missing material) should be directed to the authors of the article. I. Data A. Verifying the Quality of the BlackBox-Equifax Merge As we mention in Section II of the published article, our primary data set consists of a merge between (i) loan-level mortgage data collected by BlackBox Logic and (ii) borrower-level credit report information collected by Equifax. The merge is performed by Equifax using a proprietary merge algorithm utilizing more than 25 variables. Equifax is one of the three largest credit bureaus in the U.S. (along with Experian and TransUnion) and collects data on more than 400 million consumers across the world. As such, it is one of the few potential providers for such detailed consumer credit data, and the firm has built a reputation based on data management. It provides a variety of analytical tools, from fraud detection to portfolio analytics, to a diverse group of client companies spanning multiple industry sectors. Given Equifax’s background as a data services company with over $1.5 billion in annual revenue, it has the expertise to perform the merge between the BlackBox and Equifax data sets. Nevertheless, as our measures of underwriting misrepresentation rely on identifying discrepancies between these two data sets, we independently verify the quality of the merge. Equifax reports a merge confidence measure that ranges from 0 to 0.90 (ranging from low to high confidence). The majority of loans fall in the high-confidence buckets, between 0.8 and 0.81 or between 0.89 and 0.90 (see Table IA.I). To further verify the accuracy of the merge, we compare the dynamic loan payment history reported to BlackBox to that reported to Equifax. While BlackBox provides loan-level data, the Equifax data are reported at the consumer level with statistics such as delinquencies either aggregated by debt product (first liens, HELOCs, credit cards, etc.) or separated by account size and product (largest first lien, second-largest first lien, etc.). We restrict our analysis to consumers with only one first mortgage in Equifax so that we can compare the correct payment records.2 Additionally, we limit our analysis to observations for which the loan’s status in BlackBox is either current or 30, 60, or 90+ days delinquent, as it can be difficult to correctly map other BlackBox statuses such as bankruptcy or prepayment to a “correct” status in Equifax. We report the results in Table IA.II. We find that for the entire sample, the BlackBox and Equifax payment statuses exactly match 93.8% of the time and are within 30 days of each other 98.0% of the time. We include the second measure to account for potential lags in reporting across the two databases. We observe that status-matching rates increase monotonically with Equifax’s confidence measure, with only 0.3% of the highest-confidence loans having significantly different BlackBox and Equifax delinquency statuses. It is worth noting that among loans with a confidence measure of 0.89 or greater, about 16% of loans do not have matching BlackBox and Equifax zip codes. 3 This 2 Consider a borrower that has two first mortgages: we would observe the delinquency status of both mortgages, but we would have to make assumptions regarding which status corresponded to the matched BlackBox loan. 3 This group of loans includes mortgages reported to investors as for non-owner-occupied properties. 2 reinforces the view that Equifax uses unique identifiers in performing its merge, given the difficulty of performing a merge across millions of loans without requiring a match on zip code. Based on these findings, we conclude that the most reliable subsample of the merged data contains loans with a confidence of 0.89 or higher (44.4% of the sample). For the analysis of misreported second liens, this results in a sample of 854,959 mortgages, down from 1,741,606 loans originated over the 2005 to 2007 period that are reported as having no second liens in BlackBox (a 51.41% reduction). For the analysis of owner-occupancy misreporting (see Section II for discussion of owner-occupancy misrepresentation), this results in a sample of 1,563,223 mortgages, down from 3,549,858 loans originated over the 2005 to 2007 period that are reported to BlackBox as owner-occupied (a 55.96% reduction). Next, we examine how restricting attention to loans with a high Equifax merge confidence impacts our sample. Panel A of Table IA.III reports statistics for loans for which BlackBox reports that the CLTV and LTV are equal. The first two columns report these statistics without any restrictions, while the last two columns report the statistics for the high-quality merged sample. Again, imposing these restrictions leads to a sample with slightly higher-quality observables. FICO scores and origination balances increase, while interest rates and CLTV decrease. Figure IA.1 displays kernel density plots comparing the merged sample with a sample of loans with no reported second liens in BlackBox that do not meet the Equifax merge confidence restrictions. Again, the shapes of the distributions are quite similar, demonstrating that these two samples are also not meaningfully different. In Panel B of Table IA.III we display summary statistics for loans reported as owneroccupied in the BlackBox data set to which we apply our second measure of asset quality misrepresentation. The first two columns report these statistics for all loans in the BlackBox data set that are reported as owner-occupied. The last two columns report statistics for the subsample of these loans limited to mortgages with the highest Equifax merge confidence (0.89 or higher). On average, loans in the high-quality merged sample have somewhat higher-quality observable characteristics. In particular, these loans have lower interest rates, lower CLTV ratios, and higher FICO scores and origination balances. Figure IA.2 displays kernel density plots comparing this high-quality merged sample to the sample of BlackBox loans with reported owner-occupied status that do not meet the merge confidence restriction. We see that despite the differences in mean values, the two samples have similar underlying distributions of important observable characteristics. Finally, we conduct a placebo test to assess whether there is a relationship between incorrectly merged records and subsequent loan performance. To identify incorrectly merged records, we use the origination loan balances that Equifax reports each month for the borrower’s two largest active first mortgages. We compare these two amounts over the loan’s first six observations to the origination loan balance reported by BlackBox to construct Balance Mismatch, which takes the value of one if neither Equifax balance is ever within 2% of the 3 BlackBox origination balance, and zero otherwise.4 We note that the balance of the securitized first mortgage is unlikely to be misreported to investors because servicers verify and report on a monthly basis outstanding loan amount and payments to the securitization trust; hence, such records may indicate incorrectly merged loans across the two databases. Because only the two largest mortgage balances are reported in the Equifax database, we remove borrowers that have three or more first mortgages reported in their credit file. The reason is that we are not sure if the loan in BlackBox would be part of the two largest mortgage balances reported in the Equifax database. We find that 2.01% of the loans have mismatched origination loan balances. To examine the relationship between default likelihood and incorrectly merged records, we run an OLS regression in which the dependent variable is the same 90-day delinquency dummy variable used in Table V of the published article. The right-hand variables include Balance Mismatch, the two previous misrepresentation measures, origination cohort fixed effects, and the same vector of controls used in prior loan-level regressions. Column (2) clusters standard errors by state level. Table IA.IV reports the results. While we observe a small positive coefficient on Balance Mismatch, the effect is economically small relative to what we find for our primary variables. Moreover, the effect is not statistically significant when standard errors are clustered at the state level. This suggests that there is no economically meaningful relationship between the potential merge accuracy of the two databases and subsequent loan performance. Importantly, the inclusion of Balance Mismatch does not reduce the economic or statistical significance of the misrepresentation variables. This evidence provides further support that our methodology to identifies actual misrepresentations other than incorrectly merged records. B. Constructing Underwriter-Level Measures In this subsection we discuss construction of the underwriter-level variables used in Table X of the published article. The measure of the relative importance of the RMBS underwriting business is constructed by dividing the aggregate dollar value of nonagency RMBS underwritten in 2005 by the underwriter’s total assets as of the end of 2005, as reported by Compustat. This variable has a mean of 0.243 and a standard deviation of 0.334. Years of experience in underwriting subprime deals is a deal-level variable capturing the underwriter’s experience in the subprime MBS market. To construct this measure, we take each pool’s securitization year and subtract the year in which the deal’s underwriter entered the subprime MBS market, using ABSNet’s universe of subprime MBS deals from 1999 onward. This variable has a mean of 5.98 and a standard deviation of 1.45. The commercial bank underwriter variable, which accounts for differences between commercial and investment banks, is a dummy that takes a value of one if the underwriter’s charter identifies it as a commercial bank and zero otherwise. The underwriter bonus-to-salary 4 Results are quantitatively similar regardless of how we define Balance Mismatch with respect to both the sixmonth time window and the strictness of the balance match (within 2%, 5%, $100, etc.). 4 ratio is computed by dividing the cash bonus of each underwriter’s five highest-paid executives by their base salary in 2005. These data comes from Computstat’s Execucomp database where available, and otherwise are constructed using firms’ public financial reporting. This variable has a mean of 14.98 and a standard deviation of 15.53. The Risk Management Index (RMI), based on Ellul and Yerramili (2013), is computed as the first principal component of the following six risk management variables: Credit Risk Officer (CRO) is present, CRO is an executive, CRO is a top-five executive, CRO compensation centrality, risk committee experience, and active risk committee. 5 Table IA.I Distribution of Equifax Merge Confidence The table shows the distribution of loans in the BlackBox data set across Equifax’s merge confidence level. The highest merge confidence of a given loan is 0.90. Merge Confidence Number of Loans [0.00,0.80) 3.6% [0.80,0.81) 41.8% [0.81,0.89) 10.2% Highest Confidence Available: [0.89,0.90) 44.4% Total 100% Table IA.II Delinquency Status Matching by Equifax Merge Confidence The table shows the fraction of observations with delinquency statuses that match or are within 30 days of each other in BlackBox and Equifax. The sample is restricted to borrowers with only one first mortgage record in Equifax and to observations for which the loan’s status in BlackBox is either current or 30, 60, or 90 days delinquent. A “Status Match” is defined as an exact match between the reported BlackBox and Equifax delinquency statuses. “Status within 30 Days” is defined as two statuses that are less than 60 days apart, for example, 90 days delinquent in BlackBox and 60 days delinquent in Equifax. Merge Confidence Fraction Payment Status Match Fraction Payment Status Within 30 Days [0.00,0.80) 0.841 0.924 [0.80,0.81) 0.904 0.964 [0.81,0.89) 0.942 0.991 [0.89,0.90) 0.976 0.997 6 Table IA.III Comparison of Descriptive Statistics The table presents a comparison of summary statistics for key variables. The first two columns of each panel present statistics for all loans in a given sample, and the second two columns restrict attention to loans that have the highest merge confidence. Panel A presents these statistics for loans reported as having no second lien to the RMBS trustee, while Panel B presents these statistics for mortgages reported as for owner-occupied properties. Interest Rate FICO Balance CLTV Purchase No Cash Out Refi Cash Out Refi Low or No Doc. ARM Option ARM Number of Loans Interest Rate FICO Balance CLTV Purchase No Cash Out Refi Cash Out Refi Low or No Doc. ARM Option ARM Number of Loans Panel A: Sample of Loans Reported as Having No Second Liens Full Sample Merged Sample Mean SD Mean SD 6.925 2.236 6.458 2.424 652.8 76.48 663.5 76.90 277.6 225.3 293.1 238.9 79.79 10.03 78.93 9.955 0.366 0.482 0.361 0.480 0.117 0.321 0.121 0.326 0.506 0.500 0.507 0.500 0.472 0.499 0.514 0.500 0.597 0.491 0.475 0.499 0.113 0.317 0.169 0.375 1,741,606 854,959 Panel B: Sample of Loans Reported as for Owner-Occupied Properties Full Sample Merged Sample SD Mean SD 6.990 1.947 6.605 2.024 662.6 74.95 680.2 73.20 291.0 254.2 324.6 248.3 84.33 11.44 83.72 11.71 0.483 0.500 0.496 0.500 0.149 0.356 0.136 0.343 0.360 0.480 0.358 0.480 0.419 0.493 0.463 0.499 0.606 0.489 0.527 0.499 0.0991 0.299 0.103 0.304 3,549,858 1,563,223 7 Table IA.IV Balance Mismatch Placebo Table The table presents OLS estimates from regressions in which the dependent variable takes the value of one if the loan defaults (90+ days delinquent) in the first two years since origination and zero otherwise. Balance Mismatch takes a value of one if the origination loan balance reported by BlackBox is not within 2% of any first-mortgage origination loan balance reported to Equifax during the loan’s first six months, and zero otherwise. Misreported Second takes a value of one if the loan is characterized as having a second-lien misrepresentation and zero otherwise. Misreported Non-Owner-Occupant takes a value of one if the loan is characterized as having a non-owner-occupancy misrepresentation and zero otherwise. Other Controls include origination variables such as FICO score, interest rate, and LTV ratio. Squared and cubed terms for FICO and CLTV are included to account for potential nonlinear effects. The estimates are in percentage terms; standard errors are in parentheses. * p < 0.10, ** p < 0.05, and *** p < 0.01. (1) 0.709*** (0.192) (2) 0.709 (0.443) Misreported Second 8.214*** (0.16) 8.214*** (0.859) Misreported Non-Owner-Occupant 7.485*** (0.124) 7.485*** (0.54) Other Controls Yes Yes Half-Year Origination Cohorts Yes Yes No 1487121 14.44 2.010 0.155 Yes 1487121 14.44 2.010 0.155 Balance Mismatch SEs Clustered by State Number of Loans Percent 90 DPD Percent Balance Mismatch R-Squared 8 Panel A: Origination CLTV Ratios Panel B: Origination FICO Scores Panel C: Origination Interest Rates Panel D: Origination Loan Balance Figure IA.1. Comparability of loans with no second-lien reported by merge restrictions – kernel density of observables. The figure shows the kernel density plots of loan origination CLTV ratios, FICO credit scores, interest rates, and loan balances for loans reported as not having any second liens in the BlackBox data set. The group of loans meeting the Equifax merge restrictions is represented by the solid line, and the group that fails these restrictions is represented by the dashed line. 9 Panel A: Origination CLTV Ratios Panel B: Origination FICO Scores Panel C: Origination Interest Rates Panel D: Origination Loan Balance Figure IA.2. Comparability of loans reporting owner-occupied status by merge restrictions – kernel density of observables. The figure shows the kernel density plots of loan origination CLTV, FICO credit scores, interest rates, and loan balances for loans reported as owner-occupied in the BlackBox data set. The group of loans meeting the Equifax merge restrictions is represented by the solid line, and the group that fails these restrictions is represented by the dashed line. 10 II. Non-Owner-Occupancy Misrepresentation A. Construction of Non-Owner-Occupancy Misrepresentation Measure In this section we consider a second measure of asset quality misrepresentation. This measure concerns the occupancy status of the property backing the loan. We identify a loan as misreported if the disclosure to investors shows that the loan is collateralized by owner-occupied property, when in fact the credit bureau data show that the property was owned by borrowers with a different primary residence (i.e., is an investment property or a second home). Prospectuses also make statements related to asset quality on this dimension. 5 To construct this measure we start with our base sample and consider only loans reported as owner-occupied properties to investors at the time of origination. This gives us a sample of 1,563,223 loans. We then take advantage of the mailing address zip code reported to Equifax to construct our measure. The identifying assumption here is that a zip code reported to a credit bureau, such as Equifax, should generally match the property address zip code reported to the trustee of the RMBS for owner-occupied homes. To allow for reporting delays, we follow a very conservative approach and compare the Equifax zip code reported each month over the first year of the loan’s life to the property zip code reported to investors. If none of these 12 Equifax zip codes matches with the BlackBox data set, we classify such a loan as having a Misreported NonOwner-Occupant. In addition to ensuring that the match between the two data sets, done by the credit bureau, is of high quality we have to address another issue regarding our second measure of asset quality misrepresentation. In particular, even if the match between the two databases is perfect, only our first measure – Misreported Second-Lien – allows us to directly identify asset misrepresentation. For the second measure, which is based on occupancy status, the misrepresentation is inferred since we do not observe occupancy status in the Equifax database. Instead, we infer whether a property is owner-occupied if the mailing zip code reported in the credit bureau data matches the property zip code disclosed to investors. Consequently, we need to verify that the measure based on occupancy status is reasonable. Again, our measure appears to be reasonable for several reasons. First, among loans reported as for non-owner-occupied properties, the majority (about two-thirds) have a mailing address zip code in the Equifax database that does not match the property address in the BlackBox database.6 Similarly, a majority (about 70%) of reported owner-occupants residing in the same zip code have only one first mortgage on file in the Equifax data. These features For instance, the prospectus for Series 2006-FF15 states: “The prospectus supplement will disclose the aggregate principal balance of Mortgage Loans secured by Mortgaged Properties that are owner-occupied.” The prospectus supplements also usually describe the frequency of loans for second homes or investor properties. For example, the supplement for the Wells Fargo MBS Series 2007–8 states that “Approximately 0.02% of the mortgage loans in the mortgage pool are expected to be secured by investor properties.” 6 About a third of properties whose owner occupancy status is reported to a trustee of the RMBS as being nonowner-occupied likely have both properties in their zip code of residence – a fact we corroborated to be potentially true based on our conversations with several industry practitioners. 5 11 underscore our measure of owner-occupancy misrepresentation. In addition, a majority (more than 60%) of misreported owner-occupants have multiple first mortgages on their credit file, indicating that they own multiple properties and are likely to be real estate investors.7 Second, ex post delinquencies of loans we identify as misrepresented on the dimension of owner occupancy are significantly higher when compared with otherwise similar loans. Finally, we do not detect owner-occupancy misrepresentations for cases in which the loan is truly made to a non-owner-occupant who resides in the same zip code as the property backing the loan, and we discard cases in which the zip code of the property differs from that in the credit bureau data for a period less than 12 consecutive months. It is worth noting that, for the above reasons, we will likely classify loans that may be misrepresented as not having been misrepresented. This should bias against finding that misrepresented loans have higher ex post delinquencies relative to otherwise identical loans. B. The Extent of Non-Owner-Occupancy Misrepresentation and Its Relation with Loan Performance and Mortgage Rates Table IA.V displays summary statistics for loans with reported owner-occupied status. As can be seen, 6.42% of mortgages reported as financing owner-occupied properties were given to borrowers with a different primary residence, and thus were misrepresented based on our method. Stated differently, 27% of loans obtained by non-owner-occupants misreported their true purpose. As in the case of the second-lien misrepresentation, the prevalence of owner-occupancy misrepresentation is higher for loans used to purchase the properties. Among the purchase loans with reported owner-occupied status, about 9.6% misrepresent owner occupancy compared with only 3.1% for refinance mortgages. There are a number of reasons why misrepresentation on this dimension could be more prevalent for purchase loans. For example, it may be harder to misrepresent the occupancy status for refinancing loans, especially if borrowers are involved, if the borrower does not reside in the property that is his or her primary residence. Overall, as there is some overlap in both types of misrepresentation across mortgages, our measures imply that about 9.1% of loans have misreported second-lien or misreported owneroccupancy status (or both). This estimate would be about 12.2% were we to include HELOCs when inferring misreported second liens. There is significant regional variation in the prevalence of owner-occupancy misrepresentations across the U.S. Areas where owner-occupancy misrepresentation is most prevalent correspond to markets that were booming leading up to the crisis. This is consistent with Chinco and Mayer (2012), who show significant activity of non-owner-occupants acquiring residential real estate in these regional markets for speculation prior to the recent housing crisis. Our measure indicates that a significant portion of the purchases made by non-owner-occupants in these regions was financed with mortgages that misrepresented their true purpose. For 7 This is also true for reported non-owner-occupants, as about 80% have multiple first mortgages on the credit file. 12 example, in our data about 22.5% of loans in Florida were reported to be non-owner-occupant loans. At the same time, according to our measure, about 9.5% of mortgaged properties in Florida reported as owner-occupied were actually non-owner-occupied properties. This implies that about a third of loans obtained by non-owner-occupants in Florida misreported their true purpose. Rural states such as Nebraska and South Dakota have some of the lowest fractions of loans that misrepresent owner occupancy status (2.5% and 2.8%, respectively). This lower incidence of misrepresentation is most likely tied to the less pronounced presence of non-owneroccupant borrowers in these regional markets. Table IA.VI shows that loans with misrepresentation on owner-occupancy status are more likely to have higher CLTVs. Moreover, these loans often tend to be option ARMs8 and have low or no additional documentation provided. We note that the relation between observables and asset misrepresentation also seems economically meaningful. For instance, the probability of asset misrepresentation based on the owner-occupancy status of a loan increases by more than 7% (more than 100% relative to the mean) if the loan contract is an option ARM. Finally, as before, misrepresentation of owner-occupancy status is much less common among refinance loans. Broadly, the features that are positively related to misrepresentation of occupancy status are typically associated with purchase mortgages of the Alt-A type of credit risk.9 Although the addition of origination cohort controls does not seriously impact the sign or significance of the regression coefficients, the fixed effects themselves illustrate an interesting trend. Column (2) of Table IA.VI indicates that the average propensity of owner-occupancy misrepresentation peaks in the first half of 2007 and then drops sharply thereafter. These timeseries patterns are consistent with models such as those of Povel, Singh, and Winton (2007) and Hartman-Glaser (2013), who predict that misrepresentations are most likely to occur in relatively good times and can grow over time. Our findings also suggest that the tightening of credit standards just before the collapse of the nonagency market may have led to increased scrutiny. Overall, misreported loans have characteristics that are broadly expected of similar loans with truthful reporting. As we show below, even after accounting for characteristics reported to investors, there are significant differences in the performance of misrepresented loans on the dimension of owner-occupancy relative to similar loans with no misrepresentation. This indicates that reported characteristics of mortgages, including interest rates on mortgages, may not have fully accounted for the riskiness induced by misrepresentation. 8 Unlike more traditional fixed-rate mortgages (FRMs) or ARMs, an option ARM is an adjustable-rate mortgage that lets borrowers pay only the interest portion of the debt or even less than that, with the loan balance growing above the amount initially borrowed up to a certain limit (see Piskorski and Tchistyi (2010)). 9 The Alt-A risk category consists of loans that, for various reasons (e.g., low documentation), loan originators consider riskier than prime loans but less risky than subprime mortgages. 13 As in the case of second-lien misrepresentation, we examine the relation between subsequent loan performance and misrepresentation of non-owner-occupancy status using the following loan-level specification: (IA.1) 𝑌𝑖 = 𝛼 + 𝛽𝑋𝑖 + 𝛾𝛿𝑖 + 𝜀𝑖 , where the dependent variable is a dummy that takes a value of one if the mortgage goes 90 days past due on payments during the first two years since origination and zero otherwise. The main explanatory variable is the dummy Misreported Non-Owner-Occupant (𝛿𝑖 ), which takes the value of one if loan i has misrepresentation on owner occupancy as per our method and zero otherwise. As we see from Table IA.VII, conditional on all the other observables, loans with misrepresented owner occupancy are associated with a higher likelihood of subsequent delinquency. This estimate is also economically significant: a loan with misrepresented borrower occupancy status has about a 9.4% higher likelihood of default, compared with loans with similar characteristics but with truthfully reported owner-occupancy status of the borrower. This estimate implies that loans with misreported owner-occupancy status have more than a 60% higher default rate relative to the mean default rate of owner-occupants. The positive association between Misreported Non-Owner-Occupant and the delinquency of the loan holds across all origination cohorts, and as in the case of misrepresented second liens, the absolute effect grows over time. In terms of economic magnitudes, the effect is smallest for loans originated in 2005. However, even in 2005, the estimate suggests an increase in absolute delinquency of 4.04% for loans misrepresenting owner occupancy, implying a relative increase of more than 56% when compared with the mean default rate of 7.15% for loans with truthfully reported owner-occupancy status of borrowers. So far we show that mortgage defaults on loans with misreported owner-occupancy status are significantly higher than similar loans truthfully disclosed as having owner-occupant borrowers. As in the case of second-lien misrepresentation, we now run another comparison. In Panel B of Table IA.VII, we also examine whether loans with misreported owner occupancy perform differently than loans with truthfully reported non-owner-occupant borrowers. To do so, we expand the sample to include loans backed by properties reported as either second homes or investor properties. We then estimate regressions similar to the above specification where the dependent variable is the same delinquency dummy variable used earlier. The two variables of interest are Misreported Non-Owner-Occupant and Reported Non-Owner-Occupant, a dummy that takes a value of one if the loan is reported as secured by a second home/investor property. The results show that Reported Non-Owner-Occupant has a positive and significant coefficient in the default regression, implying an approximate 3.5% absolute increase in the default rate for these loans (about 23% relative increase in the default rate compared with loans with owner-occupant borrowers). This result is in line with prior work that finds that, all else equal, loans with non-owner-occupancy status are more likely to default (see Mayer, Pence, and 14 Sherlund (2009) and Haughwout et al. (2011)). Such borrowers are likely to default more quickly on homes that are not their primary residence, for reasons such as weaker neighborhood and social ties. More interestingly, Misreported Non-Owner-Occupant has a positive and much larger coefficient in the default regression, implying an approximately 9.5% absolute increase in the default rate for these loans (about a 64% increase in the default rate relative to loans with owneroccupant borrowers). These results show that loans with misreported non-owner-occupancy status perform worse not only with respect to loans of owner-occupants, but also relative to loans of non-owner-occupants that are truthfully reported to investors. Thus, misrepresentation on the dimension of owner occupancy indicates borrowers with significantly lower-quality unobservable characteristics, even compared with truthfully reported non-owner-occupants. We also investigate how the impact of borrower occupancy status on default varies over the loan’s life. Figure IA.3 plots the resulting cumulative delinquency rates over time for properties with different borrower occupancy status, holding all other observables at the overall sample mean. Regardless of the age of the loan, those with misreported non-owner-occupants consistently default at higher levels than loans with truthfully reported non-owner-occupants and loans with truthfully reported owner-occupants. Next we investigate whether non-owner-occupancy misrepresentation is related to the mortgage interest rates charged by lenders. To do so, we estimate specifications for the mortgage interest rate with Misreported Non-Owner-Occupant and Reported Non-Owner-Occupant. In Table IA.VIII we find that Reported Non-Owner-Occupant has a positive and significant coefficient. This result is in line with the literature that finds that loans with non-owneroccupancy status are perceived to be riskier than those for owner-occupied properties. More interestingly, however, the coefficient on Misreported Non-Owner-Occupant is also positive and significant. This suggests that, to some extent, asset misrepresentations on this margin were captured in interest rates charged by lenders. Note, however, that the increase in interest rates for misreported non-owner-occupants is smaller than that for truthfully reported ones (by around 13 bps), even though we show that loans with misreported non-owner-occupants default significantly more than loans of truthfully reported non-owner-occupants.10 Overall, the evidence suggests that lenders were at least partly aware of the higher risk of misrepresented loans, because they charged higher interest rates on these loans. However, the interest rate markups on misrepresented loans are significantly smaller relative to loans with similar or even lower default risk – that is, mortgages truthfully reported to have a second lien or 10 To give an alternative perspective on these results, we also analyze the distribution of variation in interest rates across loans grouped by second-lien status and owner-occupancy status, controlling for other risk characteristics. Consistent with our earlier results, loans that truthfully report non-owner-occupant status receive higher interest rates than those of misreported non-owner-occupants, which in turn have interest rates that are higher than loans that are truthfully reported as owner-occupants (see Panel B of Figure IA.4). 15 to have a non-owner-occupant. Our results therefore suggest that, relative to prevailing interest pricing, interest rates on misrepresented mortgages did not fully reflect their higher default risk. These results are consistent with those in Table IA.VII indicating that misrepresented loans had higher default rates, even after conditioning on a host of observable characteristics including interest rates. C. Where Did Non-Owner-Occupancy Misrepresentation Occur? The data we use so far do not allow us to directly investigate where in the supply chain of credit (i.e., borrower, lender, and/or underwriter) the misrepresentations took place. To shed more light on this question, we again employ the New Century data to investigate owneroccupancy-status misrepresentation. Recall that the advantage of this data set is that it contains the loan characteristics that were internally recorded by the lender at loan origination. These data are unlikely to have been manipulated, since New Century entered bankruptcy quite quickly at the start of the crisis, and an independent trustee sold this data set during the bankruptcy process. The data allow us to assess whether New Century internal records correctly reflected non-owner occupancy status for loans that were misrepresented to investors on this dimension. Table IA.IX indicates that in the merged New Century sample, none of the loans we identify as having misreported non-owner-occupied status in Equifax was reported as for nonowner-occupied properties in the New Century data. Moreover, virtually all the loans that we identify as misrepresenting borrower occupancy status in the Equifax data were originated by brokers selling to New Century (more than 99%)—a much larger proportion relative to the overall percentage of broker-originated loans by New Century (about 65%). This evidence suggests that the misrepresentation concerning owner-occupancy status was made early in the origination process, possibly by the borrower or broker originating the loan on behalf of New Century. 11 Recall that our previous results show that loans with misrepresented owner-occupancy status had much higher default rates compared with loans that correctly reported non-owner occupancy status. Thus, if owner-occupancy misrepresentation is occurring at the borrower/broker level, the default results suggest that lenders may have been largely unaware of such misrepresentation and thus faced an adverse selection of borrowers. We further investigate the question of which party was aware of misrepresentation by assessing the credit history of borrowers associated with non-owner-occupancy misrepresentation. In particular, we exploit borrower credit card payment histories provided by Equifax and create a dummy variable that takes a value of one if the borrower is 60 days past due on any credit card account in the first two years following mortgage origination. We then estimate regressions that investigate how credit card defaults relate to our misrepresentation measures. 11 Berndt et al. (2010) analyze broker compensation using New Century data and find that brokers earned high profits on loans that turned out to be riskier ex post. 16 Table IA.X displays these results. We find that borrowers obtaining loans with misreported owner-occupancy status are about 7.6% more likely to default on their credit cards compared with borrowers who were truthfully reported as owner-occupants. Importantly, truthfully reported non-owner-occupants have a similar likelihood of default on credit cards as truthfully reported owner-occupants. In contrast, borrowers associated with second-lien misrepresentation had somewhat lower credit card defaults compared to borrowers with reported seconds. These results, taken together with our previous findings, reaffirm our prior result that financial institutions may have been unaware of the riskiness of borrowers in the case of owneroccupancy misrepresentation. Finally, we investigate the prevalence of non-owner occupancy misrepresentations across underwriters. To do so, we use specifications similar to Table IA.VI with Misreported NonOwner Occupant and Misreported Either as dependent variables with underwriter fixed effects in addition to the control variables. The Misreported Either variable takes the value of one if the loan has a misreported second lien, a misreported non-owner-occupant borrower, or both. In Figure IA.4, we plot the estimated level of underwriter misrepresentation, calculated using the underwriter fixed effects obtained from this regression and fixing other controls at their means, along with the 95% confidence interval. Panel A presents results for misrepresentation on the dimension of owner occupancy while Panel B presents results using both second-lien and owner-occupancy misrepresentation. The omitted category in these figures is Credit Suisse. We find that 4.43% of loans misrepresent the owner-occupancy status of the borrower, and 5.69% have either type of misrepresentation. The figure shows significant heterogeneity across underwriters in the propensity to securitize misrepresented loans. In particular, both panels of Figure IA.4 indicate sizeable variation across underwriters in the share of their loans having either type of misrepresentation. Importantly, all underwriters have misrepresentation levels significantly greater than zero, with pools underwritten by Countrywide and Lehman being the worst in terms of misrepresentations. Overall, although there is substantial heterogeneity across underwriters, a significant degree of misrepresentation exists across all underwriters, including the most reputable financial institutions. The result of a significant extent of asset quality misrepresentation on a dimension other than second-lien reporting confirms that our estimates regarding second-lien misrepresentation establish a lower bound for the overall amount of misrepresentation in the RMBS market. 17 References Berndt, Antje, Burton Hollifield, and Patrik Sandås, 2010, The role of mortgage brokers in the subprime crisis, NBER Working paper 16175. Chinco, Alexander, and Chris Mayer, 2012, Distant speculators and asset bubbles in the housing market, Working paper, [affiliation]. Ellul, Andrew, and Vijay Yerramilli, 2013, Stronger risk controls, lower risk: Evidence from U.S. bank holding companies, Journal of Finance 68, 1757-1803. Hartman-Glaser, Barney, 2013, Reputation and signaling in asset sales, Working paper, SSRN. Haughwout, Andrew, Donghoon Lee, Joseph Tracy, and Wilbert van der Klaauw, 2011, Real estate investors, the leverage cycle, and the housing market crisis, Federal Reserve Bank of New York Staff Reports, 514. Mayer, Christopher, Karen Pence, and Shane Sherlund, 2009, The rise in mortgage defaults, Journal of Economic Perspectives 23, 27-50. Piskorski, Tomasz, and Alexei Tchistyi, 2010, Optimal mortgage design, Review of Financial Studies 23, 3098-3140. Povel, Paul, Rajdeep Singh, and Andrew Winton, 2007, Booms, busts, and fraud, Review of Financial Studies 20, 1219-1254. 18 Table IA.V Descriptive Statistics and Percent of Loans with Misreported Non-Owner-Occupants The table presents summary statistics of key variables for mortgages reported as owner-occupied to the RMBS trustee (loans reported as such in the BlackBox dataset). The sample consists of these loans merged with a high level of confidence with the credit bureau data. Interest Rate is the loan interest rate at origination in percentage terms. FICO is the borrower’s FICO credit score at loan origination. Balance is the initial loan balance (in thousands of dollars). CLTV is the loan’s origination combined loan-to-value ratio in percentage terms reported to investors. No Cash Out Refi and Cash Out Refi are dummies that take a value of one if the loan purpose was a no cash-out refinancing or cash-out refinancing, respectively, and zero otherwise. Low or No Doc. is a dummy that takes a value of one if the loan was originated with no or limited documentation, and is zero otherwise. ARM and Option ARM are dummies that take a value of one if the loan type was an ARM or option ARM, respectively, and are zero otherwise. Interest Rate FICO Balance CLTV Purchase No Cash Out Refi Cash Out Refi Low or No Doc. ARM Mean 6.60 680. 324.6 83.72 0.49 0.13 0.35 0.46 0.52 SD 2.02 73.20 248.3 11.71 0.50 0.34 0.48 0.49 0.49 Option ARM 0.10 0.30 Number of Loans 1,563,223 Percent of Loans for Misreported Non-Owner-Occupants 6.42 Percent of Fully Documented Loans for Misreported Non-Owner-Occupants 4.79 Percent of Purchase Loans for Misreported Non-Owner-Occupants 9.67 Percent of Refinance Loans for Misreported Non-Owner-Occupants 3.12 19 Table IA.VI Non-Owner-Occupancy Misrepresentation and Loan Characteristics This table presents OLS estimates from regressions in which the dependent variable takes a value of one if the loan is Misreported Non-Owner Occupant and zero otherwise. All the controls represent the values reported to investors (the corresponding values in the BlackBox database). Column (2) adds the fixed effects corresponding to the loan origination date, with 2005 being the omitted category. Column (3) adds fixed effects corresponding to the location of the property backing the loan. Column (4) adds fixed effects capturing the identity of the underwriter that sold the loan. Column (5) reports standard errors clustered at the state level. All estimates are in percentage terms; standard errors are in parentheses. * p < 0.10, ** p < 0.05, and *** p < 0.01. (1) 0.00907*** (0.000369) (2) 0.00902*** (0.000371) (3) 0.00915*** (0.000372) (4) 0.0103*** (0.000381) (5) 0.0103*** (0.00159) 0.00262*** (0.0000872) 0.00256*** (0.0000891) 0.00133*** (0.0000948) 0.00144*** (0.0000955) 0.00144*** (0.000529) CLTV 0.0274*** (0.00200) 0.0267*** (0.00200) 0.0390*** (0.00201) 0.0378*** (0.00201) 0.0378*** (0.00884) No Cash Out Refi -5.499*** (0.0617) -5.536*** (0.0619) -5.285*** (0.0622) -5.231*** (0.0623) -5.231*** (0.508) Cash Out Refi -6.266*** (0.0495) -6.286*** (0.0496) -6.465*** (0.0499) -6.478*** (0.0501) -6.478*** (0.643) Low or No Doc. 2.824*** (0.0422) 2.821*** (0.0423) 2.510*** (0.0424) 2.577*** (0.0441) 2.577*** (0.193) ARM 2.153*** (0.0437) 2.173*** (0.0440) 1.835*** (0.0446) 1.596*** (0.0460) 1.596*** (0.182) Option ARM 7.229*** (0.0915) 7.126*** (0.0951) 6.824*** (0.0954) 6.592*** (0.0987) 6.592*** (0.468) Originated in 2006H1 0.0126 (0.0526) -0.0472 (0.0525) 0.0803 (0.0532) 0.0803 (0.104) Originated in 2006H2 0.183*** (0.0570) 0.114** (0.0570) 0.151*** (0.0580) 0.151 (0.0967) Originated in 2007H1 0.456*** (0.0664) 0.352*** (0.0664) 0.295*** (0.0680) 0.295 (0.192) Originated in 2007H2 -0.773*** (0.169) -0.787*** (0.168) -0.458*** (0.170) -0.458 (0.286) No No No 1563223 No No No 1563223 Yes No No 1563223 Yes Yes No 1563223 Yes Yes Yes 1563223 6.426 0.0300 6.426 0.0301 6.426 0.0346 6.426 0.0359 6.426 0.0359 FICO Balance State Fixed Effects Underwriter Fixed Effects SEs Clustered by State Number of Loans Percent Misrepresented R2 20 Table IA.VII Non-Owner-Occupancy Misrepresentation and Loan Default The table presents OLS estimates from regressions in which the dependent variable takes a value of one if the mortgage ever defaults (goes 90 days past due on payments) in the first two years since origination, and zero otherwise. Panel A shows the results with Misreported Non-Owner-Occupant as a control variable. Panel B shows the results with both Misreported Non-Owner-Occupant and Reported Non-Owner-Occupant as control variables, where the excluded category is loans truthfully reported as for owner-occupied properties. “Other Controls” include origination variables reported to investors used in Table IA.VI such as FICO and LTV ratios. Squared and cubed terms for FICO and LTV ratios are included to account for potential nonlinear effects. The estimates are in percentage terms; standard errors are in parentheses. * p < 0.10, ** p < 0.05, and *** p < 0.01. Misreported Non-Owner Occupant Panel A: Mortgage Default and Misreported Non-Owner-Occupant All Years 2005 2006H1 2006H2 9.35*** 4.04*** 10.13*** 15.58*** (0.11) (0.12) (0.25) (0.3) 2007H1 14.00*** (0.39) 2007H2 14.38*** (0.94) Other Controls Yes Yes Yes Yes Yes Yes Half-Year Origination Cohorts Number of Loans Percent Default Percent Misrepresented R2 Yes No No No No No 1,563,223 15.28 6.426 0.157 743,827 7.150 5.904 0.0727 327,128 17.02 6.903 0.108 282,904 26.51 7.167 0.149 187,352 27.36 6.535 0.184 22,012 17.11 6.528 0.214 Panel B: Mortgage Default and Misreported versus Reported Non-Owner Occupants (1) (2) (3) Misreported Non-Owner 10.72*** 9.992*** 9.48*** Occupant (0.11) (0.11) (0.11) (4) 9.48*** (0.71) 4.02*** (0.075) 3.57*** (0.073) 3.45*** (0.074) 3.45*** (0.62) Other Controls Yes Yes Yes Yes Half-Year Origination Cohorts No Yes Yes Yes State Fixed Effects No No Yes Yes No 1,827,497 14.89 6.424 0.103 No 1,827,497 14.89 6.424 0.153 No 1,827,497 14.89 6.424 0.161 Yes 1,827,497 14.89 6.424 0.161 Reported Non-Owner Occupant SEs Clustered by State Number of Loans Percent Default Percent Misrepresented R2 21 Table IA.VIII Non-Owner-Occupancy Misrepresentation and Mortgage Interest Rates The table presents OLS estimates from regressions in which the dependent variable is the mortgage interest rate at origination, with Misreported Non-Owner-Occupant and Reported Non-Owner-Occupant as control variables. “Other Controls” include origination variables reported to investors such as FICO and LTV ratios. Column (1) includes only these variables. Column (2) incorporates half-year origination fixed effects. Column (3) also includes property state fixed effects. Column (4) clusters standard errors by state. The estimates are in percentage terms; standard errors are in parentheses. * p < 0.10, ** p < 0.05, and *** p < 0.01. Misreported Non-Owner Occupant (1) 0.258*** (0.004) (2) 0.212*** (0.004) (3) 0.229*** (0.004) (4) 0.229*** (0.005) Reported Non-Owner Occupant 0.393*** (0.003) 0.365*** (0.003) 0.351*** (0.003) 0.351*** (0.02) Other Controls Yes Yes Yes Yes Half-Year Origination Cohorts No Yes Yes Yes State Fixed Effects No No Yes Yes No 1,827,497 6.563 6.424 0.611 No 1,827,497 6.563 6.424 0.674 No 1,827,497 6.563 6.424 0.680 Yes 1,827,497 6.563 6.424 0.680 SEs Clustered by State Number of Loans Mean Interest Rate Percent Misrepresented R-Squared 22 Table IA.IX Non-Owner-Occupancy Misrepresentation: Evidence from New Century Loans Panel A presents summary statistics of key variables for New Century loans with reported owner-occupant status that have been merged with the BlackBox-Equifax data set. Panel B shows the percentage of these loans for which the misrepresented non-owner-occupant status is correctly reported in the New Century internal data. Panel A: Characteristics Based on Owner-Occupancy Status in Merged Data Reported as Identified by Us as Having Owner-Occupant Misreported Non-Owner-Occupancy Mean SD Mean SD Default within 2 Years 0.210 0.408 0.345 0.477 Interest Rate 8.222 1.140 8.236 1.269 FICO 617.7 58.12 656.3 67.32 Number of Loans 3,160 148 Panel B: Non-Owner-Occupancy Misrepresentations in the New Century Internal Database Percentage Count Misreported Non-Owner-Occupant Reported as Non-Owner0% 148 Occupant in New Century Data set 23 Table IA.X Misrepresentations and Defaults on Credit Cards The table presents OLS estimates from regressions in which the dependent variable takes a value of one if the borrower defaults on credit card debt (60 day payment delinquency) in the first two years since mortgage origination, and zero otherwise. Panel A displays results with Misreported Second x CLTV >= 100, Misreported Second x CLTV < 100, Reported Second x CLTV >= 100, and Reported Second x CLTV < 100 as controls. The CLTV term (computed using credit bureau data) takes a value of one if the loan has a CLTV in the appropriate range, and zero otherwise. Panel B shows the results with Misreported Non-Owner-Occupant and Reported Non-Owner-Occupant as controls. Column (1) includes FICO and controls related to credit card debt, such as utilization and monthly payments. Column (2) includes state fixed effects. Column (3) includes full mortgage controls, such as loan balance and interest rate. Column (4) clusters standard errors by state. Panel B controls for the reported CLTV. The estimates are in percentage terms; t-statistics are in parentheses. * p < 0.10, ** p < 0.05, and *** p < 0.01. Panel A: Credit Card Default and Misreported versus Reported Second-Lien (1) (2) (3) Misreported Second x CLTV < 100 -3.105*** -2.640*** -1.814*** (0.384) (0.382) (0.38) (4) -1.814*** (0.591) Reported Second x CLTV < 100 -2.759*** (0.156) -2.462*** (0.156) -1.601*** (0.163) -1.601*** (0.418) Misreported Second x CLTV >= 100 -2.462*** (0.263) -1.155*** (0.262) -2.123*** (0.271) -2.123*** (0.663) Reported Second x CLTV >= 100 -2.023*** (0.124) -0.696*** (0.125) -0.854*** (0.152) -0.854 (0.61) Other Controls Yes Yes Yes Yes State Fixed Effects No Yes Yes Yes Mortgage Controls No No Yes Yes No 1,023,725 38.36 5.233 0.0769 No 1,023,725 38.36 5.233 0.0859 No 1,023,725 38.36 5.233 0.109 Yes 1,023,725 38.36 5.233 0.109 SEs Clustered by State Number of Loans Percent CC 60 DPD Percent Misrepresented R2 Panel B: Credit Card Default and Misreported versus Reported Non-Owner-Occupants (1) (2) (3) Misreported Non-Owner Occupant 7.636*** 7.305*** 5.959*** (0.156) (0.155) (0.155) (4) 5.959*** (0.319) 0.433*** (0.102) 1.144*** (0.103) -1.024*** (0.106) -1.024*** (0.364) Other Controls Yes Yes Yes Yes State Fixed Effects No Yes Yes Yes Mortgage Controls No No Yes Yes No 1,709,386 37.28 6.580 0.0760 No 1,709,386 37.28 6.580 0.0831 No 1,709,386 37.28 6.580 0.112 Yes 1,709,386 37.28 6.580 0.112 Reported Non-Owner Occupant SEs Clustered by State Number of Loans Percent Default Percent Misrepresented R2 24 Panel A: Cumulative Defualt Rate by Occupancy Status Panel B: Loan Interest Rate by Occupancy Status Figure IA.3. Owner-occupancy status, cumulative default rates, and interest rates. Panel A of the figure shows the estimated cumulative default rate of a mortgage in the first eight quarters since origination for subsamples of loans differing based on their owner-occupancy status, holding constant at their overall sample mean all other observables used in Table IA.V such as FICO. Panel B shows the kernel density plots of the observed difference between actual and predicted origination interest rates for the same subsamples of the overall data set. The predicted rates used in Panel B are given by a specification similar to that used in Column (4) of Table IA.VIII, with the Reported Non-Owner-Occupant and Misreported Non-Owner-Occupant variables removed. In Panel B the x-axis shows the error term from these regressions (the difference between the actual origination interest rate and the predicted value). 25 Panel A: Misreported Non-Owner Occupant Panel B: Misreported Either Figure IA.4. Misrepresentations across underwriters. This figure plots the percentage of loans with misreported non-owner occupants (Panel A), and misreported either (Panel B) by underwriter along with a 95% confidence interval. Coefficients result from adding underwriter fixed effects to similar specifications as in Table II. These levels are obtained by adding each underwriter fixed effect to the level of misrepresentation for the omitted category (Credit Suisse) with the other covariates at their means. The dashed straight lines show the mean misrepresentation level in the data. 26 III. Additional Results Table IA.XI Misrepresentations and Loan Characteristics The table presents probit estimates from regressions in which the dependent variable takes a value of one if the loan has the given misrepresentation, and zero otherwise. Columns (1) and (2) show the results with Misreported SecondLien as the dependent variable, while columns (3) and (4) show the results with Misreported Non-Owner-Occupant as the dependent variable. The estimates show the marginal effects in percentage terms; standard errors are in parentheses. * p < 0.10, ** p < 0.05, and *** p < 0.01. These estimates can be compared to the OLS estimates presented in Table III of the published article. FICO Misreported Second Lien (1) (2) 0.00523*** 0.00531*** (-0.00040) (-0.00040) Misreported Non-Owner-Occupant (3) (3) 0.00794*** 0.00777*** (-0.00034) (-0.00034) -0.00783*** (0.00013) -0.00741*** (0.00013) 0.00209*** (-0.00007) 0.00200*** (-0.00007) CLTV -0.118*** (0.00270) -0.112*** (0.00270) 0.0222*** (-0.00188) 0.0213*** (-0.00188) No Cash Out Refi -3.74*** (0.04221) -3.59*** (0.04251) -3.85*** (0.035422) -3.87*** (0.035349) Cash Out Refi -9.78*** (0.06194) -9.57*** (0.061551) -5.43*** (0.039087) -5.45*** (0.039135) Low or No Doc. -1.05*** (0.05019) -1.04*** (0.049642) 2.61*** (-0.03972) 2.61*** (-0.03980) ARM 5.14*** (-0.05975) 5.10*** (-0.059316) 2.13*** (-0.040891) 2.15*** (-0.040999) Option ARM -2.76*** (0.08425) -2.87*** (0.085264) 8.27*** (-0.139085) 8.02*** (-0.142755) Yes Yes Yes Yes No 854,959 7.131 Yes 854,959 7.131 No 1,563,223 6.426 Yes 1,563,223 6.426 Balance Confidence Match Controls Half-Year Origination Cohorts Number of Loans Percent Misrepresented 27 Table IA.XII Misrepresentations and Loan Default The table presents probit estimates from regressions in which the dependent variable takes a value of one if the mortgage ever defaults (goes 90 days past due on payments) in the first two years following origination, and zero otherwise. Panel A shows results with Misreported Second x CLTV >= 100, Misreported Second x CLTV < 100, Reported Second x CLTV >= 100, and Reported Second x CLTV < 100 as control variables. The CLTV term (computed from the credit bureau data) in these interactions takes a value of one if the loan has a CLTV ratio in the appropriate range, and zero otherwise. Panel B shows the results with Misreported Non-Owner-Occupant and Reported Non-Owner-Occupant as control variables. “Other Controls” include origination variables used in Table III such as FICO, interest rates, and LTV ratios. Squared and cubed terms for FICO and LTV are included to account for potential nonlinear effects. Panel B also controls for the reported CLTV to investors. Column (1) includes half-year origination fixed effects. Column (2) incorporates property state fixed effects. Column (3) clusters standard errors by state. The estimates show the marginal effects in percentage terms; standard errors are in parentheses. * p < 0.10, ** p < 0.05, and *** p < 0.01. Panel A: Mortgage Default and Misreported versus Reported Second Lien (1) (2) Misreported Second x CLTV < 100 8.20*** 8.95*** (0.375) (0.383) (3) 8.95*** (0.467) Reported Second x CLTV < 100 8.81*** (0.159) 9.19*** (0.16) 9.19*** (0.887) Misreported Second x CLTV >= 100 13.4*** (0.263) 14.3*** (0.269) 14.3*** (1.64) Reported Second x CLTV >= 100 15.5*** (0.148) 16.2*** (0.15) 16.2*** (2.11) Yes Yes No No 1,109,250 17.10 5.212 Yes Yes Yes No 1,109,250 17.10 5.212 Yes Yes Yes Yes 1,109,250 17.10 5.212 Other Controls Half-Year Origination Cohorts State Fixed Effects SEs Clustered by State Number of Loans Percent Default Percent Misrepresented Panel B: Mortgage Default and Misreported versus Reported Non-Owner Occupants (1) (2) (3) Misreported Non-Owner-Occupant 9.03*** 8.34*** 8.34*** (0.13) (0.13) (0.78) Reported Non-Owner-Occupant 3.89*** (0.08) 3.72*** (0.08) 3.72*** (0.76) Other Controls Half-Year Origination Cohorts State Fixed Effects SEs Clustered by State Number of Loans Percent Default Percent Misrepresented Yes Yes No No 1,827,497 14.89 6.424 Yes Yes Yes No 1,827,497 14.89 6.424 Yes Yes Yes Yes 1,827,497 14.89 6.424 28 Table IA.XIII Second-Lien Misrepresentation among New Century Loans: Characteristics of New Century Loans Merged with the BlackBox-Equifax Data set This table presents summary statistics of key variables for New Century loans reported as having no second lien to the RMBS trustees that have been merged with the BlackBox-Equifax data set. Default within 2 Years Interest Rate FICO Number of Loans Reported as Having No Second Lien Mean SD 0.288 0.453 8.409 1.356 594.9 55.65 10,924 29 Identified by us as Having Misreported Second Lien Mean SD 0.403 0.491 7.642 1.082 645.2 41.69 1,279 Table IA.XIV Second-Lien Misrepresentation and Pool Characteristics This table presents OLS estimates from regressions in which the dependent variable is the percent of loans in a pool with misreported second liens. The controls are the pool-level averages of the loan characteristics reported to investors (e.g., FICO, CLTV, etc). The estimates are in percentage terms; standard errors are in parentheses. * p < 0.10, ** p < 0.05, and*** p < 0.01. Percent Misreported Second Lien (1) (2) FICO -0.0719*** (0.014) -0.0707*** (0.014) Orig. Balance -0.00736*** (0.0018) -0.00795*** (0.0018) CLTV -0.931*** (0.133) -0.944*** (0.132) No Cash Out Refi -0.136*** (0.045) -0.131*** (0.044) Cash Out Refi -0.253*** (0.037) -0.251*** (0.037) Percent No or Low Doc. 0.00954 (0.015) 0.00767 (0.015) Percent ARM 0.00299 (0.008) 0.00188 (0.008) Percent Option ARM 0.0492** (0.019) 0.0409** (0.019) Half-Year FE Number of Loans Percent Misrepresented No 333 4.289 Yes 333 4.289 R2 0.220 0.233 30 Table IA.XV Heterogeneity of Misrepresentation among Underwriters The table presents the OLS estimates from regressions in which the dependent variable takes a value of one if the loan has the given violation and zero otherwise. Column (1) shows results with Misreported Second as the dependent variable, column (2) shows the results with Misreported Non-Owner-Occupant as the dependent variable (see section II of this appendix for more details), and column (3) shows the results with misrepresentation on either dimension as the dependent variable (either misreported). Underwriter RMBS Volume/Assets is the volume of loans in our base sample underwritten in 2005 by the loan’s deal underwriter divided by the underwriter’s assets as of the end of 2005. Underwriter Years in Subprime is the number of years between the loan’s securitization year and the first subprime deal underwritten by the deal’s underwriter, excluding deals prior to 1999. Commercial Bank Underwriter is a dummy that takes a value of one if the underwriter is a commercial bank and zero otherwise. RMI is an index of the risk-management strength of an institution based on Ellul and Yerramilli (2013). Bonus to Salary Ratio is the ratio of cash bonus to salary for the underwriter’s five highest-paid executives. These measures are discussed in section I of this appendix. “Other Controls” include origination variables used in column (3) of Table III, such as FICO, interest rate, and CLTV ratio. Standard errors are clustered by underwriter; the estimates are in percentage terms; standard errors are in parentheses. * p < 0.10, ** p < 0.05, and *** p < 0.01. (1) Misreported Second 0.546 (-2.600) (2) Misreported Non-Owner Occupant 1.319* (-0.753) (3) Either Misreported 1.528 (-1.273) Underwriter Years in Subprime (Post-1999) -0.299 (0.543) -0.0341 (0.155) -0.200 (0.215) Commercial Bank Underwriter -0.0161 (1.610) -0.464 (0.545) -0.110 (1.222) RMI -0.434 (3.338) -0.581 (0.806) -0.636 (1.817) Bonus to Salary Ratio -0.0126 (0.052) -0.00423 (0.010) -0.0141 (0.028) Half-Year Origination Cohorts Yes Yes Yes Other Controls Number of Loans Percent Misrepresented R2 Yes 853,954 7.131 0.0672 Yes 1,561,625 6.426 0.0304 Yes 1,700,264 9.192 0.0401 Underwriter RMBS Volume/Assets 31 Figure IA.5. Distribution of interest rates across second lien status. This figure shows the kernel density plots of the observed difference between actual and predicted origination interest rates for different subsamples of the overall data set. The predicted rates are given by a specification similar to that used in Table II, with the Misreported Second x CLTV >= 100, Misreported Second x CLTV < 100, Reported Second x CLTV >= 100, and Reported Second x CLTV < 100 variables removed. The x-axis shows the error term from these regressions (the difference between the actual origination interest rate and the predicted value). 32 A Simple Example of Equilibrium with Asset Misrepresentation Consider a simple one-period economy with an equal measure of households and bankers. The utility function of both households and bankers is linear in consumption (endowment good): u(c)=c. Each household is endowed with two units of the endowment good. The endowment can be consumed or invested. Each banker has no initial endowment but can invest on behalf of households. Hence, given the equal population of bankers and households, the per capita initial endowment equals one. There are two types of banks of equal measure. “Good” banks have access to an investment opportunity that requires one unit of investment at time 0 and pays households four units of the consumption good with probability 0.5 or zero units with probability 0.5 at time 1 at no additional cost. This is a positive NPV project. “Bad” banks have a technology that pays households four units with probability 0.1 and zero with probability 0.9 from one unit of investment. Bad banks are also able to use the technology to generate a hidden benefit of 0.1*investment (e.g., by expropriating some part of the investment). This is a negative NPV project. A. Pooling Equilibrium with Asymmetric Information Consider the case in which the bank’s type is unobserved to households (i.e., bad banks can pretend to be good banks). The households know that half of the banks are bad. Without loss of generality we can assume that households evenly split their endowment across banks so each bank gets one unit. A competitive pooling equilibrium in this case will be such that: (i) households give all their endowment to bankers, (ii) bankers offer investment service at zero cost, (iii) the aggregate payment to households that invest two units (average expected payment to a household) is: 0.5*4+0.5*0.1*4=2.2, and (iv) each of the “bad” bankers consumes 0.1 . Note that in this equilibrium households cannot tell apart positive and negative NPV projects. They know that in equilibrium half of the banks that pretend to be good are actually bad. Yet they decide to invest because the average NPV across projects of good and bad banks is positive, that is, 2.2>2. Overall, due to banks, the economy moves from one unit of consumption good per capita to 1.15 units (0.5*2.2+0.5*0.1). Importantly, despite the fact that households correctly anticipate the extent of average misrepresentation in equilibrium, eliminating misrepresentation can increase the overall surplus. To see this, consider the case in which there is no asymmetric information about bank type, which we discuss next. 33 B. Equilibrium with No Asymmetric Information When the bank’s type is observed by households (bad banks cannot pretend to be good banks), a competitive equilibrium in this case will be such that: (i) households give all their endowment to good banks, (ii) good banks offer investment services at zero cost, (iii) the aggregate payment to households (average payment to a household is): 2*0.5*4=4, and (iv) each of the “bad” bankers gets no investment and consumes zero. In this case, due to good banks, the economy moves from one unit of the consumption good to two units per capita. Comparing this case to the one with asymmetric information, it is clear that the overall net surplus from the investment is larger (1 versus 0.15 in the previous case). 34