Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Copyright by Shinya Wakao 2012 The Report Committee for Shinya Wakao Certifies that this is the approved version of the following report: Wall Street News on Main Street APPROVED BY SUPERVISING COMMITTEE: Robert C. Luskin, Supervisor Scott Moser Wall Street News on Main Street by Shinya Wakao, B.A.; M.A. REPORT Presented to the Faculty of the Graduate School of The University of Texas at Austin in Partial Fulfillment of the Requirements for the Degree of MASTER OF ARTS THE UNIVERSITY OF TEXAS AT AUSTIN August 2012 Wall Street News on Main Street Shinya Wakao, M.A. The University of Texas at Austin, 2012 Supervisor: Robert C. Luskin Over the past decades, people have had an increasing chance to receive economic information, especially news related to the stock market. This is because the fraction of the U.S. population owning stocks has increased rapidly. However, it does not mean that a majority of news sources have started to deal with financial news more. We do not know how traditional media, such as newspapers, have dealt with financial news during the same period, nor do we know the influence of this environmental change on political attitudes. In this report, I analyze the type of contexts in which the stock market has been described in The New York Times from 1981 to 2011 by Wordfish and the Latent Dirichlet Allocation (LDA) model. I find that a plunge in the stock market and political events affect the amount of political topics in stock market news. In particular, after the financial crisis of 2008–2009, stock market news consisted of economic, political, and social topics. iv Table of Contents Abstract iv List of Tables vi List of Figures vii Chapter 1. Introduction 1 Chapter 2. Background 2.1 Analysis of News Coverage of the Stock Market . . . . . . . . . . . . Chapter 3. Empirical Study 3.1 Data . . . . . . . . . . . . . . . . . . 3.2 Method: Wordfish and Topic Models 3.2.1 Assumption of Wordfish . . . . 3.2.2 Process . . . . . . . . . . . . . 3.3 Topic Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Chapter 4. Results 4.1 Term Frequencies . . . . . . . . . . . . . . . . . . 4.1.1 Wall Street, Stock Market, and Dow Jones 4.1.2 Presidential candidates . . . . . . . . . . . 4.1.3 Congress, Political Parties . . . . . . . . . . 4.1.4 Policy-related Words . . . . . . . . . . . . . 4.2 Wordfish Estimation . . . . . . . . . . . . . . . . 4.3 Topic Model Estimation . . . . . . . . . . . . . . Chapter 5. Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 7 . . . . . 10 10 14 16 16 18 . . . . . . . 20 20 20 21 22 23 25 29 37 Bibliography 39 v List of Tables 4.1 4.2 4.3 4.4 4.5 4.6 4.7 the Estimated Five Topics in Five Topics in Five Topics in Five Topics in Five Topics in Five Topics in β and ψ from Select Terms in Figures 4.1 to 4.4 the Stock Market Paragraphs from 1981 to 1985 the Stock Market Paragraphs from 1986 to 1990 the Stock Market Paragraphs from 1991 to 1995 the Stock Market Paragraphs from 1996 to 2000 the Stock Market Paragraphs from 2001 to 2005 the Stock Market Paragraphs from 2006 to 2011 vi . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 31 32 33 34 35 36 List of Figures 3.1 4.1 4.2 4.3 4.4 4.5 4.6 (Top) Numbers of Stock Market Articles in The New York Times from 1981 to 2011. (Bottom) Percentage of “Wall Street” Articles in terms of “Desk” from 1981 to 2011 . . . . . . . . . . . . . . . . . . . . . . . Frequency of “Stock Market,” “Wall Street,” and “Dow Jones” in The New York Times from 1981 to 2011 . . . . . . . . . . . . . . . . . . . Frequency of Presidential Candidates in The New York Times from 1981 to 2011 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Frequency of “Congress” and Political Parties in The New York Times from 1981 to 2011 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Frequency of Policy Terms in The New York Times from 1981 to 2011 the Estimated Location of Terms in The New York Times from 1981 to 2011 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . the Estimated Location of Stock Market Paragraphs in The New York Times from 1981 to 2011 . . . . . . . . . . . . . . . . . . . . . . . . . vii 13 21 22 23 24 27 28 Chapter 1 Introduction People receive economic information from many sources - the TV, Internet, print, radio, and other news media outlets - using the information to evaluate the health of the national economy. The media provide us with not only economic indicators such as unemployment rate and GDP growth, but also related information and analyses that help people better understand the health of the economy. In addition, people viscerally feel the effects of inflation, such as when they put gas into their cars. In other words, people consciously and unconsciously understand how healthy the national economy is through media coverage as well as their daily activities. How the public evaluates the health of the national economy politically is a major concern for politicians - both incumbents and challengers - during elections because economic conditions affect the evaluation of incumbents as well as the election outcomes (Kinder and Kiewiet, 1979, 1981; Kiewiet, 1983; MacKuen, Erickson and Stimson, 1992; Fiorina, 1981). Yet what kinds of economic indicators does the public examine when evaluating the economic conditions as well as incumbents political performance? As information via the media is the primary source of information for most 1 of the public, how the media treat the national economy politically is also an important matter. Government agencies report periodic economic indicators, such as the unemployment rate, inflation rate, and GDP, which the public uses for political evaluations (e.g., Vavreck, 2009). Meanwhile, the media cover stock market conditions more often than other indictors. Indeed, business and financial media such as CNBC report on it every moment. When the stock market rises or falls dramatically, it becomes sensational news. People can also check past and current stock prices online. However, the media also treat the stock market differently. Companies are evaluated based on their stock conditions. When a company goes public, it sometimes becomes a social phenomenon, such as Netscape in 1995, Google in 2004, and Facebook in 2012. In other words, news about the stock market is more dynamic and is reported upon at a higher frequency compared to news related to other economic indicators. I believe that this lack of study results from at least two reasons. First, the stock market per se has not been as important for middle-class citizens in the U.S. Although the media reported the stock markets ups and downs, such news did not affect the majority of people, only those who manage their assets in the stock market - or at least people did not care about this information. However, the relationship between the stock market and American citizens has become closer over time. As of 2012, more than fifty percent of American households invest their money in the stock market directly and/or indirectly. One incentive for people to invest in the stock market, even if they were initially not interested in doing so, is that fact that many companies provide 401(k) benefits. The American taxation system also provides people with an incentive to invest their money in the stock market as a 2 form of a retirement plan. For 2011 and 2012, people can contribute as much as 5,000 dollars a year, which is tax deductible, to their traditional or Roth individual retirement accounts (IRAs). However, the recent financial crisis had a significant impact on such investments. According to Bricker et al. (2011), more than sixty percent of families lost their wealth from 2007 to 2009; stocks demonstrated the sharpest declines among their assets. Needless to say, the financial crisis since 2008 and its aftermath became a main concern politically and economically among many citizens. The second reason for the lack of research into the relationship between the stock market and media relates to methodological reasons. Traditionally, scholars employ a human-coding method when they conduct a content analysis of text data. Manually coding the data is difficult because the researcher must deal with the significant size of data in the form of text files (e.g., newspaper articles or transcripts of audio tapes). For instance, most previous studies that have included a content analysis based on newspaper articles have focused on articles from specific newspapers within a very limited period of time. As a result, we do not understand the change in news coverage for certain topics over the longer term. However, scholars have recently begun to employ the text mining method, which conducts content analysis using computer programs to overcome the limitations of the traditional content analysis methodology. This new approach enables scholars to deal with the significant amount of text data at lower costs of resources in terms of time and funds. For example, Ho and Quinn (2008) analyzed the media context of more than 1,500 editorials in 25 major American newspapers from 1994 3 to 2004, which would be practically impossible using the traditional content analysis method. The objective of the current report is to analyze whether the stock market has only been mentioned in economic news, or it also appears with other topics such as politics. If the stock market became politically important, politicians would discuss it more in their speeches, and newspapers would include more information about it. At the same time, editorials would blame the president for declines in the stock market. I collected stock market news reported in The New York Times from 1981 to 2011 and examined term frequencies and estimated the latent trend by year as well as the latent topics in each year using two text mining methods: Wordfish (Slapin and Proksch, 2008) and topic models (Blei and Lafferty, 2009). 4 Chapter 2 Background Compared to political information, people have more opportunities to receive economic information regardless of their ability, opportunity, and motivation.1 The by-product theory of information suggests that people receive economic information through daily activities; for example, people recognize increased inflation rates in gasoline prices (Popkin, 1991). Other studies have found that people recognize increases in the unemployment rate not only from personal experience, but also from conversations with others (known as the interpersonal context) in terms of co-workers’ layoffs, for example (Mutz, 1992). Meanwhile, communication scholars have emphasized the importance of media among the public. The media-dependency theory argues that the more individuals rely on media, the more important the media have become to those individuals (BallRokeach and DeFleur, 1976). In the context of politics and economy, the public relies on television and newspapers for economic information as understanding the economy requires a variety of knowledge and information (MacKuen, Erickson and Stimson, 1992). 1 Regarding the role of motivation, opportunity, and ability in political knowledge, see Luskin (1990). 5 First, economic voting literature shows that the electorate’s voting choice depends on the health of the U.S. national economy (Kinder and Kiewiet, 1979, 1981; Kiewiet, 1983; MacKuen, Erickson and Stimson, 1992; Fiorina, 1981). Since the seminal work by Kinder and Kiewiet (1979), many scholars have analyzed the economic determinants of voting choice among the electorate, focusing on whether the change of personal economic situation affects voting choice (pocketbook hypothesis) or the health of state/national economy affects it (sociotropic hypothesis). According to Lewis-Beck (2006), more than 400 articles about economic voting behaviors have been published. In the U.S., more evidence has substantiated the sociotropic hypothesis than the pocketbook hypothesis. Second, if the electorate understands the condition of the national economy accurately and without bias, what scholars need to do is simply focus on the relationship between the macro-economy indicators and election results. However, in reality, media coverage about the national economy is neither accurate nor unbiased; rather, the media have demonstrated a tendency to report more negative news than positive (Harrington, 1989; Hetherington, 1996; Zaller, 1992), report more incumbent-friendly coverage (Hofstetter, 1978; Page and Shapiro, 1992; Brody, 1991), and lean toward specific parties (Conover, Feldman and Knight, 1987; Mutz, 1992; Jackman, 1993). Leaning toward a specific party or ideology is especially evident in talk radio (Barker, 1998) and on cable channels (Della Vigna and Kaplan, 2007). In addition, the media per se influence citizen perception in two ways: priming and framing. Iyengar (1991) defined two types of framing effects: episodic and thematic. Episodic framing provides more individual-oriented coverage and attributes 6 personal responsibilities. For example, news coverage about poverty on TV tends to use episodic framing, which implies that the viewer perceives poverty not as a social issue, but as a personal problem. On the other hand, thematic coverage includes more general, background information about the issue. In addition, viewers and readers interpret coverage using thematic framing as a social issue; thus, people recognize the issue as a social problem and blame decision makers. Iyengar (1991) also demonstrated that unemployment coverage on TV news tends to frame the content using a thematic structure. In other words, media coverage of unemployment tends to provide more general statistical information and social background than personal stories of individuals. Although Iyengar (1991) only dealt with the coverage of unemployment, I assume that we can apply the same characteristics of media coverage for other issues commonly covered, including the inflation rate and the GDP, as regular government statistics related to these issues are similarly reported. 2.1 Analysis of News Coverage of the Stock Market When scholars analyze the relationship between national economy and polit- ical attitudes, they employ unemployment rate, inflation, and GDP. For example, Nadeau et al. (1999) analyzed economic news coverage in the AP and Washington Post from January 1, 1977, to July 1, 1995, but they focused on the unemployment rate and inflation. Harrington (1989), on the other hand, analyzed TV networks’ economic coverage, using the unemployment rate, inflation rate, and growth rate of real GNP. I argue that such previous studies focus less on the effect of media coverage of the stock market and its relationship with politics because, for the majority of 7 Americans throughout history, the stock market per se did not personally affect them whereas other indicators did. Prior to the 1990s, primarily rich people owned stocks, and the condition of the stock market had less of an impact on Americans’ personal financial conditions. In addition, people did not consider stock market conditions when they evaluated the overall health of the national economy. However, several factors have changed for personal finances, making the stock market-related news relevant not only for the rich, but also for ordinary citizens in the U.S. First, many more people have come to use stocks and mutual funds as retirement assets. Many companies provide 401(k) and stock options as employee benefits, while many individuals invest in stocks and mutual funds through IRAs. The percentage of American families owning stock has risen from approximately 25% in 1978 to 32% in 1989 and 51% in 2007 (Avery et al., 1984; Bucks et al., 2009). This rapid increase occurred mainly with the entrance of the middle class into the stock market during the 1990s. Similarly, the percentage of middle-income families (in the 40 to 59.9 income percentile) owning stocks has risen from 29% in 1989 to about 50% in 2007 (Bucks et al., 2009). Second, during this same time period, economic news became more popular in the U.S. The media environment in the U.S. has changed dramatically over the last several decades. The decline of traditional media sources and the rapid increase of new media are conspicuous: Newspaper circulation has declined as more people watch the news on cable networks or via the Internet compared to network news. In terms of economic news, people can receive more specific information at a lower cost than ever before. For example, CNBC, a satellite and cable business news channel, 8 was established in 1989; by October, 2010 about 98 million households watched this channel.2 In addition, the government regularly reports official unemployment and inflation rates, but not every day. In contrast, stock market indicators such as the Dow Jones industrial average and S&P 500 constantly change, and the media report on what is happening in the market at least once a day. Moreover, stories about the stock market are mentioned in a variety of types of newspaper articles. People use “Wall Street” to describe not only the stock market per se, but also politicians (e.g., “He is from Wall Street, not Main Street”) or culture (e.g., Gordon Gekko in the movie, “Wall Street”). http://tvbythenumbers.zap2it.com/2010/11/12/cable-news-ratings-for-thursday-november-112010/71919 2 9 Chapter 3 Empirical Study 3.1 Data Many previous studies have used newspapers to analyze how the media de- scribe the national economy and affects people’s understanding of the economy, presidential approval, and voter choice. For instance, Goidel and Langley (1995) examined The New York Times front pages from 1982 to 1992 and found that the articles followed negative economic conditions more often than positive economic conditions. In this respect, news coverage is linked to the public’s evaluation of the economy. Although traditional media have been in decline over the last decade as fewer people read newspapers or watch network news than they used to, traditional media’s influence is still significant. In addition, when we analyze the changes of news coverage over time, it is better to analyze the content of traditional media than new media because no data on new media from the 1980s exist. In addition, it is not appropriate to analyze business newspapers or business cable channels because ordinary people do not use these sources as major news sources. Rather, it is important to know how the traditional media describe the stock market because these sources still reach a very large portion of the middle-class; indeed, during most of this time frame (1981-2011) newspapers were still one of the most important means of acquir- 10 ing news. I chose to analyze The New York Times because it remained the premier paper for political, economic, and business news during this time. Using LexisNexis Academic and focusing on The New York Times Section A1 from 1981 to 2011, I collected 11,035 articles using three keyword phrases: “Wall Street,” “stock market,” and “Dow Jones.” The top of Figure 3.1 shows some important trends. When the stock market declined dramatically, the number of articles increased. For instance, the average number of articles was fewer than 150 each year from 1981 to 1986. However, after Black Monday in 1987, the number increased rapidly in 1987 and 1988. A similar trend occurred in 2008 and 2009 after the financial crisis. The New York Times features eight main desks: Editorial, Foreign, National, Business/Financial, Metropolitan, Cultural, Style, and Sports.2 In general, economic news is issued from the Business/Financial Desk while political news is issued from National Desk. Therefore, if the stock market is purely economic news, the ratio of articles from the Financial Desk should be higher than those from other desks. On the other hand, if the stock market becomes political news, the ratio of articles from the National Desk should increase over time. In order to verify whether or not this trend exists in The New York Times, I sorted each article in Section A based on desks; the bottom of Figure 3.1 shows the ratio of stock market articles. According The New York Times is organized by sections; most business and economic news is located in the Business Section. However, I only analyzed Section A for two reasons. First, Section A is the main section; it includes the front page and the main articles. Second, the purpose of this study is to analyze how the news of the stock market described in political contexts and political news is located in Section A. 2 There are twenty three desks in total. 1 11 to this figure, stock market news is not only featured at the Financial Desk; all desks discuss the stock market and Wall Street. In the 1980s, the Business/Financial Desk (Financial Desk hereafter) and Editorial Desk reported stock market news. However, the ratio of Financial Desk declined in the early 1990s while that of the National Desk peaked in 1992 and 1994, which were both election years. In particular, 1992 was the election year in which Clinton won against George H. Bush with the quote, “It’s the economy, stupid.” Another peak period for the National Desk was in 2000 and 2003. It is possible that these peaks stem from the debate about the privatization of social security and the capital gain tax, which was cut by the Bush administration in 2003. Although Figure 3.1 is useful in understanding the structure of stock-related news over time, it does not provide any information regarding the kinds of terms The New York Times chose and reported on, or the topics behind the terms. To determine this information, first I counted each term appearing in the paragraphs that included three keywords. Second, using Wordfish, I estimated the latent trend of articles over time. Finally, I estimated the latent topics of stock market paragraphs using topic models. 12 13 Figure 3.1: (Top) Numbers of Stock Market Articles in The New York Times from 1981 to 2011. (Bottom) Percentage of “Wall Street” Articles in terms of “Desk” from 1981 to 2011 3.2 Method: Wordfish and Topic Models Content analysis has been widely used to study newspaper articles. Tradi- tionally, researchers have employed a manual coding method - a methodology in which researchers create rules for coding in advance and several researchers on the team code each article manually. The advantage of coding by hand is that the researcher can design the original coding scheme as appropriate to the research goal. However, a major drawback to this method is that the quality of the research relies heavily on each coder’s objectivity and ability. In addition, it is difficult to deal with a large dataset with this method. Therefore, in previous studies, researchers only analyzed newspapers within short periods, such as analyzing newspaper articles during campaign periods or a specific area. Recent computer software improvements for content analysis have decreased the cost involved in analyzing text-oriented data. According to Quinn et al. (2010), content analysis includes five methods: reading, human coding, dictionaries, supervised learning, and topic models (the last three methods use computer software for coding text data). As an example of the dictionary method, scholars have used InfoTrend (Fan, 1988) for political science content analyses since the 1990s (Shah et al., 1999; Nadeau et al., 1999). Using InfoTrend, researchers prepare the lexicon that includes the related terms and keywords as well as word relationship rules. In the case of Nadeau et al. (1999), who analyzed presidential campaign articles for 1984, 1988, 1992, and 1996, the researcher created the rules that allowed the program to sort each paragraph based on whether it is for or against the candidates. At the same time, some coders randomly code-select paragraphs and compare them with 14 the results from the program. Compared to other coding methods, topic models (Blei and Lafferty, 2009) and Wordfish (Slapin and Proksch, 2008) allow scholars to analyze a large amount of text data at a low cost (Ho and Quinn, 2008; Quinn et al., 2010; Lowe, 2008; Slapin and Proksch, 2008). Using these methods, scholars do not need a dictionary or code scheme. The significant difference between the previous method and the topic model is that the computer counts a frequency of certain terms in each document and locates them in one or more dimensions. For example, Slapin and Proksch (2008) used the Wordfish package in R, which does not rely on human coding, but counts the frequencies of the terms in articles within groups (e.g., as they manifest in each political party), then estimates each document’s position. The advantage of Wordfish is that the procedure is completely automated, thereby enabling researchers to analyze large numbers of articles. In this report, I initially employ Wordfish by Slapin and Proksch (2008) for two reasons. First, the purpose of this study is to understand whether the stock market has been presented in the newspaper framed as a political issue. If so, I assume that we might see large frequencies of political terms within the same paragraph that include specific keywords (“Wall Street,” “stock market,” and “Dow Jones”). The Wordfish methodology thus supports this study. Second, I use three decades of newspaper articles in the same study. Therefore, a completely automated method is necessary both to retain objectivity and to keep the time costs of research low enough. 15 3.2.1 Assumption of Wordfish Wordfish assumes that each word’s frequency has Poisson distribution. That is: yij ∼ P oisson(λij ) (3.1) λij = exp(αi + ψj + βj ∗ ωi ) (3.2) where yij is the count of word j in year i’s The New York Times articles, α is a set of year fixed effects, ψ is a set of word fixed effects, β is the estimate of a word-specific weight capturing the importance of word j in discrimination between years, and ω is the estimate of year i’s position. In other words, if ω is the same between year i and year i+1, it means that the stock market news is very similar between those years. On the other hand, if ω changed over the years, it means that the stock market news changed during that period. That is, ω and β allow us to understand which words differentiate stock market news between years. 3.2.2 Process To use the news articles in this computer software, we have to convert the articles into computer-friendly text data. Step 1: Extract only paragraphs that include keywords In this method, the unit of analysis is each term. Thus, it is important to eliminate the parts of the articles that are not important for the analysis. The goal of this paper is to analyze the context through which the newspaper has described 16 the stock market. Although each article includes at least one of the three keywords (“Wall Street,” “stock market,” or “Dow Jones”), the initial search does not reveal which paragraph is related to the stock market. Even if the rest of the article is not related to the stock market, the computer counts the frequency of each term in each article, which would bias the results. To counteract this potential for bias, I extracted each paragraph that includes the keyword(s) and created a single text file for each year from 1981 to 2011. Step 2: Convert the paragraphs to computer-friendly text data files After making 31 text files, I converted them to a computer-friendly text data file. In much of the text-mining software, the data file should be in a corpus format. Using the package tm (Feinerer, 2010) in R, I removed numbers, punctuation, and common words in English (e.g., is, you, me). Next, using Snowball (Hornik, 2009), a stemmer package in R, all terms were converted to stems.3 I need to note that I did not use entire articles for my analyses. Rather, I collected articles containing at least one of three keywords using LexisNexis, extracted paragraphs that included the keywords, and created new documents by year. In many cases, a single article included many topics. Suppose there are fifteen paragraphs in one article and “stock market” is only mentioned in the last paragraph. If I want to know the kinds of terms with which “stock market” is mentioned, it is appropriate to analyze only the last paragraph; if I analyze the entire article, there will be many terms that may not be used in relation to the keyword “stock market.” As a result, 3 For example, higher and highest become high. 17 I may not be able to find terms and topics related to the keyword. 3.3 Topic Models Although the Wordfish method is simple and easy to employ via the software, some limitations arose for the current study. First, we do not know what the Wordfish score for ω means intuitively. If each article were categorized by topics in advance, ω would show the change in location of topics over time. For example, Proksch, Slapin and Thies (2011) estimated the party position in Japan using newspaper articles from 1960 to 1998. Before estimating the Wordfish score for ω or party location, Proksch, Slapin and Thies (2011) sorted each document according to three topics: domestic and social policy, economic policy, and foreign policy. However, in the current study, I do not categorize each paragraph and document by topics. Rather, my goal is to find the relationship between stock market terms and political terms without using a pre-coding process. Because each document is a corpus of paragraphs in year i, Wordfish estimates document i’s one-dimensional location over time. In other words, we can see the “change” in documents over time but we do not know what it means. On the other hand, topic models via Latent Dirichlet Allocation (LDA) (Blei and Lafferty, 2009) estimate the probabilistic distribution of terms and find latent topics behind terms used in documents. LDA assumes that we observe word wd in document d, where there are latent topics k and the distribution of topic k is described as βk . The proportion of topics for the dth document is expressed as θd . Finally, topic assignments for the dth document are zd . That is: 18 p(β1:K , θ1:D , z1:D , w1:D ) = K � p(βi ) i=1 � N � n=1 19 D � d=1 p(θd ) � p(zd,n |θd )p(wd,n |β1:K , zd,n ) Chapter 4 Results 4.1 Term Frequencies This section discusses the frequency of important terms in my data. The data provided more than 20,000 words in the paragraphs related to the stock market from 1981 to 2011. However, many words only appeared a few times in certain years, making their calculation inefficient. For this reason, I removed all the words from the files with a zero word count in 95% of the documents. 4.1.1 Wall Street, Stock Market, and Dow Jones Figure 4.1 shows the number of the three keyword phrases (“Wall Street,” “stock market,” and “Dow Jones”) from 1981 to 2011. “Wall Street” has the highest frequency in The New York Times over this entire time period. Although “Dow Jones” indicated no specific trend, two of the keyword phrases (“Wall Street” and “stock market”) demonstrated the same trend: Their frequency rose in 1987, 2002, and 2008. The reason for the increase in 1987 was Black Monday, which occurred in October of that year. The frequency of these two phrases declined during the early 1990s, then increased again during the late 1990s, with two peaks in 1998 and 2002. The numbers declined again after 2003 until the financial crisis of 2008. 20 Figure 4.1: Frequency of “Stock Market,” “Wall Street,” and “Dow Jones” in The New York Times from 1981 to 2011 4.1.2 Presidential candidates Figure 4.2 shows the frequencies of terms related to the presidential candi- dates. Except in 1987, The New York Times did not use “president” in relation to the stock market from 1981 to 1990. However, the frequency of “president” increased gradually during the 1990s, peaking in 2002. It fell again in the mid-2000s and increased again in 2008. This trend could be related to the decline of the stock market. Obviously, the peak in 1987 stems from Black Monday and the peak in 2008-09 is related to the financial crisis. On the other hand, presidential candidates’ names were used in election years in combination with stock market terms. Two spikes occurred during the 2000s - in 2000 and 2002 - which might be related to the debate about the privatization of social security during the elections. These data provide 21 strong evidence that stock news became political news in the 2000s. The financial crisis and the bailout of the financial sector provide additional evidence as well. Figure 4.2: Frequency of Presidential Candidates in The New York Times from 1981 to 2011 4.1.3 Congress, Political Parties The trends related to Congress and the political parties are similar to those related to presidents’ names. Four spikes occurred over the three decades: 1987, 2002, 2008, and 2010. In 1987, “Congress” appeared 56 times, while it appeared 47, 65, and 62 times in 2002, 2008, and 2010, respectively. On the other hand, specific party names did not appear many times during the 1980s but they increased after the 1990s. An interesting phenomenon is that, during the off-peak period, no gap of usage occurred between Congress and party names. However, during the four spikes, 22 the gap increased significantly. Moreover, party names appeared more often than usual in 2002, 2008, and 2010 while “Congress” was used more than party names in 1987. From this phenomenon, I find that stock market news has become partisan since the 2000s. Figure 4.3: Frequency of “Congress” and Political Parties in The New York Times from 1981 to 2011 4.1.4 Policy-related Words Public policies were also mentioned as related topics of the stock market in the newspaper articles. Figure 4.4 shows the frequencies of five keywords (i.e., bailout, mortgage, retire, social security, tax) within this period. “Tax” was mentioned more than the other terms, especially when the stock market declined in 1987, 2002, and 2008. Obviously, “mortgage” spiked after the financial crisis in 2008 due to the sub23 prime problems. Although I expected the frequencies of “social security” and “retire” to increase in 2000 due to the debate about the privatization of social security, no significant trend related to these terms emerged. Based on these results, I find that Figure 4.4: Frequency of Policy Terms in The New York Times from 1981 to 2011 the stock market has been used to describe the economy in The New York Times at an increasing rate since the 1990s. More importantly, it was mentioned in an effort to discuss not only economic issues, but also political issues. In particular, when the stock market fell significantly, the media mentioned the stock market in terms of social and political issues. These results indicate that the stock market was described more often after 1990 than during the 1980s. More importantly, it was mentioned not only in terms of the economy, but also in relationship to political issues. In particular, the stock 24 market is currently an important issue not only for stockowners, but also for nonstockowners as it is related to their retirement and mortgage. 4.2 Wordfish Estimation Wordfish calculates latent positions of each term (Figure 4.5) and document (Figure 4.6). Table 4.11 and Figure 4.5 show the estimated location of terms and their plots, the Wordfish estimation of each word j’s location β and fixed effect ψ. The high score of ψ (y axis) means that the word’s frequency is high. On the other hand, if the word only occurred during certain years, then the word requires a larger absolute number of β (x axis). Because all of the paragraph data include any of the three keyword phrases, their ψ is high. On the other hand, terms related to Congress and political parties appeared over time. In terms of policy-related terms, “bailout” was used in certain years while “tax” and “pension” were commonly used from 1981 to 2011. If many political terms do not distinguish the year, which terms differentiate stock market news between years? To answer this question, first I checked terms with large β and small ψ. This category includes many IT-related terms such as “Facebook” and “Twitter.” It also includes political terms like “occupy” and “debtceil.” How about low β and low ψ terms? Many of these are business-related terms such as “drysdale (Drysdale Government Securities)” and “conoco.” Although “regan” is also in this category, no other political or social terms are included. 1 Table 4.1 shows only the β and ψ from terms in Figure 4.1 to 4.4 25 From these arguments, I find that, although presidential names distinguish stock market news by year, common political terms such as “Congress” or “president” do not. Rather, in the early 1980s, business and economic terms distinguish these years from stock market news in other periods. Finally, news about “occupy (Wall Street)” and “debt ceilings” are uniquely related to the stock market in the 2010s. Table 4.1: the Estimated β and ψ from Select Terms in Figures 4.1 to 4.4 Term β ψ wall 0.615 5.593 street 0.609 5.673 stock 0.119 5.432 dow 0.133 3.777 congress 0.459 2.497 republican 0.733 2.582 democrat 0.754 2.677 reagan -2.494 0.778 clinton 0.146 2.185 bush 0.427 2.857 obama 2.399 -0.363 mccain 1.375 0.074 mondale -2.633 -3.690 president 0.230 1.594 dukakis -1.009 -3.347 kerry 0.643 -3.944 gore -0.138 0.916 tax 0.330 3.425 retire 0.529 0.277 bailout 1.484 0.998 mortgage 0.704 -2.606 social 0.196 2.675 security 0.277 0.787 pension 0.485 1.981 26 Figure 4.5: the Estimated Location of Terms in The New York Times from 1981 to 2011 27 Figure 4.6: the Estimated Location of Stock Market Paragraphs in The New York Times from 1981 to 2011 Wordfish also estimates the location of newspaper articles ωi by year (Figure 4.6). In this case, ω shows the relative locations of stock market articles in each year. There was no significant movement from 1981 to around 2000, while the slope became sharper after 2000. This means that the contents of stock market articles are similar from 1981 to around 2000 but changes year-by-year after 2000. 28 4.3 Topic Model Estimation In order to find latent topics in stock market paragraphs, I used topicmodels package in R (Gruen and Hornik, 2011). Table 4.2-4.7 are the results of LDA with Gibbs sampling. I selected five topics in each year and each topic is expressed by five terms. Because all paragraphs include at least one of three key words (“Wall Street,” “stock market,” “Dow Jones”), it is obvious that many topics are related to these keywords. On the other hand, if there are any political terms in Table 4.2-4.7, then the stock market is mentioned in political topics. From these results, I find that there are some trends in topics, in terms of the time period. First, from the late 1980s to the end of 1990s, most topics are about the stock market per se or economy, while some are about the “president” or “government.” Second, the stock market was described with the topic of Social Security in 1998, 1999, and 2000. Interestingly, there are no political terms in other topics within these three years. Topic 3 in 2000 consists of “social,” “security,” “money,” “stock,” and “Bush.” It is clear that stock market news became Social Security news because George W. Bush proposed a privatization of Social Security in the 2000 presidential election. Third, after the financial crisis in 2008, stock market news were included in a variety of topics, such as specific financial company’s name (e.g., “Lehman Brothers,” “Goldman Sachs”), protest movements (e.g., “Occupy Wall Street”), and topics related to the financial crisis in 2008 (e.g., “bailout”). These results are consistent with the results shown in Figure 3.1. That is, stock market news used to be mainly economic news during the 1980s but became political, policy-related news in later years. In particular, the privatization of social 29 security and the financial crisis in 2008 had significant impacts toward these changes. 30 Table 4.2: Five Topics in the Stock Market Paragraphs from 1981 to 1985 Year 1981 1982 1983 1984 1985 Topic 1 street company investment billion share street prices service cities reagan oil bank million investors yen page months financial hong price market time industry dow close Topic 2 president time republican administration house wall securities million government chase stock market percent jones dow wall street firm bank people street wall analysts investment president Topic 3 wall street rates streets money stock market rates dow average street wall company investment economic public federal service billion power company wall government chief shares 31 Topic 4 stock market dow jones yesterday economic president markets company budget time administration inflation president week economy percent reagan news president wall street companies yesterday million Topic 5 economic reagan financial program budget street wall york economy tax government industrial rates tax commercial stock market rates prices dow stock chairman regan exchange banks Table 4.3: Five Topics in the Stock Market Paragraphs from 1986 to 1990 Year 1986 1987 1988 1989 1990 Topic 1 trading boesky exchange insider investors markets percent dow dollar average wall street firms investment streets market stock trading markets investors street firm company executives american Topic 2 Topic 3 street wall wall street company york million avenue securities properties wall president street york people house firms reagan securities world stock october market crash trading collapse york firm index house percent crash rates yesterday tax october average economic dow bush york stock people market million percent federal markets economy rates 32 Topic 4 Topic 5 yesterday stock stocks market jones markets dow financial average economic stock deficit market budget trading economy investors american plunge federal company markets market percent billion dow securities jones companies average company wall street street investment business million securities economy people wall financial street government drexel business securities money firms city Table 4.4: Five Topics in the Stock Market Paragraphs from 1991 to 1995 Year 1991 1992 1993 1994 1995 Topic 1 Topic 2 street market wall investors firm trading salomon yesterday company executives wall street company wall financial million federal firms securities city street wall business financial company president firms economic city money wall street million wall street firms real company firm financial percent stock economy market bond stocks investors trading billion shares Topic 3 Topic 4 Topic 5 wall stock city president dow investment people percent business government jones bond rating market financial business street stock economic investment market clinton billion percent government companies economy york american dow street market stock investment percent markets firm companies york public million day house jones clinton percent investment stock bond fund market markets companies trading rates money prices investors billion american companies wall markets dow street government jones company federal average million mexico financial analysts mexican 33 Table 4.5: Five Topics in the Stock Market Paragraphs from 1996 to 2000 Year 1996 1997 1998 1999 2000 Topic 1 markets economy money economic federal markets financial currency american economic company investment business banks bank percent dow investors stocks jones percent dow stocks jones average Topic 2 company time investment million mutual wall street york company business percent market dow stocks jones stock market rates economic growth market stock economy prices rates Topic 3 Topic 4 Topic 5 wall market percent street stock dow financial funds jones people companies stocks city government average stock economy percent market rates dow investors federal average money companies stocks government greenspan jones stock stock wall market markets street social investors billion government economy analysts security financial city wall market companies street social markets york investment street banks money company time security day security wall street social street wall money companies york stock financial public bush company million 34 Table 4.6: Five Topics in the Stock Market Paragraphs from 2001 to 2005 Year 2001 2002 2003 2004 2005 Topic 1 Topic 2 Topic 3 street percent wall york financial street city dow companies center day investors wall index analysts market percent corporate stock company bush money markets economy tax dow president people shares economic street percent tax wall companies city exchange york president firms average health million grant budget stock street financial market million economic percent wall time investors firms federal dow executives companies wall morgan percent street company financial time investment stock people executive markets job billion average 35 Topic 4 Topic 5 stock wall market president economy bush markets government economic money wall stock street investors companies enron firms financial analysts time wall stock street market investors markets war time house money york wall company street photo analysts city money fund bank day market president stock security investors government quarter bush price Table 4.7: Five Topics in the Stock Market Paragraphs from 2006 to 2011 Year 2006 2007 2008 2009 2010 2011 Topic 1 street wall goldman firms time wall million companies billion investment crisis economic bailout plan mccain obama economy economic people public wall street money people industry stock market percent investors markets Topic 2 Topic 3 Topic 4 Topic 5 wall stock dow investment street market chief funds business percent average capital house investors executive company york markets executives fund york street stock wall people analysts market street business economy percent investors page firms markets mortgage news cut dow funds york economy percent banks home billion markets firms industry people investors credit main business dow government brothers american day bank york stock bonuses financial business market executives banks life percent goldman money company markets bank government president investors executive firms wall market street york street stock financial business banks percent obama investment financial federal democrats president goldman investors government firm wall street occupy investment street wall street former financial people wall business crisis protests protesters fund banks obama york american 36 Chapter 5 Conclusion Many historical documents have been recently converted to digital files. For example, we are able to analyze the frequency of terms in many books from 1800 to the present via Google.1 With the increase in digital text data, the potential for computer-driven content analysis has been extended. Through the use of digital text data, this report aims to analyze how the stock market has been described in traditional media over the last three decades. In particular, I focus on the political keywords included in the stock market articles using three unique methods. First, I analyzed newspaper article contents from over the last three decades (about 11,000 articles) in Section A of The New York Times. A text mining computer program allowed me to analyze this large text-based dataset, which would have been essentially impossible if manual coding were necessary. Second, I analyzed not only the articles that were categorized as business and economic articles, but also all of the articles that mentioned the stock market in Section A. In other words, I examined the context of the stock market not only as an economic issue, but also as a political or a cultural issue. Finally, using Wordfish and LDA, The result with “Wall Street” is: http://books.google.com/ngrams/graph?content=Wall+ Street&year_start=1800&year_end=2008&corpus=0&smoothing=0 1 37 I estimated the relative locations of each term and latent topics in stock market paragraphs during the last three decades. According to the two estimation methods, I find some characteristics of stock market news during the last three decades. First, in terms of quantity of news, there are two significant events that increase the quantity of news: a decline in the stock market and policy proposals by election candidates. In particular, Black Monday of 1987 and the financial crisis of 2008–2009 had strong impacts on increasing discussion on the topic of the stock market and economy. Additionally, the debate about privatization of Social Security made stock market news into a political topic. The bailout of the financial industry after the financial crisis also extended a range of topics. That is, stock market news consists of not only Wall Street topics, but also topics about the president, political parties, policies, social movements, and ordinary citizens—main street. Finally, in order to extend this study, I have to overcome some issues. First, I would apply this analysis to other media. One possibility is to analyze other national and local newspapers. The other possibility is to analyze other media, such as radio and TV. In that case, I should obtain transcripts of the media that I will analyze. Second, even if I can obtain recent newspapers or transcripts as digital data, there is no guarantee on how long we will be able to go back in history. For example, we can obtain USA Today articles from 1989, while NBC news articles are available from 1997 via LexisNexis. If we want to examine the relationship between stock market news and political news, we need data from a longer time period. 38 Bibliography Avery, Robert B., Glenn B. Canner, Gregory E. Elliehausen and Thomas A. Gustafson. 1984. “Survey of Consumer Finances, 1983.” Federal Reserve Bulletin 70:679–692. Ball-Rokeach, Sandra and Melvin DeFleur. 1976. “A Dependency Model of MassMedia Effects.” Communication Research 3(1):3–21. Barker, David C. 1998. “Rush to Action: Political Talk Radio and Health Care (un)Reform.” Political Communication 15(1):83–97. Blei, David M. and John. D. Lafferty. 2009. Topic Models. In Text Mining: Classification, Clustering, and Applications, ed. Ashok Srivastava and Mehran Sahami. London, England: Taylor and Francis chapter 4, pp. 71–94. Bricker, Jesse, Brian Bucks, Arthur Kennickell, Traci Mach and Kevin Moore. 2011. “Surveying the Aftermath of the Storm: Changes in Family Finances from 2007 to 2009.”. Brody, Richard A. 1991. Assessing teh President: The Media, Elite Opinion, and Public Support. Palo Alto, CA: Stanford University Press. Bucks, Brian K., Arthur B. Kennickell, Traci L. Mach and Kevin B. Moore. 2009. “Changes in U.S. Family Finances from 2004 to 2007: Evidence from the Survey of Consumer Finances.” Federal Reserve Bulletin 95:A1–A55. 39 Conover, Pamela J., Stanley Feldman and Kathleen Knight. 1987. “The personal and Political Underpinnings of Economic Forecasts.” Journal of Politics 31(3):559–583. Della Vigna, Stefano and Ethan Kaplan. 2007. “The Fox News Effect: Media Bias and Voting.” Quarterly Journal of Economics 122(3):1187–1234. Fan, David P. 1988. Predictions of Public Opinion from the Mass Media: Computer Content Analysis and Mathematical Modeling. Westport, CT: Greenwood Press. Feinerer, Ingo. 2010. Text Mining Package. R package version 0.5-4.1. Fiorina, Morris P. 1981. Retrospective Voting in American National Elections. New Heaven, CT: Yale University Press. Goidel, Robert K. and Ronald E. Langley. 1995. “Media Coverage of the Economy and Aggregate Economic Evaluations: Uncovering Evidence of Indirect Media Effects.” Political Research Quarterly 48(2):313–328. Gruen, Bettina and Kurt Hornik. 2011. “topicmodels: An R Package for Fitting Topic Models.” Journal of Statistical Software 40(13):1–30. Harrington, David E. 1989. “Economic News on Television: The Determinants of Coverage.” Public Opinion Quarterly 53(1):17–40. Hetherington, Marc J. 1996. “The Media’s Role in Forming Voters’ National Economic Evaluations in 1992.” American Journal of Political Science 40(2):372–395. Ho, Daniel E. and Kevin M. Quinn. 2008. “Measuring Explicit Positions of Media.” Quarterly Journal of Political Science 3(4):353–377. 40 Hofstetter, Richard C. 1978. “News bias in the 1972 campaign: A cross-media comparison.” Journalism Monographs 58. Hornik, Kurt. 2009. Snowball: Snowball Stemmers. Iyengar, Shanto. 1991. Is Anyone Responsible?: How Television Frames Political Issues. Chicago, IL: University Of Chicago Press. Jackman, Simon. 1993. “Economic Expectations, Realizations, and Presidential Approval.” Presented at the annual meeting of the Midwest Political Science Association, Chicago, IL. Kiewiet, Roderick. 1983. Macroeconomics and Micropolitics: Electoral Effects of Economic Issues. Chicago, IL: University of Chicago Press. Kinder, Donald R. and Roderick Kiewiet. 1979. “Economic Discontent and Political Behavior: The Role of Personal Grievances and Collective Economic Judgments in Congressional Voting.” American Journal of Political Science 23(3):495–527. Kinder, Donald R. and Roderick Kiewiet. 1981. “Sociotropic Politics: The American Case.” British Journal of Political Science 11:129–161. Lewis-Beck, Michael S. 2006. “Does Economics Still Matter? Econometrics and the Vote.” Journal of Politics 68(1):208–212. Lowe, Will. 2008. “Unerstanding Wordscores.” Political Analysis 16(4):356–371. Luskin, Robert C. 1990. “Explaining Political Sophistication.” Political Behavior 12(4):331–361. 41 MacKuen, Michael B., Robert S. Erickson and James A. Stimson. 1992. “Peasants or Bankers? The American Electorate and the U.S. Economy.” American Political Science Review 86(3):597–611. Mutz, Diana C. 1992. “Mass Media and the Depoliticization of Personal Experience.” American Journal of Political Science 36(2):483–508. Nadeau, Richard, Richard G. Niemi, David P. Fan and Timothy Amato. 1999. “Elite Economic Forcasts, Economic News, Mass Economic Judgments, and Presidential Approval.” Journal of Politics 61(1):109–135. Page, Benjamin I. and Robert Y. Shapiro. 1992. The Rational Public: Fifty Years of Trends in Americans’ Policy Preferences. Chicago, IL: The University of Chicago Press. Popkin, Samuel L. 1991. The Reasoning Voter: Communication and Persuasion in Presidential Campaigns. Chicago, IL: The University of Chicago Press. Proksch, Sven-Oliver, Jonathan B. Slapin and Michael F. Thies. 2011. “Party system dynamics in post-war Japan: A quantitative content analysis of electoral pledges.” Electoral Studies 30(1):114–124. Quinn, Kevin M., Burt L. Monroe, Michael Colaresi, Michael H. Crespin and Dragomir R. Radev. 2010. “How to Analyze Political Attention with Minimal Assumption and Costs.” American Journal of Political Science 54(1):209–228. 42 Shah, Dhavan V., Mark D. Watts, David Domke, David P. Fan and Michael Fibison. 1999. “News Coverage, Economic Cues, and the Public’s Presidential Preferences, 1984-1996.” Journal of Politics 61(4):914–943. Slapin, Jonathan B. and Sven-Oliver Proksch. 2008. “A Scaling Model for Estimating Time-Series Party Positions from Texts.” American Journal of Political Science 52(3):705–722. Vavreck, Lynn. 2009. The Message Matters: The Economy and Presidential Campaigns. Princeton, New Jersey: Princeton University Press. Zaller, John R. 1992. The Nature and Origins of Mass Opinion. New York, NY: Cambridge University Press. 43