Survey
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
B8114-001 Applied Regression Analysis FALL 2013 David Juran Professor Office Location: Uris 218D E-mail: [email protected] Office Hours: Wednesday 12:15 – 2:15 TEACHING ASSISTANTS TBD REQUIRED PREREQUISITES AND CONNECTION TO THE CORE The learning in this course will utilize, build on and extend concepts covered in the following core courses: Core Course Managerial Statistics Connection with Core Applied Regression should primarily be viewed as an extension to the core stats course. We assume and will build upon students’ understanding of these core concepts: Data Analysis (measures of central tendency, dispersion, and association; graphical presentation and summary of data) Probability Theory (Normal, Student’s T, and F probability distributions) Sampling and Interval Estimation Hypothesis Testing Linear Regression Decision Models Optimization using Excel Solver Monte Carlo Simulation Operations Management Page 1 of 3 Service and Make-to-order Operations (Queueing Theory) COURSE DESCRIPTION This course is about the family of statistical data analysis tools called regression analysis. The course can be viewed as a logical successor to the core B6014 Managerial Statistics course, and is most frequently taken by 2nd-year MBA students wishing to solidify and extend their quantitative skills. The purpose of a regression analysis is to build a mathematical/statistical representation of the relationships between variables that can be used for prediction of outcomes and enhanced understanding of causes. When regression is used in a business context the ultimate managerial goal is, of course, better business decisions. Linear regression models are widely used in the business world, as well as in many other fields such as economics, engineering, social research and in the health and biological sciences. In the business world regressions have been used in marketing analyses of customer behavior, in financial analyses of investment opportunities, in human resources to test the fairness of employment policies, in operations to identify the determinants of output quality, and in strategic planning to create sales forecasts. Contemporary computing hardware and modern statistical software has made it extraordinarily easy to produce regression models. For example, Microsoft Excel has a quite powerful regression tool that is very easy to use with absolutely no knowledge of the underlying concepts. Though it has become child’s play to “run a regression,” it is quite a challenge to create a regression model that is really useful and reliable. One might argue that regression is almost too easy to do – it being very easy to mechanically produce “bad” regressions. The goal of this course is to learn how to create good regressions, and how to judge if a good regression is even possible. The course is based on the premise that successful applications of regression require sound understanding of both the underlying theory and the practical problems that are encountered when building and using models of real-life situations with serious consequences. Therefore this course seeks to blend theory and applications effectively, thereby avoiding the extremes of presenting theory in isolation and of giving elements of applications without the foundation concepts that are needed for practical understanding. The course will deal with three topics in an interwoven way. First and most basic, is an approach to data and data analysis that is based on statistical theory, the scientific method and on some very pragmatic epistemology. Second, is regression analysis itself, including extensions to the basic linear model such as logistic regression and some basic multivariate models. Third, is forecasting of time series from historical data. The title of our textbook is descriptive of our approach: regression by example. Each concept and procedure shall be explained, indeed, whenever possible introduced by an example. Moreover, we will emphasize examples in which the business context matters and may influence the analytic approach taken. Computing The course will be very computationally hands-on from the very first lecture. Your laptop computer will be used extensively for data analysis. Most of this work can be done in Excel and we assume a basic familiarity with the data analysis tools of Excel – including the regression tool. However, even though most of the work can be done in Excel, there can be some advantages to using a statistical software package. Moreover, several of the special and important analytic tools --- stepwise regression and logistic regression – cannot be done in Excel. Therefore we will supplement Excel with the Minitab statistical analysis system. Minitab gives us professional statistical analysis capabilities while Page 2 of 3 being easy to learn and use. Any version of Minitab that can do regression, stepwise regression and logistic regression will be adequate. Students who already are familiar with another software package that has the aforementioned capabilities are welcome to use it. (BMDP, SAS, S4, JMPIN) Conduct of the Course Course Project: A major part of the course work will be a data analysis project. The project will be a significant data analysis in a real business context. I will provide a standard project. However, I strongly suggest that students who have particular interests propose their own project. This can be a way of increasing the value you get out of the course. Projects will be due on March 14th. The term project will be an individual effort. Specifications for the final project report will be provided. Workload and Grading: It is expected that students will attend class regularly and participate fully in class discussions. Since many of these discussions will be based on our analytic assignments (mini-cases), it is important that assigned work be done thoroughly and on time. Assignments can be done individually or in teams of two students. The final course grade will be composed of three components: METHOD OF EVALUATION Written Assignments 33% Project 33% Attendance and Participation 33% Textbooks and Software The course will follow the same general outline as the text by Chatterjee, Hadi and Price which is listed below. This book strikes a good balance between providing a theoretical understanding and keeping a very concrete focus on applications. I will, in general use different examples than those in the text. It will thereby offer a second view on most issues and procedures. I recommend purchasing it. In addition to Excel we will use the Minitab statistical package, the software for which comes with a helpful user’s manual. Textbook: Samprit Chatterjee and Ali Hadi. Regression Analysis by Example, 5th edition (Wiley 2012) ISBN: 978-0-47090584-5. The textbook is recommended, not required. It is entirely possible to do very well in this course without buying the textbook. However, it is a very good resource on this subject, and goes into greater depth than we will have time for in the class meetings. Computer Software: Minitab 16. A free 30-day trial version may be downloaded from www.minitab.com ASSIGNMENTS Regular homeworks in this class are of the Type A variety, but with the group size limited to 2 people (group/group, one submission and one grade for both members of the group). You have the option of doing these individually as well, if you choose. The term project is also Type A, but with a larger group. Page 3 of 3