Survey							
                            
		                
		                * Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
The SPSS Portfolio Tim Daciuk Services Manager, Canada SPSS Inc. May 5, 2017 1 SPSS At A Glance  Leadership    Stability   30+ year heritage in analytic technologies Proven track record    Market leader in Predictive Analytics Focus on online & offline customer data acquisition and analysis 250,000+ customers worldwide NASDAQ: SPSS Analytics standard    80% of Fortune 500 are SPSS customers 80% plus market share in Survey & Market Research sector Ranked #1 Data Mining solution by KD Nuggets 2 Public Sector Customers Centers for Disease Control Internal Revenue Service DHHS Office of Inspector General TX Comptroller of Public Accounts NY Department of Public Health UK Gov’t Communications Bureau Canada Revenue Agency Department of Justice Centers for Medicare & Medicaid USAF Leaders Project Florida Department of Revenue US Department of State HM Customs & Excise US Army Recruiting Command New York City Human Resources US Joint Forces Command US Dept. of Agriculture 3 Predictive Analytics: Defined Predictive analysis helps connect data to effective action by drawing reliable conclusions about current conditions and future events. — Gareth Herschel, research director, Gartner group 4 Core Technologies 1. 2. 3. 4. 5. 6. Statistical Analysis Data Mining Text Mining Web Analytics Data Collection Deployment 5 Statistical Analysis 6 Descriptive Analysis Analytic software:  Data displays (e.g., frequency distributions)  Graphic displays of data (e.g. histogram)  Measures of central tendency (e.g., mean, median)  Estimates of variance (e.g., standard deviation) Satisfaction with service 1-10 80 60 40 Frequency  20 Std. Dev = 1.65 Mean = 8.3 N = 248.00 0 3.0 SPSS product: SPSS Base 4.0 5.0 6.0 7.0 8.0 9.0 10.0 Satisfaction with service 1-10 7 Inferential Analysis  Predicting numerical or categorical outcomes  Linear regression  GLM Multivariate/Repeated Measures  Non-linear regression  Weighted least squares  Two Stage Least Squares  Survival Analysis/Cox regression  Structural Equation Modeling SPSS products: SPSS Base, Regression Models, Advanced Models & AMOS 8 Reporting  Graphical software  Visually communicate your results  Create more visually compelling information SPSS products: SPSS Base (and Trinity) 9 Powerful Data Management  Control Panel for OMS  Allows users to turn output into:  XML  HTML  Text  SPSS data file 10 Data Mining 11 Where does Data Mining fit?  Three classes of data mining algorithms    Prediction Association Clustering Cluster Group cases that exhibit similar characteristics. What events occur together? Given a series of actions; what action is likely to occur next? Data Mining Predict Associate Predict who is likely to exhibit specific behavior in the future. 12 Profile and Predict: Supervised Learning  Build a predictive profile of the historical outcome using a collection of potential input fields. Credit ranking (1=default) Cat. % Bad 52.01 Good 47.99 Total (100.00) n 168 155 323 Paid Weekly/Monthly P-value=0.0000, Chi-square=179.6665, df=1 Weekly pay Monthly salary Cat. % n Bad 86.67 143 Good 13.33 22 Total (51.08) 165 Cat. % n Bad 15.82 25 Good 84.18 133 Total (48.92) 158 Age Categorical P-value=0.0000, Chi-square=30.1113, df=1 Age Categorical P-value=0.0000, Chi-square=58.7255, df=1 Young (< 25);Middle (25-35) Cat. % n Bad 90.51 143 Good 9.49 15 Total (48.92) 158 Old ( > 35) Cat. % Bad 0.00 Good 100.00 Total (2.17) n 0 7 7 Young (< 25) Middle (25-35);Old ( > 35) Cat. % n Bad 48.98 24 Good 51.02 25 Total (15.17) 49 Cat. % n Bad 0.92 1 Good 99.08 108 Total (33.75) 109 Social Class P-value=0.0016, Chi-square=12.0388, df=1 Management;Clerical  Explores all combinations, interactions and contingencies.  Use this profile to understand and predict future cases. Cat. % Bad 0.00 Good 100.00 Total (2.48) n 0 8 8 Professional Cat. % n Bad 58.54 24 Good 41.46 17 Total (12.69) 41 13 Cluster and Associate: Unsupervised Learning  Find emerging patterns and unusual cases.   Use data mining to examine the differences and shifts across all dimensions of the data. Select large groups to identify common patterns. Select small groups to identify unusual patterns. 14 The Product: Clementine 15 Read your data in… 16 Define your Target and Predictors 17 Build a Rule-based Predictive Model 18 Text Mining 19 From Concepts to Predictive Analytics Components LexiQuest Mine Discover concepts, relationships and trends LexiQuest Categorize Linguistic Terminology Extractor Understand documents and assign in pre-defined categories Text Mining for Clementine Add text fields to data mining for better prediction 20 Underlying Technology is Linguistic based Text is: Unstructured Ambiguous Language dependent Linguistic Approach Does not treat a document as a bag of words Removes ambiguity by extracting structured concepts Concepts are the DNA of text 21 Core Technology… Linguistics Based Search Technology  SPSS LexiMine  Concept Extraction and Query Building Classification Document Management  SPSS Categorize  Supervised Learning in Existing Systems 22 Text Mining for Clementine Text Mining for Clementine consists of three nodes:  Text Mining Source - uses LexiQuest Mine to automatically extract concepts, categories and frequencies from a set of documents  Text Mining Process - uses LexiQuest Mine to automatically extract concepts, categories and frequencies from text data stored in a database, and links these results to structured data  Document Viewer - displays the document or documents selected from the Text Mining Source node 23 Web Analytics 24 Web Measurement Continuum Insight Value ROI •# Users •# Visits •# Page Views •Top Pages •Top Referrers •# Errors •Recency •Frequency •Average Visit Streak •Campaign Sales •Eventstream •Sectionstream •Predict Likelihood to Respond •Automatic User Segments •Content Clustering •Significant Activity Sequences •Content & Activity Associations •Textbook Visits •Homepage Bouncing PREDICTIVE WEB ANALYTICS WEB STATS Activity Counts WEB ANALYTICS WEB ANALYTICS WEB STATS WEB STATS Business Insight Customer Intimacy 25 Web Mining for Clementine (WM4C)  Takes web data preparation directly into Clementine – removes the need for NetGenesis  Turns huge volumes of web logs into business events data  Allows for very fast deployment of data mining on top of web data 26 Data Collection 27 Dimensions Capabilities 28 Dimensions Objectives  Software and Development Platform, not just a set of applications  Design Once use Many.  Powerful: handles the most complex survey designs and multimodal deployment.  Centralized: defines and translates metadata that drives all downstream processes 29 Dimensions Solutions  Survey Design   Interview Builder – web based (included w/ mrInterview) Data Collection Methods  mrInterview data collection engine       web-based and/or Call-Center functions available mrPaper – create paper surveys within MS Word mrPaper/mrScan – scan solution with Readsoft EHF CATI – call center solution software mrDialer – call center automated/predictive dialers Palm PDA –data collection with Techneos EntryWare 30 Dimensions Solutions  Analysis/ Reporting/ Publishing      mrTables – web based reporting and publishing mrStudio – desktop and script based - automate processes Dimensions Component Pack – server processing Interview Reporter for real-time reporting of web data mrTranslate : Managing Multiple Languages    Easy to use tool that does not require research knowledge to use Writes directly to the questionnaire metadata Supports all single and double byte character sets  European (Spanish, French, etc.)  Double Byte (Japanese, Mandarin, Korean, etc.) 31 Deploying SPSS Models 32 Deployment Solutions Data Collection Reporting/Analysis Deployment Existing Data SPSS Web Deployment Framework Survey Data Web Behavior Text Extraction Text Mining for Clementine Web Mining for Clementine Web Based Applications Predictive Marketing Predictive Call Center 33 Summary  Predictive analytics  Wide range of products      Collect data Analyze data Mine data Score and handle data Wide range of applications     Predict Group Associate Anomalies 34 Questions ? 35 Contacts     Tim Daciuk Services Manager, Canada 416-410-7921 [email protected]  Angie Mohr  SPSS Sales, Canada  613-599-3377  [email protected] www.spss.com 36