Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
The SPSS Portfolio Tim Daciuk Services Manager, Canada SPSS Inc. May 5, 2017 1 SPSS At A Glance Leadership Stability 30+ year heritage in analytic technologies Proven track record Market leader in Predictive Analytics Focus on online & offline customer data acquisition and analysis 250,000+ customers worldwide NASDAQ: SPSS Analytics standard 80% of Fortune 500 are SPSS customers 80% plus market share in Survey & Market Research sector Ranked #1 Data Mining solution by KD Nuggets 2 Public Sector Customers Centers for Disease Control Internal Revenue Service DHHS Office of Inspector General TX Comptroller of Public Accounts NY Department of Public Health UK Gov’t Communications Bureau Canada Revenue Agency Department of Justice Centers for Medicare & Medicaid USAF Leaders Project Florida Department of Revenue US Department of State HM Customs & Excise US Army Recruiting Command New York City Human Resources US Joint Forces Command US Dept. of Agriculture 3 Predictive Analytics: Defined Predictive analysis helps connect data to effective action by drawing reliable conclusions about current conditions and future events. — Gareth Herschel, research director, Gartner group 4 Core Technologies 1. 2. 3. 4. 5. 6. Statistical Analysis Data Mining Text Mining Web Analytics Data Collection Deployment 5 Statistical Analysis 6 Descriptive Analysis Analytic software: Data displays (e.g., frequency distributions) Graphic displays of data (e.g. histogram) Measures of central tendency (e.g., mean, median) Estimates of variance (e.g., standard deviation) Satisfaction with service 1-10 80 60 40 Frequency 20 Std. Dev = 1.65 Mean = 8.3 N = 248.00 0 3.0 SPSS product: SPSS Base 4.0 5.0 6.0 7.0 8.0 9.0 10.0 Satisfaction with service 1-10 7 Inferential Analysis Predicting numerical or categorical outcomes Linear regression GLM Multivariate/Repeated Measures Non-linear regression Weighted least squares Two Stage Least Squares Survival Analysis/Cox regression Structural Equation Modeling SPSS products: SPSS Base, Regression Models, Advanced Models & AMOS 8 Reporting Graphical software Visually communicate your results Create more visually compelling information SPSS products: SPSS Base (and Trinity) 9 Powerful Data Management Control Panel for OMS Allows users to turn output into: XML HTML Text SPSS data file 10 Data Mining 11 Where does Data Mining fit? Three classes of data mining algorithms Prediction Association Clustering Cluster Group cases that exhibit similar characteristics. What events occur together? Given a series of actions; what action is likely to occur next? Data Mining Predict Associate Predict who is likely to exhibit specific behavior in the future. 12 Profile and Predict: Supervised Learning Build a predictive profile of the historical outcome using a collection of potential input fields. Credit ranking (1=default) Cat. % Bad 52.01 Good 47.99 Total (100.00) n 168 155 323 Paid Weekly/Monthly P-value=0.0000, Chi-square=179.6665, df=1 Weekly pay Monthly salary Cat. % n Bad 86.67 143 Good 13.33 22 Total (51.08) 165 Cat. % n Bad 15.82 25 Good 84.18 133 Total (48.92) 158 Age Categorical P-value=0.0000, Chi-square=30.1113, df=1 Age Categorical P-value=0.0000, Chi-square=58.7255, df=1 Young (< 25);Middle (25-35) Cat. % n Bad 90.51 143 Good 9.49 15 Total (48.92) 158 Old ( > 35) Cat. % Bad 0.00 Good 100.00 Total (2.17) n 0 7 7 Young (< 25) Middle (25-35);Old ( > 35) Cat. % n Bad 48.98 24 Good 51.02 25 Total (15.17) 49 Cat. % n Bad 0.92 1 Good 99.08 108 Total (33.75) 109 Social Class P-value=0.0016, Chi-square=12.0388, df=1 Management;Clerical Explores all combinations, interactions and contingencies. Use this profile to understand and predict future cases. Cat. % Bad 0.00 Good 100.00 Total (2.48) n 0 8 8 Professional Cat. % n Bad 58.54 24 Good 41.46 17 Total (12.69) 41 13 Cluster and Associate: Unsupervised Learning Find emerging patterns and unusual cases. Use data mining to examine the differences and shifts across all dimensions of the data. Select large groups to identify common patterns. Select small groups to identify unusual patterns. 14 The Product: Clementine 15 Read your data in… 16 Define your Target and Predictors 17 Build a Rule-based Predictive Model 18 Text Mining 19 From Concepts to Predictive Analytics Components LexiQuest Mine Discover concepts, relationships and trends LexiQuest Categorize Linguistic Terminology Extractor Understand documents and assign in pre-defined categories Text Mining for Clementine Add text fields to data mining for better prediction 20 Underlying Technology is Linguistic based Text is: Unstructured Ambiguous Language dependent Linguistic Approach Does not treat a document as a bag of words Removes ambiguity by extracting structured concepts Concepts are the DNA of text 21 Core Technology… Linguistics Based Search Technology SPSS LexiMine Concept Extraction and Query Building Classification Document Management SPSS Categorize Supervised Learning in Existing Systems 22 Text Mining for Clementine Text Mining for Clementine consists of three nodes: Text Mining Source - uses LexiQuest Mine to automatically extract concepts, categories and frequencies from a set of documents Text Mining Process - uses LexiQuest Mine to automatically extract concepts, categories and frequencies from text data stored in a database, and links these results to structured data Document Viewer - displays the document or documents selected from the Text Mining Source node 23 Web Analytics 24 Web Measurement Continuum Insight Value ROI •# Users •# Visits •# Page Views •Top Pages •Top Referrers •# Errors •Recency •Frequency •Average Visit Streak •Campaign Sales •Eventstream •Sectionstream •Predict Likelihood to Respond •Automatic User Segments •Content Clustering •Significant Activity Sequences •Content & Activity Associations •Textbook Visits •Homepage Bouncing PREDICTIVE WEB ANALYTICS WEB STATS Activity Counts WEB ANALYTICS WEB ANALYTICS WEB STATS WEB STATS Business Insight Customer Intimacy 25 Web Mining for Clementine (WM4C) Takes web data preparation directly into Clementine – removes the need for NetGenesis Turns huge volumes of web logs into business events data Allows for very fast deployment of data mining on top of web data 26 Data Collection 27 Dimensions Capabilities 28 Dimensions Objectives Software and Development Platform, not just a set of applications Design Once use Many. Powerful: handles the most complex survey designs and multimodal deployment. Centralized: defines and translates metadata that drives all downstream processes 29 Dimensions Solutions Survey Design Interview Builder – web based (included w/ mrInterview) Data Collection Methods mrInterview data collection engine web-based and/or Call-Center functions available mrPaper – create paper surveys within MS Word mrPaper/mrScan – scan solution with Readsoft EHF CATI – call center solution software mrDialer – call center automated/predictive dialers Palm PDA –data collection with Techneos EntryWare 30 Dimensions Solutions Analysis/ Reporting/ Publishing mrTables – web based reporting and publishing mrStudio – desktop and script based - automate processes Dimensions Component Pack – server processing Interview Reporter for real-time reporting of web data mrTranslate : Managing Multiple Languages Easy to use tool that does not require research knowledge to use Writes directly to the questionnaire metadata Supports all single and double byte character sets European (Spanish, French, etc.) Double Byte (Japanese, Mandarin, Korean, etc.) 31 Deploying SPSS Models 32 Deployment Solutions Data Collection Reporting/Analysis Deployment Existing Data SPSS Web Deployment Framework Survey Data Web Behavior Text Extraction Text Mining for Clementine Web Mining for Clementine Web Based Applications Predictive Marketing Predictive Call Center 33 Summary Predictive analytics Wide range of products Collect data Analyze data Mine data Score and handle data Wide range of applications Predict Group Associate Anomalies 34 Questions ? 35 Contacts Tim Daciuk Services Manager, Canada 416-410-7921 [email protected] Angie Mohr SPSS Sales, Canada 613-599-3377 [email protected] www.spss.com 36