Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Chicago Business Objects User Group HANA Vs Sybase IQ Chinni Ranganath, Deloitte June 1, 2012 Perspective Integration with # Prod Inst. Scalability Cost BO and others Query Performance Our Perspective • HANA or Sybase IQ only comes into the picture with ‘Big Data’ • HANA is being built to allow truly real-time analysis on transaction data by handling OLTP and OLAP processing workloads by one instance while IQ is being built to compete in specialized analytics markets competing with Greenplums, Teradatas, Netezzas, Exalytics and ParAccels of the world. Analogy Objective: Is not to show the superiority of one tool versus other, but rather bring some reality and perspective to the HANA discussion 2 Big Data Defined In information technology, big data consists of data sets that grow so large and complex that they become awkward to work with using on-hand database management tools. Difficulties include capture, storage, search, sharing, analytics, and visualizing. This trend continues because of the benefits of working with larger and larger data sets allowing analysts to "spot business trends, prevent diseases, combat crime." Though a moving target, current limits are on the order of petabytes, Exabyte and zettabytes of data. Wikipedia You have a Big Data Challenge/Opportunity, 1. If your data volume is growing into unmanageable levels (petabytes, Exabyte or zettabytes) 2. If your organization is generating variety of unstructured data: email, web, logs, scientific, machine generated and etc 3. If database latency is causing business operational challenges, velocity at which data need to be analyzed for informed real-time decision making. Volume 3 Velocity Variety Big Data Solution(s) Addressing the challenge of Big Data will require a technology or combination of technologies that are capable of : Supporting automatic parallelization such as query optimization, queries across segment servers 5. Linear scalability to linearly scale compute performance Oracle Exalytics 4. SAP HANA Supporting Not Only Structured Query Language (NO SQL) but also for unstructured data HP Vertica 3. Sybase IQ Supporting Massive Parallel Processing (MPP) , i.e., potentially distributed across thousands of heterogeneous processors EMC Greenplum 2. IBM Netezza Appliance Supporting large volumes of data such as petabytes and Exabyte. Hadoop Cassandra 1. Big Data Deloitte’s Point Of View We do not view In-Memory and In-database technologies as overlapping, but rather as complementing technologies having distinct roles to play in ‘Big Data Solution Framework’ depending upon the analytical application and business priorities. 4 What is in-memory technology/computing? • Brings data close to the CPU for quick reads and/or writes • Stores the data off disks into the system's main memory which significantly minimizes the overall time taken by the CPU to access data due to the reduced I/O for retrieving data. • Utilizes a memory resident database for data management and access. Similar to traditional database management systems, In-Memory database management also supports the standard atomicity, consistency, isolation, durability (ACID) properties. 5 * Source: SAP HANA Overview & Update presentation What is in-database technology/computing? • In-database processing, sometimes referred to as in-database analytics, refers to the integration of data analytics into data warehousing functionality. Today, many large databases, such as those used for credit card fraud detection and investment bank risk management, use this technology because it provides significant performance improvements over traditional methods. – Wikipedia • Traditional approaches to data analysis require data to be moved out of the database into a separate analytics environment for processing, and then back to the database. Indatabase processing moves processing to database thus avoiding physical movement of data into separate analytics environment for processing. Doing the analysis in the database, where the data resides, eliminates the costs, time and security issues associated with the old approach by doing the processing in the data warehouse itself • With Sybase IQ in-database analytics enterprises and application vendors answer complex questions without having to move mountains of data to 3rd party tools. With hundreds of statistical and data mining techniques, advanced text analytics capabilities, and APIs to execute proprietary algorithms safely inside Sybase IQ, data scientists can gain insights in unparalleled time. And with fast, accurate insights enterprises can quickly make the decisions • Sybase IQ supports a DBLytix library from Fuzzy Logix containing hundreds of advanced analytic, statistical and data mining algorithms that can run inside Sybase IQ. 6 HANA – How it works 8 SAP HANA 1. 2. 3. Real Time is anything that is too fast for your current ETL (Kimball) Requires integration of data and events from operational processes/systems in real-time Is a Just-In-Time Information Infrastructure providing real-time insights into operational events An adaptive enterprise with the ability to manage more effectively and optimize daily business activities by integrating operational processes 9 10 11 12 Sybase – How it works In-Database Analytics – Sybase IQ • Sybase IQ is a highly optimized analytics server, designed specifically to deliver • ultra-high-speed business intelligence and reporting on standard hardware and • operating systems. • Unlike traditional databases, Sybase IQ is architected for analytics—not transactions—with a column-based structure. • Sybase IQ provides a reduction in disk and CPU requirements (by reducing I/O bottlenecks) compared to traditional row-based RDBMS systems that have to be retro-fitted to support Data Warehousing and Analytics. 14 * Source: Sybase IQ Technical Overview 15 * Source: Sybase IQ Technical Overview 16 * Source: Sybase IQ Technical Overview HANA and Sybase IQ compared Compare HANA and Sybase IQ HANA • Business Analytics in-memory computing technology that comprises the SAP Business Analytic Engine (BAE) — an in-memory columnar data store with compression technology, and library of statistical and data mining algorithms via native support to ‘R’ functions in HANA, combined with optimized hardware from various partners. • For High performance-low latency Sybase IQ • Business Analytics in-database computing technology that comprises of shared everything, MPP , columnar data store and analytics DBMS engine to analyze structured, semi-structured, unstructured data with native MapReduce API, comprehensive and flexible Hadoop Integration, PMML Support and an expanded library of statistical and data mining algorithms that leverage the power of distributed query processing across Massively Parallel Processing Grid. • High performance without latency issues BEx BI tools MDX Analytics Engine BO BI Tools BI Tools BICS MDX Calculation Engine In-memory Engine Datawarehousing Platform DS** BW RTDS* HANA Source Systems Source Systems Source Systems (OLTP, DW, 3rd) Data Flow *Images Source: SAP 18 Read Compare HANA and Sybase (cont.) HANA Sybase IQ Technical Use Cases Technical Use Cases • Real time operations analysis • Historical Analysis • Rapid creation of analytic models without impacting established BI environment • Predictive Analysis • Mash up of data from multiple sources • Unstructured Data Processing • Enabled for all BO BI Tools (“Aurora” BO 4.0) • Enabled for ‘Big Data’ Analytics Business Use Cases Business Use Cases • Point of Sales • Operational Continuity • Demand Signal Repository • Demand Forecasting • Market Measurements Analysis • Customer Churn Analysis • Traffic Analysis • Operational Intelligence • Liquidity Risk Management • Point of Sales • Situational Awareness • Market Measurement Analysis • Operational Intelligence • Real-Time Business Intelligence 19 • Text Analytics Utilities Case Study HANA in action Real-Time Enterprise Convergence of Robust Technologies in-memory Business Events Business Processes Robust business processes and business rules management Powerful event processing technology Business Intelligence State of the art analytics HANA In-Memory SAP Event Insight SAP Business Process Management SAP Business Rules Management SAP BOBJ Xcelsius SAP BOBJ Crystal Reports Examples Examples Examples • • • • • 21 Purchase orders Complaint logs News feeds Temperature readings Meter data • • • • Automated BPM&BRM response Process steps with real-time reports Instant notifications Approval Process • Email and RSS Feeds notifications • Real-time Dashboards • Integration with decision support and historical data Real Time Enterprise Architecture Slide 22 Outage Scenario – Solution Workflow Mike Control Center Operator Action Dist. SCADA Alert indicating outage Mike is alerted with an outage event from SCADA systems 1> Power Outage Detected Power to be restored to minimize cost. Event correlation suggests transformer overheat problem Decision Meters are tending to RED. Mike is alerted with the list of “Zero Read Meters” indicating outage 1> Dispatch Work Force with appropriate equipment and skillset 2> Communication Outage Detected No Power Outage Detected but network outage detected Decision Event Insight Insight Trouble Ticket Volume Mike is alerted with increasing volume of customer trouble tickets in CRM system 2>Since no power outage is detected, notify concerned dept to repair comms network problems 1> Mike selects the ‘Create Service Notification’ option which initiates the process in ERP (via Workforce Mgmt System) 2> Mike sends an Alert Notification to comms team reporting problem with communication network Create Service Notification Dispatch Outage Service Order Closed Email Notification Sent with Context Details Bottom Line: Real-time insight, context data, recommended decision steps and linkage into exception process handling all work23hand in hand to resolve outage and minimize the cost and impact of outage and reduce restoration time. Supply Chain Case Study HANA in action Scenario # 2 Perfect Order Visibility, Insight & Action Action 1> ‘High’ priority VMI order She selects this in danger of missing SLA, root-cause analysis suggests carrier breakdown Decision The Perfect Order Metric is tending to Red. Ann Smith is alerted with the list of “Problem Orders” in danger of SLA non-compliance 1>Rush Order Delivery with an alternate carrier recommended 2> ‘Medium’ priority order She selects this in danger of missing SLA, trend analysis suggests ‘inclement weather’ Decision Event Insight Insight Ann Smith Sales Operations Manager 2>Since no delivery as been yet initiated, notify delivery to change mode of transport from ‘Sea’ to ‘Air’ Freight 1>Ann Smith selects the ‘Rush Order Delivery’ option which initiates the process in ERP (via BPM) 2>Ann Smith sends an Alert Notification to delivery recommending the change in mode of transport Create rush Delivery w new carrier Pick Pack Ship Billing & AR POD Email Notification Sent to Delivery with Context Details Bottom Line: Real-time insight, context data, recommended decision steps and linkage into exception process handling all work25hand in hand to resolve perfect order inefficiency and meet customer SLA Appendix 27 28 29 30 31 Thank you, Chinni Ranganath 732.325.5155 [email protected] © 2011 Deloitte Touche Tohmatsu Limited Deloitte refers to one or more of Deloitte Touche Tohmatsu Limited, a UK private company limited by guarantee, and its network of member firms, each of which is a legally separate and independent entity. Please see www.deloitte.com/about for a detailed description of the legal structure of Deloitte Touche Tohmatsu Limited and its member firms. This publication is for internal distribution and use only among personnel of Deloitte Touche Tohmatsu Limited, its member firms, and their related entities (collectively, the “Deloitte Network”). None of the Deloitte Network shall be responsible for any loss whatsoever sustained by any person who relies on this publication.