Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Cloud-Native Data Warehousing Bob Muglia 1 Scenarios with affinity for cloud Gartner 2016 Predictions: By 2018, six billion connected things will be requesting support. Connecting applications, devices, and “things” Reaching employees, business partners, and consumers Anytime, anywhere mobility On demand, unlimited scale Understanding behavior; generating, retaining, and analyzing data 2 Cloud attributes and requirements DYNAMIC Scalable Elastic Adaptive EASY Lower cost Faster implementation FLEXIBLE SECURE Supports many scenarios Trust by design 3 The data has evolved structured • • • • Transactional data Relational Fixed schema Dominant in traditional environments semi-structured • • • • Machine-generated Non-relational Varying schema Most common in cloud environments 4 Today’s reality Data Warehouse(s) Web IOT Datamarts 3rd-party Enterprise apps Data challenges Hadoop & noSQL Costly, complex infrastructure Barriers to insight 5 The evolution of data platforms Relational database Oracle, DB2, SQL Server 1980s Data warehouse appliance Teradata Data warehouse & platform software Vertica, Greenplum, Paraccel, Hadoop, Redshift 1990s 2000s Proprietary and Confidential Cloud-native data warehouse Snowflake 2010s 6 Multiple approaches to scaling Shared-disk Shared-nothing Multi-cluster, shared data Scalability limited by storage contention Better scalability but still limited concurrency Scalability with Concurrency Oracle Exadata Teradata, Netezza, Greenplum, Vertica, Hadoop Snowflake 7 Snowflake’s multi-cluster, shared data architecture 01010 01101 00011 Cloud Services Management layer that brings everything together Compute Where queries are processed Database Storage Where data loaded into Snowflake is stored 8 What does Snowflake enable? Cost effective storage and analysis of GBs, TBs, or even PB’s Lightning fast query performance Continuous data loading without impacting query performance Java Unlimited user concurrency Full SQL relational support of both structured and semi-structured data >_ Support for the tools and languages you already use Scripting 9 Possibilities are endless Companies providing easy access to analytics to 80% of their employees Provide ability to flexibly combine semistructured and structured data in one place, while scaling Driving attendance and fan experience through dynamic analytics Building data driven applications that provide secure access to insights to 11000+ pharmacies across the country 10 11