Download Evolution of Data warehousing

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts

Extensible Storage Engine wikipedia, lookup

Database wikipedia, lookup

Entity–attribute–value model wikipedia, lookup

Big data wikipedia, lookup

Clusterpoint wikipedia, lookup

Relational model wikipedia, lookup

Functional Database Model wikipedia, lookup

Database model wikipedia, lookup

Transcript
Cloud-Native Data Warehousing
Bob Muglia
1
Scenarios with affinity for cloud
Gartner 2016
Predictions:
By 2018, six
billion connected
things will be
requesting
support.
Connecting applications, devices, and
“things”
Reaching employees, business partners,
and consumers
Anytime, anywhere mobility
On demand, unlimited scale
Understanding behavior; generating,
retaining, and analyzing data
2
Cloud attributes and requirements
DYNAMIC
Scalable
Elastic
Adaptive
EASY
Lower cost
Faster
implementation
FLEXIBLE
SECURE
Supports many
scenarios
Trust by
design
3
The data has evolved
structured
•
•
•
•
Transactional data
Relational
Fixed schema
Dominant in
traditional
environments
semi-structured
•
•
•
•
Machine-generated
Non-relational
Varying schema
Most common in
cloud environments
4
Today’s reality
Data
Warehouse(s)
Web
IOT
Datamarts
3rd-party
Enterprise apps
Data challenges
Hadoop &
noSQL
Costly, complex
infrastructure
Barriers to
insight
5
The evolution of data platforms
Relational
database
Oracle, DB2,
SQL Server
1980s
Data
warehouse
appliance
Teradata
Data warehouse
& platform
software
Vertica,
Greenplum,
Paraccel, Hadoop,
Redshift
1990s
2000s
Proprietary and Confidential
Cloud-native
data
warehouse
Snowflake
2010s
6
Multiple approaches to scaling
Shared-disk
Shared-nothing
Multi-cluster,
shared data
Scalability limited by
storage contention
Better scalability but
still limited concurrency
Scalability with
Concurrency
Oracle Exadata
Teradata, Netezza,
Greenplum, Vertica,
Hadoop
Snowflake
7
Snowflake’s multi-cluster, shared data
architecture
01010
01101
00011
Cloud Services
Management layer that brings
everything together
Compute
Where queries are processed
Database Storage
Where data loaded into
Snowflake is stored
8
What does Snowflake enable?
Cost effective storage and analysis of GBs, TBs, or even PB’s
Lightning fast query performance
Continuous data loading without impacting query performance
Java
Unlimited user concurrency
Full SQL relational support of both structured and
semi-structured data
>_
Support for the tools and languages you already use
Scripting
9
Possibilities are endless
Companies
providing
easy access
to analytics to
80% of their
employees
Provide ability
to flexibly
combine semistructured and
structured
data in one
place, while
scaling
Driving
attendance
and fan
experience
through
dynamic
analytics
Building data
driven
applications
that provide
secure access
to insights to
11000+
pharmacies
across the
country
10
11