Download Data Access Operations in OLAP

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Sparse Data Storage of Multi-Dimensional Data
for OLAP and Data Mining
Sanjay Goil
Alok Choudhary
Department of Electrical & Computer Engineering and
Center for Parallel and Distributed Computing,
Northwestern University,
Technological Institute
데이터베이스 연구실
석사 2학기
임성은
OLAP Seminar
1
Contents
 Introduction
 Multi-dimensional databases
Multi-dimensional Arrays
Compressed Multi-dimensional Arrays
 Sparse Data Storage
Bit-Encoded Sparse Storage(BESS)
 Other Sparse Storage Schemes
Bit-Index-Value pairs
Offset-Value pairs
Compressed Sparse Dimensions
Sparse-Dense split storage
OLAP Seminar
2
Contents
 Data Access Operations in OLAP and Data Mining
Data Cubes
Basic Operations
 Data Access Performance
Operations
Choosing a sparse structure
 Performance Results
Decoding dimension index values
Retrieval along a dimension and Range Queries
Aggregation along a combination of dimensions
 Conclusions
OLAP Seminar
3
Introduction
Introduction of BESS
retrieval of dimensional index for OLAP operation along
specified dimensions : bit operation
A comparison with other sparse data structure
Analysis and performance results in terms of design
parameters :
number of dimensions, dimension sizes, sparsity of data,
cell size
OLAP Seminar
4
Multi-dimensional databases
Multi-Dimensional databases
OLAP Seminar
5
Multi-dimensional databases
Multi-Dimensional databases
OLAP Seminar
6
Multi-dimensional databases
Multi-Dimensional Arrays
OLAP Seminar
7
Multi-dimensional databases
Compressed Multi-Dimensional Arrays
Figure 3: Storage of data in chunks
OLAP Seminar
8
Sparse Data Storage
Sparse Data Storage
Figure 4: Dense and sparse chunk storage for the datacube
OLAP Seminar
9
Sparse Data Storage
Bit-Encoded Sparse Storage (BESS)
Figure 5: Bit Encoded Sparse Storage
OLAP Seminar
10
Sparse Data Storage
Bit-Encoded Sparse Storage (BESS)
OLAP Seminar
11
Sparse Data Storage
Bit-Encoded Sparse Storage (BESS)
OLAP Seminar
12
Other Sparse Storage Schemes
Index-Value pairs
Offset-Value pairs
Compressed Sparse
Dimensions
Figure 8 : Sparse data structures for a
sample sparse array
(a) Index-Value pairs
(b) Offset- Value pairs
(c) Compressed Sparse Dimensions
OLAP Seminar
13
Other Sparse Storage Schemes
Sparse-Dense split storage
OLAP Seminar
14
Data Access Operations in OLAP
Data Cubes
Pivoting(Rotating) : rotating the cube to change the
dimensional orientation of a report or page on display
Slicing-dicing : selecting some subset of the cube
Roll-up : going up the hierarchy to higher levels of
generation
Drill-Down : traversing the hierarchy from lower to
higher levels of detail
Trend analysis over sequential time periods
OLAP Seminar
15
Data Access Operations in OLAP
Basic Operations
Retrieval of a random cell element
Retrieval along a dimension or a combination of
dimensions
OLAP Seminar
16
Data Access Operations in OLAP
Basic Operations
Retrieval for values of dimensions within a range (Range
Queries)
Aggregation operations on dimensions
OLAP Seminar
17
Data Access Operations in OLAP
Basic Operations
Aggregation operations on dimensions
OLAP Seminar
18
Data Access Operations in OLAP
Basic Operations
Multi-dimensional Aggregation
(Generalization/Consolidation: lower to higher level in
hierarchy)
OLAP Seminar
19
Data Access Performance
Data Access Performance
OLAP Seminar
20
Data Access Performance
Operations
Retrieval of a random cell element
Retrieval along a dimension
Aggregation of a dimension
Choosing a sparse structure
OLAP Seminar
21
Performance Results
Decoding dimension index values
OLAP Seminar
22
Performance Results
Decoding dimension index values
OLAP Seminar
23
Performance Results
Decoding dimension index values
OLAP Seminar
24
Performance Results
Decoding dimension index values
OLAP Seminar
25
Performance Results
Retrieval along a dimension and Range Queries
OLAP Seminar
26
Performance Results
Aggregation along a combination of dimensions
OLAP Seminar
27
Conclusions
Performance results show better performance for
OLAP queries using BESS over other storage
schemes.
OLAP Seminar
28
Related documents