Sparse Data Storage of Multi-Dimensional Data for OLAP and Data Mining

Sanjay Goil, Alok Choudhary
Department of Electrical & Computer Engineering and Center for Parallel and Distributed Computing, Northwestern University, Technological Institute

Presenter: 임성은, second-semester master's student, Database Laboratory
OLAP Seminar

Contents
    Introduction
    Multi-dimensional databases
        Multi-dimensional Arrays
        Compressed Multi-dimensional Arrays
    Sparse Data Storage
        Bit-Encoded Sparse Storage (BESS)
    Other Sparse Storage Schemes
        Index-Value pairs
        Offset-Value pairs
        Compressed Sparse Dimensions
        Sparse-Dense split storage
    Data Access Operations in OLAP and Data Mining
        Data Cubes
        Basic Operations
    Data Access Performance
        Operations
        Choosing a sparse structure
    Performance Results
        Decoding dimension index values
        Retrieval along a dimension and Range Queries
        Aggregation along a combination of dimensions
    Conclusions

Introduction
    Introduction of BESS
        Retrieval of dimension indices for OLAP operations along specified dimensions: bit operations
    A comparison with other sparse data structures
    Analysis and performance results in terms of design parameters: number of dimensions, dimension sizes, sparsity of data, cell size

Multi-dimensional databases
    Multi-Dimensional databases
    Multi-Dimensional Arrays
    Compressed Multi-Dimensional Arrays
        Figure 3: Storage of data in chunks

Sparse Data Storage
    Sparse Data Storage
        Figure 4: Dense and sparse chunk storage for the datacube
    Bit-Encoded Sparse Storage (BESS)
        Figure 5: Bit Encoded Sparse Storage

Other Sparse Storage Schemes
    Index-Value pairs
    Offset-Value pairs
    Compressed Sparse Dimensions
        Figure 8: Sparse data structures for a sample sparse array (a) Index-Value pairs (b) Offset-Value pairs (c) Compressed Sparse Dimensions
    Sparse-Dense split storage

Data Access Operations in OLAP
    Data Cubes
        Pivoting (rotating): rotating the cube to change the dimensional orientation of a report or page on display
        Slicing and dicing: selecting some subset of the cube
        Roll-up: going up the hierarchy to higher levels of generalization
        Drill-down: traversing the hierarchy from lower to higher levels of detail
        Trend analysis over sequential time periods
    Basic Operations
        Retrieval of a random cell element
        Retrieval along a dimension or a combination of dimensions
        Retrieval for values of dimensions within a range (Range Queries)
        Aggregation operations on dimensions
        Multi-dimensional Aggregation (Generalization/Consolidation: lower to higher level in hierarchy)

Data Access Performance
    Operations
        Retrieval of a random cell element
        Retrieval along a dimension
        Aggregation of a dimension
    Choosing a sparse structure

Performance Results
    Decoding dimension index values
    Retrieval along a dimension and Range Queries
    Aggregation along a combination of dimensions

Conclusions
    Performance results show that BESS outperforms the other storage schemes for OLAP queries.
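
The slides describe BESS as retrieving dimension indices with bit operations, but the figures showing the layout did not survive. The following is a minimal Python sketch of that idea, not the authors' implementation. It assumes that each dimension index is packed into a fixed-width bit field of one integer key per non-empty cell, and that a field is read back with a shift and a mask. The class name `BessSketch`, its methods, the field order, and the sample cube sizes are all hypothetical and chosen only for illustration.

```python
class BessSketch:
    """Toy bit-encoded sparse store: one packed integer key per non-empty cell.

    Illustrative only; the layout is an assumption, not the paper's exact format.
    """

    def __init__(self, dim_sizes):
        # Bits per dimension: enough to hold indices 0 .. size-1.
        self.widths = [max(1, (size - 1).bit_length()) for size in dim_sizes]
        self.masks = [(1 << w) - 1 for w in self.widths]
        # Assumed field order: the last dimension sits in the lowest bits.
        self.shifts = []
        shift = 0
        for w in reversed(self.widths):
            self.shifts.insert(0, shift)
            shift += w
        self.cells = []  # (packed_key, value) pairs for non-empty cells only

    def encode(self, index):
        """Pack a tuple of dimension indices into a single integer key."""
        key = 0
        for i, s in zip(index, self.shifts):
            key |= i << s
        return key

    def decode_dim(self, key, dim):
        """Extract one dimension index with a shift and a mask."""
        return (key >> self.shifts[dim]) & self.masks[dim]

    def decode(self, key):
        """Unpack every dimension index from a key."""
        return tuple(self.decode_dim(key, d) for d in range(len(self.widths)))

    def insert(self, index, value):
        self.cells.append((self.encode(index), value))

    def range_query(self, dim, lo, hi):
        """Return cells whose index along `dim` lies in [lo, hi]."""
        return [(self.decode(k), v) for k, v in self.cells
                if lo <= self.decode_dim(k, dim) <= hi]

    def aggregate(self, keep_dims):
        """Sum values over all dimensions except those in `keep_dims`."""
        totals = {}
        for k, v in self.cells:
            group = tuple(self.decode_dim(k, d) for d in keep_dims)
            totals[group] = totals.get(group, 0) + v
        return totals


# Hypothetical 4 x 3 x 5 cube with three non-empty cells.
cube = BessSketch((4, 3, 5))
cube.insert((0, 0, 1), 10.0)
cube.insert((2, 1, 4), 5.0)
cube.insert((2, 2, 0), 7.0)
print(cube.encode((2, 1, 4)))    # 76 = (2 << 5) | (1 << 3) | 4
print(cube.range_query(0, 2, 2))
print(cube.aggregate([0]))
```

In this sketch, retrieval along a dimension and aggregation both touch only the one field they need in each key, which is the kind of access the slides list under "Basic Operations". A linear scan over the cells is used for simplicity; the sketch makes no claim about the relative performance of the schemes compared in the paper.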