Download Data Warehouse Concepts - Calgary Oracle Users Group

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
Dr. Randall Dyck
http://rdyck.mynucleus.ca
Calgary Oracle User’s Group
2015 November 15
Data Warehousing and Data Mining
Data Warehouse
Definition – a database containing several tables with totals in them to speed up queries
Reasons to have – several years of data, many departments, many databases,
many servers, XML files, Fixed and comma formatted text files
Consolidated formats – dates, names, case, keys, currencies
Design – In OLTP systems, we have a snowflake design with many joins and
normalized data. In OLAP systems, we have one central fact table,
no joins, many duplicates in the fact table, and many indexes on
the fact table
Architecture – data can be put straight into a data warehouse but it is better if it is put
into a staging area first so that it may be cleaned up before moving it to
the data warehouse
Tools GUI – Designer, Discoverer, Warehouse Builder
Loading – exp/imp, expdp/impdp, transportable tablespaces,
data streams, sql*loader, external tables,
table functions, insert all, merge, regular expressions,
utl files, material views
Maintaining – triggers, material view logs
Tuning – partitions, bitmap indexes, dimensions, outlines, parallels
Data Mining
Analysis types – most common, most common for repeat customers, unusual patterns,
patterns over time, compare to previous periods, related items,
predict sales for current customers, compare to totals,
predict sales for new customers
Tools GUI – Report Builder, Graphics Builder, Analyzer, Data Miner, Express
PLSQL – cursors, arrays
SQL – report writer, correlated joins, rollup, cube, grouping sets,
hierarchical queries, case/decode, regressions, listagg, models,
lead, lag, sum(sum)
Tips sql*loader – use unrecoverable, direct, and parallel options
Simple data warehouse – use material views for fact and dimension tables
Advanced data warehouse – use triggers to move production data to
staging area, triggers in staging area to do cleanups and move
data to fact table, and triggers on fact table to move data to
dimension tables
Web Sites
Data Warehouse Concepts
https://docs.oracle.com/cd/B10501_01/server.920/a96520/concept.htm#43555
SQL*Loader Tutorial
http://orafaq.com/wiki/SQL%2ALoader_FAQ
Oracle Warehouse Builder Tutorials
http://communities.bmc.com/communities/docs/DOC9903;jsessionid=FBCCA512DBA831734CE854B2CE71D728.node0
Oracle Designer Tutorials
http://www.cob.unt.edu/itds/courses/bcis5420/Lectures/Oracle%20Intro%20to%20Designer%20
Abridged.pdf
Data Mining Concepts
www.thearling.com/text/dmtechniques/dmtechniques.htm
Data Mining Tools
http://www.kdnuggets.com/polls/2015/analytics-data-mining-data-science-software-used.html
Oracle Packages (see DBMS and UTL commands)
http://psoug.org/reference/library.html