Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Dr. Randall Dyck http://rdyck.mynucleus.ca Calgary Oracle User’s Group 2015 November 15 Data Warehousing and Data Mining Data Warehouse Definition – a database containing several tables with totals in them to speed up queries Reasons to have – several years of data, many departments, many databases, many servers, XML files, Fixed and comma formatted text files Consolidated formats – dates, names, case, keys, currencies Design – In OLTP systems, we have a snowflake design with many joins and normalized data. In OLAP systems, we have one central fact table, no joins, many duplicates in the fact table, and many indexes on the fact table Architecture – data can be put straight into a data warehouse but it is better if it is put into a staging area first so that it may be cleaned up before moving it to the data warehouse Tools GUI – Designer, Discoverer, Warehouse Builder Loading – exp/imp, expdp/impdp, transportable tablespaces, data streams, sql*loader, external tables, table functions, insert all, merge, regular expressions, utl files, material views Maintaining – triggers, material view logs Tuning – partitions, bitmap indexes, dimensions, outlines, parallels Data Mining Analysis types – most common, most common for repeat customers, unusual patterns, patterns over time, compare to previous periods, related items, predict sales for current customers, compare to totals, predict sales for new customers Tools GUI – Report Builder, Graphics Builder, Analyzer, Data Miner, Express PLSQL – cursors, arrays SQL – report writer, correlated joins, rollup, cube, grouping sets, hierarchical queries, case/decode, regressions, listagg, models, lead, lag, sum(sum) Tips sql*loader – use unrecoverable, direct, and parallel options Simple data warehouse – use material views for fact and dimension tables Advanced data warehouse – use triggers to move production data to staging area, triggers in staging area to do cleanups and move data to fact table, and triggers on fact table to move data to dimension tables Web Sites Data Warehouse Concepts https://docs.oracle.com/cd/B10501_01/server.920/a96520/concept.htm#43555 SQL*Loader Tutorial http://orafaq.com/wiki/SQL%2ALoader_FAQ Oracle Warehouse Builder Tutorials http://communities.bmc.com/communities/docs/DOC9903;jsessionid=FBCCA512DBA831734CE854B2CE71D728.node0 Oracle Designer Tutorials http://www.cob.unt.edu/itds/courses/bcis5420/Lectures/Oracle%20Intro%20to%20Designer%20 Abridged.pdf Data Mining Concepts www.thearling.com/text/dmtechniques/dmtechniques.htm Data Mining Tools http://www.kdnuggets.com/polls/2015/analytics-data-mining-data-science-software-used.html Oracle Packages (see DBMS and UTL commands) http://psoug.org/reference/library.html