Download Discussion Topic 3 Question - Answer

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
BIS4435 Discussion Topic 3 Question on DW and Operation database
By Nawaz Khan
Can you apply data mining tools on operational database for business intelligence or it is
absolutely necessary to have a data warehouse in order to apply data mining tools for
business intelligence.
Discuss and give feedback to others.
BIS4435 Discussion Topic 3 Answers on DT3: DW and Operation database
BIS4435 Discussion Topic 3 Answer by Laith Gharib
The question is “Why do we need to use a separate database to perform data mining and
reporting?” Many companies include major task on-line data mining tools and knowledge
management that work directly on operational data, some of these tools begin to confuse
us between the databases and warehouses.
Because most people are familiar with commercial related database systems, it is to
understand what a data warehouse is by comparing these two kinds of systems.
The major take of on-line operational database systems is to perform online transactions
processing (OLTP) systems. They cover most of the day to day operations of an
organisation, such as purchasing, inventory, manufacturing, banking, payroll and
accounting. Data warehouse systems, on the other hand, serve users or knowledge
workers in the role of data analysis and decision making. Such systems can organise and
present data in various formats in-order to accommodate the diverse needs of the
different users. These systems are known as online analytical processing OLAP systems.
I feel I need to talk about the major distinguishing features between OLTP and OLAP
which I have summarised as follow:
Users and system orientation: an OLTP system is customer oriented and is used for
transaction and query processing by clerks, clients, and information technology
professionals. An OLPA system is market oriented and is used for data analysis by
knowledge workers, including managers, executives and analseist.
Data contests: an OLTP system manages current data that, typically, are too detailed to
be easily used for decision making. An OLAP system manages large amounts of historical
data, provides facilities for summarization and aggregation and stores and manages
information at different levels of granularity. These features make the data easier to use
in informed decision making.
Database design: an OLTP system usually adopts and entity relationship data model and
an application oriented database design. And OLAP system typically adopts either a star
or snowflake model and a subject oriented database design.
View: an OLTP system focuses mainly on the current data within an enterprise or
department, without referring to historical data or data in different organisation. In
contrast, an OLAP system often spans multiple versions of a database schema. Due to the
evolutionary process of an organisation, OLAP systems also deal with information that
originates from different organisations, integrating information from many data stores.
Because of the huge volume, OLAP data are stored on multiple storage media.
Access patterns: the access pattern of an OLTP system consists mainly of short, atomic
transactions. Such a system requires currency control and recovery mechanisms,
however, access to OLAP systems, are mostly read only operations.
Summary:
Because operational databases store huge amount of data, you may wonder “why not
perform online analytical processing directly on such databases instead of spending
additional time and resources to construct a separate data warehouse?” a major reason
for a separation is to help promote the high performance of both systems. An operational
database is designed and tuned from known tasks and work loads, such as indexing and
hashing using primary keys searching for particular records and optimizing “canned”
queries. On the other hand data warehouse queries are often complex. They involve the
computation of large groups of data at summarised levels and may require the use of
special data organisation, access, and implementation methods based on multi
dimensional views. Processing OLAP queries in operational databases would substantially
degrade the performance of operational tasks.
Moreover, an operational database supports the concurrent processing of multiple
transactions. Concurrency control and recovery mechanism, such as locking and logging,
are acquired to ensure the consistency and robustness of transaction. An OLAP query
often needs read only access of data records for summarisation and aggregation.
Concurrency control and recovery mechanisms, if applied for such OLAP operations, may
jeopardise the execution of concurrent transactions and thus substantially reduce the
throughputs of an OLTP system.
References:
Data Mining: concepts and techniques-second edition- Jiawei Han and Micheline Kamber.
Logical database design principles: John Garmany, Jeff Walker and Terry Clarck.
BIS4435 Discussion Topic 3 Answer by Faith Ime Abakada
Data Mining tools can be applied not only on data warehouses but also on operational
database .
The ‘but’ there is whether operational database or data warehouse allows the application
of data mining tools for Business intelligence.
The basic difference between operational databases and data warehouses is that
operational databases are designed to support transaction processing whereas data
warehousing systems are designed to support online analytical processing (OLAP) which
elaborates report generation unlike that of the operational database that has minimal
back-end reporting. As a result, data warehouses are designed and optimized using
methodologies that drastically differ from that of operational database systems.
My response to the question would be that in as much as data mining can be used on
both, when Business Intelligence is included data mining tools would be more efficient
when applied on data warehouses.
Data warehousing is a decision support database maintained separately from the
organization’s operational database.
Data mining tools are designed to search through detailed historical data to identify
hidden patterns that are not obvious to humans or query tools and in the process, reveal
intelligence that can be integrated into business processes to provide predictive
capabilities for improving strategic business decision making.
A lot of companies in many industries such as financial services, telecommunications,
retail, and market research, business intelligence is a critical resource for decisionmakers.
Data warehouse is essential for business Intelligence because it provides the company
management with historical data from both internal and external sources whereas a day
to day operational database excludes external data that can be used to weigh and predict
the performance and future performance of the company and also how to attain such
targets.
In summary I would like to quote a Gartner Group report which emphasis on the
necessity of mining tools for Business Intelligence. ‘‘Information Democracy will emerge
in forward-thinking enterprises, with Business Intelligence information and applications
available broadly to employees, consultants, customers, suppliers, and the public. The
key to thriving in a competitive marketplace is staying ahead of the competition. Making
sound business decisions based on accurate and current information takes more than
intuition. Data analysis, reporting, and query tools can help business users wade through
a sea of data to synthesize valuable information from it - today these tools collectively fall
into a category called Business Intelligence.’’
http://www.scribd.com/doc/18638941/Data-Warehousing-and-Data-Mining
http://www.rpi.edu/datawarehouse/dw-about-cmp.html
BIS4435 Discussion Topic 3 Answer by Afiff Bahalwan
First of all, before answering the question, let us see the difference between operational
database and data warehouse.
Operational database can be defined as ‘the database-of-record, consisting of systemspecific reference data and event data belonging to a transaction-update system’
(geekinterview.com, 2009). This kind of database is created in order to handle daily
transactions for a business. Therefore, operational database is continuously updated and
reflect the current detail of the last transactions.
While data warehouse, as described by Inmon (1993), is ‘a subject-oriented, integrated,
time-variant, and non-volatile collection of data in support of management’s decisionmaking process’. In other words, data warehouse can be defined as a repository of an
organisation’s electronically stored data. Data warehouses are designed to provide
efficient analytical reporting. Instead of handling daily transactions, data warehouse is
developed to store historical data, for instance the data of the company within the last 5
years.
‘In the data warehouse, operational databases are not accessed directly to perform
information processing’ (gantthead.com, 2009). In fact, the operational database is the
source of data for the data warehouse. The main difference is operational database is
designed for real time business operations, while data warehouse is designed for analysis
of business measures.
To sum up, the answer for the question is yes, we can apply data mining tools on both
operational database and data warehouse. However, data mining tools would work
efficiently when applied on data warehouse. Data mining is the process of extracting data
in order to find patterns and trends. For that reason, Connolly (2005) says that data
mining requires a single, separate, clean, integrated, and self-consistent source of data.
Extract, Transform, and Load (ETL) data from the operational database into the
warehouse, in which the data is cleaned and organised into a meaningful manner, are the
critical steps in constructing a data warehouse. It becomes clear that data inside the
warehouse must have been already cleaned and organised. As a result, many
organisations are using data mining tools on a data warehouse for business intelligence.
References:
Connolly, T. M. & Begg, C. E. (2005). Database Systems: A Practical Approach to Design,
Implementation, and Management. Pearson Education: Harlow.
http://www.geekinterview.com/kb/Operational-Database.html
http://www.gantthead.com/content/processes/9076.cfm
BIS4435 Discussion Topic 3 Answer by Althea Walters
Before answering the question let us examine the difference between operational
database and Data warehouse.
Operational database is a “database-of-record, consisting of system-specific reference
data and event data belonging to a transaction-update system.” (www.metro-designdev.com, 2009) In other word, operational database contains data that is essential to the
day-to-day operation of the business. This data consistently changes through updates
and shows the current value of the last details.
Data warehouse is consisting of “informational database used to store sharable data
sourced from an operational database-of-record.”( www.metro-design-dev.com,2009) In
other word it is a repository of historical data that is used to forecast trends, analyze
reports and supports management decision making- stores historical data, of the
organization.
Therefore the answer is yes we can apply data mining tools to both operational data and
data warehouse since operational databases are the source data for data warehouses.
However, with operational data base the information extracted will be contradictory since
the values of operational databases are constantly changing through daily updates. As
such, according to wikipedia data-mining (“which is the process of extracting hidden
patterns and trends from data”) are much more efficient for data warehouses, since the
larger pool of information is much more consistent, and can be better used to analyze
reports and support management decision. (www.en.wikipedia.com, 2009)
References
www.metro-design-dev.com
www.en.wikipedia.org/wiki/Data_mining
BIS4435 Discussion Topic 3 Answer by Priyadharshini Packrisamy
Data mining is the process of extracting valid, previously unknown comprehensive and
actionable information from large database and using it to make crucial business
Decision.
This may be as simple as a cursory glance at a spreadsheet, to check the numbers for
"sanity". Or it may involve graphical analysis with a full-blown OLAP tool. In short, Data
Mining can be applied anywhere in your business or organization where you are
interested in identifying and exploiting predictable outcomes[1].
Operational Database is the database-of-record, consisting of system-specific reference
data and event data belonging to a transaction-update system. The operational database
is the source of data for the data warehouse.So Data mining technique can be applied to
operational database also.
But when we analyze the data mining process it needs cleaned, self consistent, separate,
integrated data[2].
The results of data mining study are useful if there is some way to further investigate the
uncovered patterns. Data warehouse provide the capability to go back to data source. In
DW data from multiple sources to discover as many interrelationship as possible.
Many data mining tools currently operate outside of the warehouse, requiring extra steps
for extracting, importing, and analyzing the data. Furthermore, when new insights require
operational implementation, integration with the warehouse simplifies the application of
results from data mining.[3]
So the conclusion is it is possible to apply data mining in operational database but data
mining in data warehouse is more effective compared to operational database.
References
[1] http://www.albionresearch.com/data_mining/why.php
[2] Connolly, T. M. & Begg, C. E. (2005). Database Systems: A Practical Approach to
Design, Implementation, and Management. Pearson Education: Harlow.
[3]http://www.thearling.com/text/dmwhite/dmwhite.htm
BIS4435 Discussion Topic 3 Answer by Chintankumar Patel
Data mining is the process of extracting valid, previously unknown comprehensive and
actionable information from large database and using it to make crucial business
Decision.This may be as simple as a cursory glance at a spreadsheet, to check the
numbers for "sanity". Or it may involve graphical analysis with a full-blown OLAP tool. In
short, Data Mining can be applied anywhere in your business or organization where you
are interested in identifying and exploiting predictable outcomes[1].
Operational Database is the database-of-record, consisting of system-specific reference
data and event data belonging to a transaction-update system. The operational database
is the source of data for the data warehouse.So Data mining technique can be applied to
operational database also. But when we analyze the data mining process it needs
cleaned, self consistent, separate, integrated data[2].The results of data mining study are
useful if there is some way to further investigate the uncovered patterns. Data warehouse
provide the capability to go back to data source. In DW data from multiple sources to
discover as many interrelationship as possible.Many data mining tools currently operate
outside of the warehouse, requiring extra steps for extracting, importing, and analyzing
the data. Furthermore, when new insights require operational implementation, integration
with the warehouse simplifies the application of results from data mining.[3]So the
conclusion is it is possible to apply data mining in operational database but data mining in
data warehouse is more effective compared to operational database.
References
[1] http://www.albionresearch.com/data_mining/why.php
[2] Connolly, T. M. & Begg, C. E. (2005). Database Systems: A Practical Approach to
Design, Implementation, and Management. Pearson Education: Harlow.
BIS4435 Discussion Topic 3 Answer by Durga Devi Rajan
Basically data mining is a technique to bring out statistical result of an organization by
analysing the data.
Data mining technique can be applied to any database but that dbase should obey the
following rules
>data in the dbase must be cleaned and integrated one
>data in the dbase should be collected from multiple dbase
> Dbase supposed to allow further investigation
ODS is a operational database which obey above mentioned condition and when we think
about data warehouse obviously supports all the above rules
Since historical data is one of the important concept to consider .With historical data we
can end up with better decision making. So it’s always best to apply data mining
technique to data warehouse then operational data base.
ref Database Systems A Practical approach to design by
connolly &begg
BIS4435 Discussion Topic 3 Answer by Sanjay Kumar
In my opinion we can apply data mining tools on operational database for business
intelligence but at the time when we are going to apply data mining tools on operational
management for business intelligence , should be aware of some facts like:- what is the
difference between data mining & data warehousing?
Data warehousing is the process of combining data from different data sources into one
database. Data mining is analyzing data to find relationships, some of which are nonintuitive. Data mining is often performed on data warehouses.
Data warehouse is a repository of an organization's electronically stored data. Data
warehouses are designed to facilitate reporting and analysis while Data mining is the
process of extracting patterns from data. data mining can be used to uncover patterns in
data samples, it is important to be aware that the use of non-representative samples of
data may produce results that are not indicative of the domain.
In Data Warehousing, the Operational Database is one which is accessed by an
Operational System to carry out regular operations of an organization.Data Warehouses
use an OLAP Database (Online Analytical Processing) which is optimized for faster
queries.
REFERENCES:http://en.wikipedia.org/wiki/Data_mining
http://en.wikipedia.org/wiki/Data_warehouse
http://en.wikipedia.org/wiki/Operational_database
BIS4435 Discussion Topic 3 Answer by Allen Rajiev
We can apply data mining tools on operational database for business intelligence. Though
we can do this, it is not the best way to use Data mining tools.
Operational databases are designed for storing and retrieving day-to-day transactions
happening in the organization which will be helpful for transaction processing systems in
the organization. Data mining tools are having enormous capabilities to do data
preparation, selection of data mining operations, facilities for understanding results,
improving performance and so on. If we are using such a tool on an operational database
the tool cannot be utilized to its full capacity and also it will affect the performance of the
operational database.
Good BI can be gathered only by analyzing historical data which are usually part of Data
Warehouse. It is better to have a separate source of data for Data mining tools as it can
work on it for data cleansing, data describing, data transforming and data sampling. Data
mining tools require only read only access to the data. An operational DB is usually used
for read/write operations.
If data mining tools are applied on operational database it will surely affect the
performance of DB and hence it’s advisable to use data mining tools on Data warehouse.
Reference :
1. Thomas M Connolly & Carolyn E Begg , Database Systems : A Practical approach to
design, Implementation and management Fourth Edition Pages(1241)
2. Jiawei Han & Micheline Kamber, Data Mining concepts and Techniques, Second Edition
Pages(109,110)
BIS4435 Discussion Topic 3 Answer by Maria Chrysandreou
BIS4435 Discussion Topic 3 Answer by Sujatha Dingari
Data mining software allows users to analyze large databases to solve business decision
problems. Data mining is, in some ways, an extension of statistics, with a few artificial
intelligence and machine learning twists thrown in. Like statistics, data mining is not a
business solution, it is just a technology.
Data mining is not an “intelligence” tool or framework. Business intelligence, typically
drawn from an enterprise data warehouse, is used to analyze and uncover information
about past performance on an aggregate level. Data warehousing and business
intelligence provide a method for users to anticipate future trends from analyzing past
patterns in organizational data. Data mining is more intuitive, allowing for increased
insight beyond data warehousing. An implementation of data mining in an organization
will serve as a guide to uncovering inherent trends and tendencies in historical
information. It will also allow for statistical predictions, groupings and classifications of
data.
DATA WAREHOUSING:
Data warehouse is a repository of an organization's electronically stored data. Data
warehouses are designed to facilitate reporting and analysis.
This classic definition of the data warehouse focuses on data storage. However, the
means to retrieve and analyze data, to extract, transform and load data, and to manage
the data dictionary are also considered essential components of a data warehousing
system. Many references to data warehousing use this broader context. Thus, an
expanded definition for data warehousing includes business intelligence tools, tools to
extract, transform, and load data into the repository, and tools to manage and retrieve
metadata.
Data warehouse is essential for business Intelligence because it provides the company
management with historical data from both internal and external sources whereas a day
to day operational database excludes external data that can be used to weigh and predict
the performance and future performance of the company and also how to attain such
targets.
http://www.thearling.com/
http://www.aspfree.com/c/a/MS-SQL-Server/Using-Data-Mining-for-BusinessIntelligence/
BIS4435 Discussion Topic 3 Answer by Mohamed Sameem Syed Sahabudeen
Data Mining is “A process of transforming data into information and making it available to
users in a timely enough manner to make a difference”(Forrester Research, April 1996
1. Data quality: Different sources typically use inconsistent data representations, codes,
and formats which have to be reconciled.
To this end, we can say that Data mining automates analysis of operational data with the
intention of finding previously unknown data characteristics, relationships, dependencies,
and/or trends, so therefore, data mining, can be used for business intelligence for both
operational database, and data warehouse.
Chapter 8 of Database Modeling and Design: Logical Design, Fourth Edition by Toby
Teorey, published by Elsevier in 2006.
In the data warehouse, operational databases are not accessed directly to perform
information processing’ (gantthead.com, 2009). In fact, the operational database is the
source of data for the data warehouse. The main difference is operational database is
designed for real time business operations, while data warehouse is designed for analysis
of business measures. think data mining tools can be apllied on operational database for
business intelligence and it is not necessary to have a datawarehouse.But data mining is
a long process and is time consuming. And where as,datawarehouse focuses on data
storage which is usually used for analysis,storage purpose,etc.So data warehouse helps
make the retriving of information easier and helps to get a better and faster result
Data warehouse is essential for business Intelligence because it provides the company
management with historical data from both internal and external sources whereas a day
to day operational database excludes external data that can be used to weigh and predict
the performance and future performance of the company and also how to attain such
targets.
Data warehouse is a repository of an organization's electronically stored data. Data
warehouses are designed to facilitate reporting and analysis while Data mining is the
process of extracting patterns from data. data mining can be used to uncover patterns in
data samples, it is important to be aware that the use of non-representative samples of
data may produce results that are not indicative of the domain.
In Data Warehousing, the Operational Database is one which is accessed by an
Operational System to carry out regular operations of an organization.Data Warehouses
use an OLAP Database (Online Analytical Processing) which is optimized for faster
queries.
http://en.wikipedia.org/wiki/Data_warehouse
http://www.aspfree.com/c/a/MS-SQL-Server/Using-Data-Mining-for-BusinessIntelligence/
BIS4435 DISCUSSION TOPIC 3 READING MATERIAL