Download CUSTOMER_CODE SMUDE DIVISION_CODE SMUDE

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Forecasting wikipedia , lookup

Data assimilation wikipedia , lookup

Transcript
CUSTOMER_CODE
SMUDE
DIVISION_CODE
SMUDE
EVENT_CODE
JULY2016
ASSESSMENT_CODE MC0088_JULY2016
QUESTION_TYPE
DESCRIPTIVE_QUESTION
QUESTION_ID
5227
QUESTION_TEXT
Distinguish the features between OLTP and OLAP
SCHEME OF
EVALUATION
Users and sytem orientatioin: An OLTP system is customer oriented and
is used for transaction and query used for transacrion and
query processing by clerks,clients and information technology
professionals. An OLAP system is market oriented and is usd for data
analysis by knowledge workers, including managers, executives and
analysts. (2 marks)
Data contents: An OLTP system managers current data that typically are
too detailed to be easily used for decision making. An OLTP system
managers large amounts of historiacl data, provides facilities for
summarization and aggregation and stores and managers information at
different levels of granularity. These features make the data easier to use
in informed decision making. (2 marks)
Database design:An OLTP system usually adopts an entity relationship
data model and an application oriented database design. An OLAP
system typically adopts either a star or snowflake model and subject –
oriented database design. (2 marks)
View:An OLTP system focuses mainly on the current data within an
enterprise or department without referring to historical data or data
in different organizations. In contrast OLAP system often spans multiple
versions of a database schema, due to the evolutionary process of an
organization. OLAP systems also deal with information that originates
from different organizations, integrating information from many data
stores. Because of their huge volume, OLAP data are stored on multiple
storage media. (2 marks)
Access patterns: The access patterns of an OLTP system consists manily
of short, atomic transactions. Such a system requires
concurrency concurrency control and recovery mechanisms. However
access to OLAP systems are mostly read only operations although many
could be complex queries. (2 marks)
(Total 10 marks)
QUESTION_TYPE
DESCRIPTIVE_QUESTION
QUESTION_ID
5228
QUESTION_TEXT
Explain the concept of data transformation.
SCHEME OF
EVALUATION
In data transformation, the data are transformed or consolidated into
forms appropriate for mining. It involves the following: (2 marks)
Smoothing: which works to remove the noise form data? Such
techniques include binning, clustering and regression.
Aggregation, where summary of aggregation operations are applied to
the data. This step is typically used in constructing a data cube for
analysis of the data at multiple granularities. (2 marks)
Generalization of the data, where low level or primitive data are
replaced by higher level concepts through the use of concept
hierarchies. Ex like street can be generalizes to city or country (2
marks)
Normalization, where attribute data are scaled so as to fall within a
small specified range such as 1.0 to 1.0 or 0.0 to 1.0 (2 marks)
Attribute construction where new attributes are constructed and added
from the given set of attributes to help the mining process. (2 marks)
QUESTION_TYPE
DESCRIPTIVE_QUESTION
QUESTION_ID
72561
QUESTION_TEXT
Discuss key features of a Data warehouse as per W. H. Inmon’s
statement.
SCHEME OF
EVALUATION
Key features are:
● Subject-oriented
● Integrated
● Time variant
● Non-volatile
QUESTION_TYPE
DESCRIPTIVE_QUESTION
QUESTION_ID
72562
QUESTION_TEXT
What are the Four different views regarding the design of a data
warehouse? Explain.
SCHEME OF
EVALUATION
The views are:
● Top-down view
● Data source view
● Data warehouse view
● Business query view
QUESTION_TYPE
DESCRIPTIVE_QUESTION
QUESTION_ID
117791
QUESTION_TEXT
Discuss Data Mining technologies.
a.
Decision trees
b.
Rule induction
c.
Genetic algorithms
d.
Nearest neighbor
e.
Artificial neural networks
SCHEME OF EVALUATION
(2 marks each with explanation)
QUESTION_TYPE
DESCRIPTIVE_QUESTION
QUESTION_ID
117794
QUESTION_TEXT
List and explain the web content mining challenges.

Data/Information extraction

Web information integration and schema matching

Opinion extraction from online sources

Knowledge synthesis

Segmenting web pages and detecting noise
SCHEME OF EVALUATION
5×2=10 marks