Download Which of the following is the most popularly available and rich

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Extensible Storage Engine wikipedia , lookup

Entity–attribute–value model wikipedia , lookup

Big data wikipedia , lookup

Database wikipedia , lookup

Relational model wikipedia , lookup

Clusterpoint wikipedia , lookup

Functional Database Model wikipedia , lookup

Healthcare Cost and Utilization Project wikipedia , lookup

Database model wikipedia , lookup

Transcript
11. What is the single dimensional association
rule for the following predicate notation,
which in multidimensional association rule.
1.
Which of the following is the most popularly
available and rich information repositories?
a.
b.
c.
2.
Temporal databases
Relational databases
Transactional databases
Which of the following databases is used
to store time-related data?
a.
contains(T,
"software")
a.
Computer ==
software
b.
Software ==
computer
c.
Software ==
computer
Spatial databases
c.
Multimedia databases
d.
Temporal databases
3. From a DWH perspective, data mining can be
viewed as an advanced stage of
a.
On-Line Transaction Processing
b.
On-Line Data Processing
c.
On-Line Analytical Processing
d.
On-Line Electronic Processing
4. A _ _ _ _ _ _ is a group of heterogeneous
databases?
a.
Time series databases
b.
Object oriented databases
c.
Legacy databases
d.
Spatial databases
5. Spatial databases includes
a.
Legacy databases
b.
Time series databases
c.
Satellite image databases
d.
Temporal databases
7. Many people treat data mining as synonym for
another popularly used term
a.
Knowledge Discovery in databases
b.
knowledge inventory in databases
c.
Knowledge acceptance in databases
d.
knowledge disposal in databases.
7. A database is a collection of
a.
Related data
b.
Interrelated data
c.
Irrelevant data
d.
Distributed data
9. A Relational database is a collection of
a.
tables
b.
events
c.
attributes
d.
values
10. A _ _ _ _ _ _ _ is a repository of information
collected from multiple squares stored under a
unified schema, and which usually resides at a
single site.
a.
Data mining
b.
Database
c.
Data warehouse
d.
legacy databases
10. Which of the following databases is used
to store image, audio, and video data?
a.
b.
c.
d.
Contains(T, "computer") ==
Heterogeneous databases
Temporal databases
Legacy databases
Multimedia databases
d.
Computer ==
software
12. Which of the following analysis attempt to
identify attributes that do not contribute
to the classification or prediction process?
a.
Cluster analysis
b.
Outlier analysis
c.
Relevance analysis
d.
Evolution analysis
13. Which of the following is a summarization of
the general characteristics or features of a
target class of data?
a.
Data discrimination
b.
Data characterization
c.
Data compression
d.
Meta data
14. _ _ _ _ _ _ _ is a comparison of the general
features of target class data objects with
general features of objects from one or a set
of contrasting classes.
a.
Data characterization
b.
Data summarization
c.
Data discrimination
d.
Meta data
15. _ _ _ _ _ _ _ interestingness measures are
based on user beliefs in the data.
a.
Objective
b.
Descriptive
c.
Collective
d.
Subjective
16. _ _ _ _ _ _ mining tasks characterize the
general properties of the data in the
databases.
a.
b.
c.
d.
Descriptive
Predictive
Metadata
Data
17. _ _ _ _ _ mining tasks perform inference on the
current data in order to make predictions.
a.
Descriptive
b.
Predictive
c.
Data
d.
Metadata
18. The derived model may be represented in the
form of
a.
b.
c.
d.
ER model
Flow chart
Decision trees
DFD
19. Which of the following is the classification of
data mining systems?
a.
Summarization
b.
Visualization
c.
Discrimination
d.
Characterization
20. _ _ _ _ _ _ _ analysis describes and models
regularities or trends for objects whose
behavior changes over time.
a.
Data evolution
b.
Cluster
c.
Outlier
d.
Summarization
21. Which of the following issues relation to the
diversity of database type?
28. Pattern evolution is an issue related to
a.
Mining methodology and user
interaction issues
b.
Performance issues
c.
Issues relating to the diversity of
database types
d.
Issues relating to the Measurement
29. A DWH is a subject oriented, integrated,
time-variant, and _ _ _ _ _ _ collection of
data in support of management's decisionmaking process.
a.
b.
c.
a.
Nonvolatile
b.
Volatile
c.
Disintegrated
d.
Object- oriented
30. An _ _ _ system focuses mainly on the current
data with in an enterprise or department,
without referring to historical data or data in
different organizations .
a.
a.
On-Line Analytical Processing
b.
On-Line Data Processing
c.
On-Line Electronic Processing
d.
On-Line Transaction Processing
31. The
basic
characteristic
of
On-line
Analytical Processing is
Handling noisy or incomplete data
Incorporation of background knowledge
Handling
of
relational
and
complex types of data
d.
Efficiency and scalability of data
mining algorithms
22. Which of the following is not major issue
in data mining?
Mining
methodology
and
user
interaction issues
b.
Performance issues
c.
Issues relating to the diversity of database
types
d.
Issues relating to the Measurement
23. Processing _ _ _ _ _ queries in operational
databases would substantially degrade the
performance of operational tasks.
a.
On-Line Transaction Processing
b.
On-Line Electronic Processing
c.
On-Line Data Processing
d.
On-Line Analytical Processing
24. An _ _ _ _ _ _ System typically adopts either a
star or snow flake model and subject oriented
database design.
a.
b.
c.
d.
On-Line Transaction Processing
On-Line Electronic Processing
On-Line Analytical Processing
On-Line Data Processing
25. The access patterns of an _ _ _ _ system
consist mainly of short, atomic transactions.
a.
On-Line Analytical Processing
b.
On-Line Transaction Processing
c.
On-Line Electronic Processing
d.
On-Line Data Processing
26. Which of the following approach requires
complex information filtering and
integration processes and competes for
resources with processing at local sources?
a.
Update-driven approach
b.
Integrate-driven approach
c.
Query-driven approach
d.
Data-driven approach
27. Mining different kinds of knowledge
in databases is an issue in
a.
b.
c.
d.
Performance issue
Mining methodology and user
interaction issues
Diversity of database types issues
time complexity
a.
Informational processing
b.
Operational processing
c.
Data processing
d.
Data cleaning
32. Which of the following cuboid that holds
the highest level of summerization?
a.
b.
c.
d.
Cuboid
Base cuboid
Non-base cuboid
Apex coboid
33. _ _ _ _ _ _ _ _ _ _ is a visualization operation
that rotates the data axes in view in order to
provide an alternative presentation of the data
a.
Rollup
b.
Drill down
c.
Pivot
d.
Slice & dice
34. _ _ _ _ _ _ tables can be specified by users
or experts, or automatically generated and
adjusted based on data distributions.
a.
Fact
b.
Summarized
c.
Dimension
d.
Relational
35. _ _ _ _ _ _ _ executes queries involving
more than one fact table
a.
Drillthroughb.
Drillacross c. Drilldown
d.
Rotate
36. A _ _ _ _ _ allows data to be modeled and
viewed in multiple dimensions.
a.
b.
c.
d.
Meta data
Data cube
Database
Fact table
37. The major difference between the snowflake
and star schema models is that the dimension
tables of the snowflake model image kept in _
_ _ _ form
a.
Standard
b.
De-normalized
c.
Normalized
d.
Multi dimensional
38. Which of the following is not a measure,
which is based on the kind of aggregation
functions used.
a.
Cumulative
b.
Distributed
c.
Algebraic
d.
Holistic
39. A concept hierarchy that is a total or partial
order among attributes in database schema
is called a _ _ _ _ _ _ _ _ _ _ _ hierarchy.
a.
Set-grouping
b.
Grouping
c.
Decision
d.
Schema
40. Which of the following focuses on
socioeconomic applications?
a.
Statistical database systems
b.
Online Analytical Processing systems
c.
Spatial database systems
d.
Temporal database systems
41. A _ _ _ _ _ _ _ _ _ model consists of radial
lines emanating from a central point, where
each line represents a concept hierarchy for
a dimension
a.
Cube net
b.
Triangle net
c.
Square net
d.
Star net
42. Which of the following is constructed where
the enterprise warehouse is the sole
custodian of all warehouse data. Which is then
distributed to the various dependent data
marts.
a.
Enterprise DWH
b.
Two- tier DWH
c.
Multi-tier DWH
d.
Virtual warehouse
43. Which
of
the
following
is
a
Multi
Dimensional Online Analytical Processing?
a.
Ess base
b.
Database
c.
Swiss base
d.
Red brick
44. The _ _ _ _ _ _ view includes fact tables and
dimension tables.
a.
DWH
b.
Top-down
c.
Data source
d.
Business Query
45. Which of the following is a Hybrid
OLAP server?
a.
MS SQL server 1.0
b.
MS SQL 5.0
c.
MS SQL server 7.0
d.
MS SQL server 3.0
46. ETL stands for
a.
b.
Evaluate, Transport and Link
Extract Transfer and Load
c.
Error, Tracking and Load
d.
Extract, Transient and Load
47. To architect the DWH, the major driving factor
to support is
a.
An inability to cope with requirements
evolution
b.
Not populating the warehouse
c.
Day- to- day management of
the warehouse
d.
Supporting Online Transaction processing
48. A _ _ _ _ _ _ _ contains a subset of corporatewide data that is of value to a specific group of
users.
a.
Enterprise warehouse
b.
Virtual warehouse
c.
Data warehouse
d.
Data mart
49. A _ _ _ _ _ _ _ is a set of views over
operational databases
a.
Enterprise warehouse
b.
Virtual warehouse
c.
Data warehouse
d.
Data mart
50. What kind of the intermediate servers that
stand in between a relational back-end server
and client front-end tools?
a.
Hybrid OLAP servers
b.
Multidimensional OLAP server
c.
Relational OLAP servers
d.
Specialized SQL servers
51. Choose the _ _ _ _ _ _ _ _ _ that will
populate each fact table record
a.
Measures
b.
Dimensions
c.
Grain
d.
Business Process
52. How many cuboids are there in an ndimensional data cube?
a.
b.
c.
d.
53. Meta data repository contains
a.
b.
c.
Operational meta data
Data irrelevant to system performance
The mapping from the DWH to
the operational environment
d.
Summarized data
54. Which of the following support the bitmap
indices
a.
Sybase IQ
b.
Oracle 7
c.
CoBoL
d.
SQL
55. _ _ _ _ _ _ _ are created for the data names
and definitions of the given warehouse
a.
b.
c.
Data cube
Summarized data
Meta data
d.
Detailed Information
56. Chunking technique involves "overlapping" some of the aggregation computations, it is referred to as _ _
_ _ _ aggregation in data cube computation
a.
Two way array
b.
Three way array
c.
Multi way array
d.
Sparse array
57. The _ _ _ _ _ _ _ operator computes aggregates over all subsets of the dimensions specified in the
operation.
a.
Data base
b.
Computer cube
c.
Define cube
d.
Group by
58. Which of the following is a subcuge that is small enough to fit into the memory available for cube
computation?
a.
Bulk
b.
Array
c.
Structure
d.
Chunk
59. The bit mapped join indices method is an integrated form of
a.
Composite join indexing and bitmap indexing
b.
Join indexing and composite join indexing
c.
Join indexing and bitmap indexing
d.
Bitmap indexing and outer join indexing
60. A set of attributes in a relation schema that forms a primary key for another relation schema is called a _
______
a.
b.
c.
d.
61. Which
Primary key
Foreign key
Secondary key
Composite key
of the following typically gathers data from multiple, heterogeneous, and external sources?
a.
b.
c.
d.
62. OLAM
Data cleaning
Load
Refresh
Data extraction
is particularly important for the following reason
a.
How quality of data in DWH
b.
Data processing
c.
OLTP-based exploratory data analysis
d.
Online selection of data mining functions
63. Which of the following sets a good example for interactive data analysis and provides the necessary
preparations for exploratory data mining?
a.
b.
c.
d.
64. Which
a.
b.
c.
OLP
OLAP
OLTP
OLDP
of the following is not exception indicator?
Out Exp
Self Exp
In Exp
d.
Path Exp
65. _ _ _ _ _ _ _ _ _ can help business managers find and reach more suitable customers, as well as gain critical
business insights that may help to drive market share and raise profits.
a.
Data warehouse
b.
Data mining
c.
Data summarization
d.
Data processing
66. _ _ _ _ _ _ _ _ _ _ _ is an alternative approach in which pre-computed measures indicating data exceptions
are used to guide the user in the data analysis process at all levels of aggregation.
a.
b.
c.
d.
Hypothesis-driven exploration
Inventory-driven exploration
Discovery-driven exploration
Exception-driven exploration
67. Which of the following is an exception indicator that indicates that indicates the degree of surprise of the cell
value, relative to other cells at the same level of aggregation?
a.
b.
c.
d.
68. _ _ _ _
a.
b.
c.
d.
Out Exp
In Exp
Path Exp
Self Exp
_ is a powerful paradigm that integrates OLAP with data mining technology.
Online Analytical Modeling
Online Analytical Machine
Online Analytical Mining
Online Analytical Monitoring
69. Data warehouse application is _ _ _ _ _ _ _ _ _
a.
Data Processing
b.
Transaction Processing
c.
Datacube
d.
Datamining
70. _ _ _ _ _ _ _ _ _ cubes compute complex queries involving multiple dependent aggregates as multiple
granularities
a.
b.
c.
d.
Multi feature
Data
Meta
Solid
71. Which of the following performs a linear transformation on the original data?
a.
b.
c.
d.
72. Which
Z-score normalization
Normalization with decimal scaling
Zero-standard deviation
Min-max normalization
of the following is the best method for missing values in data cleaning?
a.
b.
c.
Fill in the missing value manually
Use the most probable value to fill in the missing value
Use the attribute mean to fill the missing
value
d.
Use a global constant to fill in the missing value
73. The minimum and maximum values in a given bin are identified as the
a.
b.
c.
Bin means
Bin average
Bin media