CS614-Midterm
1 / 50
Data mining uses _________ algorithms to discover patterns and regularities in data.
2 / 50
Data mining evolve as mechanism to cater the limitations of _____ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc...
3 / 50
Pre-computed _______ can solve performance problems
4 / 50
There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
5 / 50
All data is ______________ of something real. I An Abstraction II A Representation Which of the following option is true?
6 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
7 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
8 / 50
De-Normalization normally speeds up
9 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
10 / 50
The goal of ______is to look at as few block as possible to find the matching records.
11 / 50
NUMA stands for __________
12 / 50
Many data warehouse project teams waste enormous amounts of time searching in vain for a _______.
13 / 50
If every key in the data file is represented in the index file then index is :
14 / 50
Cube is a __________ entity containing values of a certain fact at a certain aggregation level at an intersection of a combination of dimensions.
15 / 50
For a smooth DWH implementation we must be a technologist.
16 / 50
The technique that is used to perform these feats in data mining modeling, and this act of model building is something that people have been doing for long time, certainly before the _______ of computers or data mining technology.
17 / 50
_______________, if fits into memory, costs only one disk I/O access to locate a record by given key.
18 / 50
Virtual cube is used to query two similar cubes by creating a third “virtual” cube by a join between two cubes.
19 / 50
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
20 / 50
Focusing on data warehouse delivery only often end up _________.
21 / 50
B-Tree is used as an index to provide access to records
22 / 50
Analytical processing uses ____________ , instead of record level access.
23 / 50
5 million bales.
24 / 50
It is observed that every year the amount of data recorded in anorganization is
25 / 50
A single database, couldn‟t serve both operational high performance transaction processing and DSS, analytical processing, all at the same time.
26 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
27 / 50
During the application specification activity, we also must give consideration to the organization of the applications.
28 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
29 / 50
30 / 50
The growth of master files and magnetic tapes exploded around the mid- _______. :
31 / 50
Rearranging the grouping of source data, delivering it to the destination database, and ensuring the quality of data are crucial to the process of loading the data warehouse. Data ____________ is vitally important to the overall health of a warehouse project. 1. Cleansing 2. Cleaning 3. Scrubbing Which of the following options is true?
32 / 50
In DWH project, it is assured that ___________ environment is similar to the production environment
33 / 50
It is observed that every year the amount of data recorded in an organization :
34 / 50
Pre-join technique is used to avoid
35 / 50
30.Data Warehouse is about taking / colleting data from different ________ sources:
36 / 50
The goal of star schema design is to simplify ________
37 / 50
38 / 50
39 / 50
The divide & conquer cube partitioning approach helps alleviate the ____________ limitations of MOLAP implementation.
40 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
41 / 50
42 / 50
_____________ is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records.
43 / 50
44 / 50
The STAR schema used for data design is a __________ consisting of fact and dimension tables. :
45 / 50
_____modeling technique is more appropriate for data warehouses.
46 / 50
A data warehouse implementation without an OLAP tool is always possible.
47 / 50
in agriculture extension is that pest population beyond which the benefit of spraying outweighs levels
48 / 50
________ gives total view of an organization
49 / 50
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
50 / 50
DOLAP allows download of “cube” structures to a desktop platform with the need for shared relational or cube server.
Your score is
The average score is 0%
Restart quiz