CS614-Midterm
1 / 50
Slice and Dice is changing the view of the data.
2 / 50
Which statement is true for De-Normalization?
3 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
4 / 50
Focusing on data warehouse delivery only often ends up _________.
5 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
6 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
7 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
8 / 50
For a relation to be in 4NF it must be:
9 / 50
People who design and build the data warehouse must be capable of working across the organization at all levels.
10 / 50
Data mining evolved as a mechanism to cater for the limitations of ________ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.
11 / 50
Ad-hoc access means to run such queries which are known already.
12 / 50
____________ in agriculture extension is that pest population beyond which the benefit of spraying outweighs its cost.
13 / 50
If every key in the data file is represented in the index file, then the index is:
14 / 50
The performance in a MOLAP cube comes from the O(1) look-up time for the array data structure.
15 / 50
The STAR schema used for data design is a __________ consisting of fact and dimension tables.
16 / 50
DOLAP allows download of “cube” structures to a desktop platform with the need for shared relational or cube server.
17 / 50
We must try to find the one access tool that will handle all the needs of their users.
18 / 50
Change Data Capture is one of the challenging technical issues in _____________
19 / 50
To judge effectiveness we perform data profiling twice.
20 / 50
_________ breaks a table into multiple tables based upon common column values.
21 / 50
The goal of star schema design is to simplify ________
22 / 50
23 / 50
Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
24 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
25 / 50
It is observed that every year the amount of data recorded in an organization:
26 / 50
Data mining derives its name from the similarities between searching for valuable business information in a large database, for example, finding linked products in gigabytes of store scanner data, and mining a mountain for a _________ of valuable ore.
27 / 50
Non-uniform distribution, when the data is distributed across the processors, is called ______.
28 / 50
There are many variants of the traditional nested-loop join. If an index is exploited, then it is called ______.
29 / 50
________ gives total view of an organization
30 / 50
During the application specification activity, we also must give consideration to the organization of the applications.
31 / 50
Analytical processing uses ____________ , instead of record level access.
32 / 50
_____ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limited capability to provide decision support and analysis.
33 / 50
For a smooth DWH implementation we must be a technologist.
34 / 50
The technique that is used to perform these feats in data mining is modeling, and this act of model building is something that people have been doing for a long time, certainly before the _______ of computers or data mining technology.
35 / 50
36 / 50
Horizontal splitting breaks a table into multiple tables based upon_______
37 / 50
Pre-computed _______ can solve performance problems
38 / 50
Naturally Evolving architecture occurred when an organization had a _______ approach to handling the whole process of hardware and software architecture.
39 / 50
_______________, if too big and does not fit into memory, will be expensive when used to find a record by given key.
40 / 50
Data mining is a/an __________ approach, where browsing through data using data mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
41 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of computation.
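The relationship described in this question can be sketched numerically. The snippet below is a minimal illustration, assuming the standard Amdahl's law model (the question itself does not name it); the function name `amdahl_speedup` and its parameters are hypothetical and chosen only for this example.

```python
def amdahl_speedup(serial_fraction: float, processors: int) -> float:
    """Theoretical speedup under Amdahl's law (an assumed model, not named in the quiz).

    serial_fraction: share of the run time that cannot be parallelized (0.0 to 1.0).
    processors: number of processors working on the parallelizable share.
    """
    if not 0.0 <= serial_fraction <= 1.0:
        raise ValueError("serial_fraction must be between 0 and 1")
    if processors < 1:
        raise ValueError("processors must be at least 1")
    parallel_fraction = 1.0 - serial_fraction
    # The serial share runs at full cost; the parallel share is divided among processors.
    return 1.0 / (serial_fraction + parallel_fraction / processors)


if __name__ == "__main__":
    # Compare how the speedup on 16 processors changes with the serial share.
    for fraction in (0.5, 0.1, 0.01):
        print(f"serial={fraction:.2f} speedup={amdahl_speedup(fraction, 16):.2f}")
```

Running the loop with different serial fractions shows how the achievable speedup on a fixed number of processors depends on the portion of the program that cannot be parallelized.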
42 / 50
Companies collect and record their own operational data, but at the same time they also use reference data obtained from _______ sources such as codes, prices etc.
43 / 50
De-Normalization normally speeds up
44 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
45 / 50
For a given data set, to get a global view in un-supervised learning we use
46 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
47 / 50
48 / 50
49 / 50
The goal of ___________ is to look at as few blocks as possible to find the matching record(s).
50 / 50
______ is a class of Decision Support Environment.