CS614-Midterm
1 / 50
If „M‟ rows from table-A match the conditions in the query then table-B is accessed „M‟ times. Suppose table-B has an index on the join column. If „a‟ I/Os are required to read the data block for each scan and „b‟ I/Os for each data block then the total cost of accessing table-B is _____________ logical I/Os approximately.
2 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
3 / 50
.______ is class of Decision Support Environment.
4 / 50
The input to the data warehouse can come from OLTP or transactional system but not from other third party database.
5 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
6 / 50
For a given data set, to get a global view in un-supervised learning we use
7 / 50
Horizontal splitting breaks a table into multiple tables based upon_______
8 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
9 / 50
_____modeling technique is more appropriate for data warehouses.
10 / 50
: The goal of ___________ is to look at as few blocks as possible to find the matching records(s).
11 / 50
Non uniform distribution, when the data is distributed across the processors, is called ______.
12 / 50
The need to synchronize data upon update is called
13 / 50
It is observed that every year the amount of data recorded in anorganization is
14 / 50
B-Tree is used as an index to provide access to records
15 / 50
Cube is a __________ entity containing values of a certain fact at a certain aggregation level at an intersection of a combination of dimensions.
16 / 50
The goal of star schema design is to simplify ________
17 / 50
Which statement is true for De-Normalization?
18 / 50
As apposed to the out come of classification, estimation deal with ____________ valued outcome.
19 / 50
20 / 50
If w is the window size and n is the size of data set, then the complexity of merging phase in BSN method is___________
21 / 50
NUMA stands for __________
22 / 50
23 / 50
Analytical processing uses ____________ , instead of record level access.
24 / 50
The growth of master files and magnetic tapes exploded around the mid- _______. :
25 / 50
Data mining uses _________ algorithms to discover patterns and regularities in data.
26 / 50
________ is the technique in which existing heterogeneous segments are reshuffled, relocated into homogeneous segments.
27 / 50
5 million bales.
28 / 50
Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
29 / 50
: An optimized structure which is built primarily for retrieval, with update being only a secondary consideration is
30 / 50
The Kimball s iterative data warehouse development approach drew on decades of experience to develop the _________.
31 / 50
Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
32 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
33 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
34 / 50
Naturally Evolving architecture occurred when an organization had a _______ approach to handling the whole process of hardware and software architecture.
35 / 50
_______________, if fits into memory, costs only one disk I/O access to locate a record by given key.
36 / 50
For a DWH project, the key requirement are ________ and product experience.
37 / 50
Data mining is a/an ______ approach , where browsing through data using mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
38 / 50
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
39 / 50
Focusing on data warehouse delivery only often end up _________.
40 / 50
Investing years in architecture and forgetting the primary purpose of solving business problems, results in inefficient application. This is the example of _________ mistake.
41 / 50
_________ breaks a table into multiple tables based upon common column values.
42 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
43 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
44 / 50
45 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
46 / 50
____________ in agriculture extension is that pest population beyond which the benefit of spraying outweighs its cost.
47 / 50
Companies collect and record their own operational data, but at the same time they also use reference data obtained from _______ sources such as codes, prices etc.
48 / 50
The automated, prospective analyses offered by data mining move beyond the analysis of past events provided by respective tools typical of ___________.
49 / 50
50 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
Your score is
The average score is 0%
Restart quiz