CS614-Midterm
1 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
2 / 50
Rearranging the grouping of source data, delivering it to the destination database, and ensuring the quality of data are crucial to the process of loading the data warehouse. Data ____________ is vitally important to the overall health of a warehouse project. 1. Cleansing 2. Cleaning 3. Scrubbing Which of the following options is true?
3 / 50
Data mining is a/an ______ approach , where browsing through data using mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
4 / 50
The automated, prospective analyses offered by data mining move beyond the analysis of past events provided by respective tools typical of ___________.
5 / 50
There are many variants of the traditional nested-loop join, if there is an index is exploited, then it is called……
6 / 50
Non uniform distribution, when the data is distributed across the processors, is called ______.
7 / 50
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
8 / 50
Pre-computed _______ can solve performance problems
9 / 50
Pakistan is one of the five major ________ countries in the world.
10 / 50
The Kimball s iterative data warehouse development approach drew on decades of experience to develop the _________.
11 / 50
The need to synchronize data upon update is called
12 / 50
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
13 / 50
A ________ dimension is a collection of random transactional codes, flags and/text attributes that are unrelated to any particular dimension. The ______ dimension is simply a structure that provides a convenient place to store the ______ attributes.
14 / 50
Analytical processing uses ____________ , instead of record level access.
15 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
16 / 50
We must try to find the one access tool that will handle all the needs of their users.
17 / 50
Node of a B-Tree is stored in memory block and traversing a B-Tree involves ______ page faults.
18 / 50
Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
19 / 50
Grain is the ________ level of data stored in the warehouse.
20 / 50
: An optimized structure which is built primarily for retrieval, with update being only a secondary consideration is
21 / 50
The goal of star schema design is to simplify ________
22 / 50
NUMA stands for __________
23 / 50
The divide & conquer cube partitioning approach helps alleviate the ____________ limitations of MOLAP implementation.
24 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
25 / 50
For a DWH project, the key requirement are ________ and product experience.
26 / 50
In DWH project, it is assured that ___________ environment is similar to the production environment
27 / 50
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
28 / 50
Normalization effects performance
29 / 50
Cube is a __________ entity containing values of a certain fact at a certain aggregation level at an intersection of a combination of dimensions.
30 / 50
De-Normalization normally speeds up
31 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
32 / 50
33 / 50
In _________ system, the contents change with time. :
34 / 50
A single database, couldn‟t serve both operational high performance transaction processing and DSS, analytical processing, all at the same time.
35 / 50
36 / 50
Data mining derives its name from the similarities between searching for valuable business information in a large database, for example, finding linked products in gigabytes of store scanner data, and mining a mountain for a _________ of valuable ore.
37 / 50
38 / 50
B-Tree is used as an index to provide access to records
39 / 50
40 / 50
Many data warehouse project teams waste enormous amounts of time searching in vain for a _______.
41 / 50
Which statement is true for De-Normalization?
42 / 50
in agriculture extension is that pest population beyond which the benefit of spraying outweighs levels
43 / 50
DSS queries do not involve a primary key
44 / 50
Data mining evolve as mechanism to cater the limitations of _____ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc...
45 / 50
For a smooth DWH implementation we must be a technologist.
46 / 50
Transactional fact tables do not have records for events that do not occur. These are called
47 / 50
Investing years in architecture and forgetting the primary purpose of solving business problems, results in inefficient application. This is the example of _________ mistake.
48 / 50
Change Data Capture is one of the challenging technical issues in _____________
49 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
50 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
Your score is
The average score is 0%
Restart quiz