CS614-Midterm
1 / 50
A node of a B-Tree is stored in a memory block, and traversing a B-Tree involves ______ page faults.
2 / 50
Pre-computed _______ can solve performance problems
3 / 50
Normalization affects performance.
4 / 50
It is observed that every year the amount of data recorded in an organization is
5 / 50
There are many variants of the traditional nested-loop join. If an index is exploited, the variant is called……
6 / 50
Which statement is true for De-Normalization?
7 / 50
During the ETL process of an organization, suppose you have data that can be transformed using any of the transformation methods. Which of the following strategies would you choose for the least complexity?
8 / 50
Data warehousing and on-line analytical processing (OLAP) are _______ elements of a decision support system.
9 / 50
The users of a data warehouse are knowledge workers; in other words, they are _______ in the organization.
10 / 50
Slice and Dice is changing the view of the data.
11 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of the organization from:
12 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
13 / 50
It is observed that every year the amount of data recorded in an organization:
14 / 50
Data mining derives its name from the similarities between searching for valuable business information in a large database, for example, finding linked products in gigabytes of store scanner data, and mining a mountain for a _________ of valuable ore.
15 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
16 / 50
To measure or quantify similarity or dissimilarity, different techniques are available. Which of the following options names the available techniques?
17 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
18 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
19 / 50
If w is the window size and n is the size of the data set, then the complexity of the merging phase in the BSN method is ___________
20 / 50
21 / 50
Multidimensional databases typically use a proprietary __________ format to store pre-summarized cube structures.
22 / 50
23 / 50
Many data warehouse project teams waste enormous amounts of time searching in vain for a _______.
24 / 50
For a given data set, to get a global view in un-supervised learning we use
25 / 50
All data is ______________ of something real.
I. An Abstraction
II. A Representation
Which of the following options is true?
26 / 50
Collapsing tables can be done on the ___________ relationships
27 / 50
_____ modeling technique is more appropriate for data warehouses.
28 / 50
For a DWH project, the key requirements are ________ and product experience.
29 / 50
Pakistan is one of the five major ________ countries in the world.
30 / 50
The technique that is used to perform these feats in data mining is called modeling, and this act of model building is something that people have been doing for a long time, certainly before the _______ of computers or data mining technology.
31 / 50
The need to synchronize data upon update is called
32 / 50
People who design and build the data warehouse must be capable of working across the organization at all levels.
33 / 50
Rearranging the grouping of source data, delivering it to the destination database, and ensuring the quality of data are crucial to the process of loading the data warehouse. Data ____________ is vitally important to the overall health of a warehouse project.
1. Cleansing
2. Cleaning
3. Scrubbing
Which of the following options is true?
34 / 50
Analytical processing uses ____________, instead of record level access.
35 / 50
If 'M' rows from table-A match the conditions in the query, then table-B is accessed 'M' times. Suppose table-B has an index on the join column. If 'a' I/Os are required to read the index block for each scan and 'b' I/Os for each data block, then the total cost of accessing table-B is approximately _____________ logical I/Os.
36 / 50
37 / 50
NUMA stands for __________
38 / 50
Data mining uses _________ algorithms to discover patterns and regularities in data.
39 / 50
The key idea behind ___________ is to take a big task and break it into subtasks that can be processed concurrently on a stream of data inputs in multiple, overlapping stages of execution.
40 / 50
When performing objective assessments, companies follow a set of principles to develop metrics specific to their needs; it is hard to have a “one size fits all” approach. Which of the following statements represents the pervasive functional forms?
41 / 50
42 / 50
Focusing only on data warehouse delivery often ends up _________.
43 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
44 / 50
_________ breaks a table into multiple tables based upon common column values.
45 / 50
Data Transformation Services (DTS) provide a set of _____ that lets you extract, transform, and consolidate data from disparate sources into single or multiple destinations supported by DTS connectivity.
46 / 50
Suppose the amount of data recorded in an organization is doubled every year. This increase is __________.
47 / 50
____________ in agriculture extension is the pest population beyond which the benefit of spraying outweighs its cost.
48 / 50
The divide & conquer cube partitioning approach helps alleviate the ____________ limitations of MOLAP implementation.
49 / 50
50 / 50
For a relation to be in 4NF, it must be: