CS614-Midterm
1 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
2 / 50
It is observed that every year the amount of data recorded in an organization :
3 / 50
Slice and Dice is changing the view of the data.
4 / 50
_______________, if fits into memory, costs only one disk I/O access to locate a record by given key.
5 / 50
The technique that is used to perform these feats in data mining modeling, and this act of model building is something that people have been doing for long time, certainly before the _______ of computers or data mining technology.
6 / 50
Pakistan is one of the five major ________ countries in the world.
7 / 50
For a relation to be in 4NF it must be:-
8 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
9 / 50
To identify the __________________ required we need to perform data profiling
10 / 50
_______________, if too big and does not fit into memory, will be expensive when used to find a record by given key.
11 / 50
_____________ is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records.
12 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of computation.
13 / 50
Data mining evolve as mechanism to cater the limitations of _____ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc...
14 / 50
A dense index, if fits into memory, costs only ______ disk I/O access to locate a record by given key.
15 / 50
The divide & conquer cube partitioning approach helps alleviate the ____________ limitations of MOLAP implementation.
16 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
17 / 50
A data warehouse implementation without an OLAP tool is always possible.
18 / 50
NUMA stands for __________
19 / 50
In _________ system, the contents change with time. :
20 / 50
The goal of star schema design is to simplify ________
21 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
22 / 50
in agriculture extension is that pest population beyond which the benefit of spraying outweighs levels
23 / 50
De-Normalization normally speeds up
24 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
25 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
26 / 50
5 million bales.
27 / 50
Analytical processing uses ____________ , instead of record level access.
28 / 50
Grain is the ________ level of data stored in the warehouse.
29 / 50
Focusing on data warehouse delivery only often end up _________.
30 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
31 / 50
B-Tree is used as an index to provide access to records
32 / 50
Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
33 / 50
Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
34 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
35 / 50
36 / 50
Transactional fact tables do not have records for events that do not occur. These are called
37 / 50
Suppose the amount of data recorded in an organization is doubled every year. This increase is __________ .
38 / 50
_________ breaks a table into multiple tables based upon common column values.
39 / 50
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
40 / 50
For a DWH project, the key requirement are ________ and product experience.
41 / 50
Data mining is a/an ______ approach , where browsing through data using mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
42 / 50
Which statement is true for De-Normalization?
43 / 50
________ is the technique in which existing heterogeneous segments are reshuffled, relocated into homogeneous segments.
44 / 50
During the application specification activity, we also must give consideration to the organization of the applications.
45 / 50
The _________ is only a small part in realizing the true business value buried within the mountain of data collected and stored within organizations business systems and operational databases.
46 / 50
The input to the data warehouse can come from OLTP or transactional system but not from other third party database.
47 / 50
Investing years in architecture and forgetting the primary purpose of solving business problems, results in inefficient application. This is the example of _________ mistake.
48 / 50
49 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
50 / 50
The STAR schema used for data design is a __________ consisting of fact and dimension tables. :
Your score is
The average score is 0%
Restart quiz