CS614-Midterm
1 / 50
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
2 / 50
Companies collect and record their own operational data, but at the same time they also use reference data obtained from _______ sources such as codes, prices etc.
3 / 50
The STAR schema used for data design is a __________ consisting of fact and dimension tables.
4 / 50
For a DWH project, the key requirements are ________ and product experience.
5 / 50
Collapsing tables can be done on the ___________ relationships
6 / 50
B-Tree is used as an index to provide access to records
7 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
8 / 50
Change Data Capture is one of the challenging technical issues in _____________
9 / 50
_________ breaks a table into multiple tables based upon common column values.
10 / 50
In a _________ system, the contents change with time.
11 / 50
Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
12 / 50
When performing objective assessments, companies follow a set of principles to develop metrics specific to their needs; it is hard to have a “one size fits all” approach. Which of the following statements represents the pervasive functional forms?
13 / 50
Pakistan is one of the five major ________ countries in the world.
14 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of computation.
15 / 50
The growth of master files and magnetic tapes exploded around the mid- _______.
16 / 50
Suppose the amount of data recorded in an organization is doubled every year. This increase is __________.
17 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
18 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
19 / 50
In a DWH project, it is assured that the ___________ environment is similar to the production environment.
20 / 50
____________ in agriculture extension is that pest population beyond which the benefit of spraying outweighs its cost.
21 / 50
If w is the window size and n is the size of the data set, then the complexity of the merging phase in the BSN method is ___________
22 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
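The relationship between the sequential portion of a program and its scalability is commonly expressed through Amdahl's law, which the quiz does not name; treat its use here as an assumption. The following is a minimal sketch, with the hypothetical helper `amdahl_speedup` and illustrative fractions and processor count chosen only for demonstration.

```python
def amdahl_speedup(sequential_fraction: float, processors: int) -> float:
    """Ideal speedup under Amdahl's law (an assumed model, not stated in the quiz).

    sequential_fraction: share of the work that must run on one processor.
    processors: number of processors available for the remaining work.
    """
    if not 0.0 <= sequential_fraction <= 1.0:
        raise ValueError("sequential_fraction must be between 0 and 1")
    if processors < 1:
        raise ValueError("processors must be at least 1")
    parallel_fraction = 1.0 - sequential_fraction
    # The sequential part takes its full time; the parallel part is divided
    # evenly among the processors.
    return 1.0 / (sequential_fraction + parallel_fraction / processors)


if __name__ == "__main__":
    # Illustrative values only: compare several sequential fractions on 16 processors.
    for fraction in (0.5, 0.1, 0.01):
        print(f"sequential={fraction:.2f} speedup={amdahl_speedup(fraction, 16):.2f}")
```

Running the loop shows the speedup rising toward the processor count as the sequential fraction shrinks.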
23 / 50
During the ETL process of an organization, suppose you have data that can be transformed using any of the transformation methods. Which of the following strategies will be your choice for the least complexity?
24 / 50
Transactional fact tables do not have records for events that do not occur. These are called
25 / 50
Data mining evolved as a mechanism to cater to the limitations of ________ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.
26 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
27 / 50
_____ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limited capability to provide decision support and analysis.
28 / 50
The technique that is used to perform these feats in data mining is modeling, and this act of model building is something that people have been doing for a long time, certainly before the _______ of computers or data mining technology.
29 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
30 / 50
______ is a class of Decision Support Environment.
31 / 50
32 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
33 / 50
For a smooth DWH implementation we must be a technologist.
34 / 50
For a relation to be in 4NF it must be:
35 / 50
36 / 50
_______________, if it fits into memory, costs only one disk I/O access to locate a record by a given key.
37 / 50
De-Normalization normally speeds up
38 / 50
39 / 50
40 / 50
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
41 / 50
If 'M' rows from table-A match the conditions in the query, then table-B is accessed 'M' times. Suppose table-B has an index on the join column. If 'a' I/Os are required to read the index block for each scan and 'b' I/Os for each data block, then the total cost of accessing table-B is approximately _____________ logical I/Os.
42 / 50
DOLAP allows download of “cube” structures to a desktop platform with the need for shared relational or cube server.
43 / 50
44 / 50
45 / 50
46 / 50
Analytical processing uses ____________, instead of record level access.
47 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
48 / 50
The goal of star schema design is to simplify ________
49 / 50
_____________ is a process which involves gathering information about a column through the execution of certain queries, with the intention of identifying erroneous records.
50 / 50
Data mining evolved as a mechanism to cater to the limitations of _____ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.