CS614-Midterm
1 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
2 / 50
The goal of ___________ is to look at as few blocks as possible to find the matching record(s).
3 / 50
The goal of star schema design is to simplify ________
4 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of the organization from:
5 / 50
Data Warehouse is about taking / collecting data from different ________ sources:
6 / 50
Pakistan is one of the five major ________ countries in the world.
7 / 50
Cube is a __________ entity containing values of a certain fact at a certain aggregation level at an intersection of a combination of dimensions.
8 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
9 / 50
Analytical processing uses ____________, instead of record level access.
10 / 50
The performance in a MOLAP cube comes from the O(1) look-up time for the array data structure.
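To make the O(1) look-up named in this statement concrete, here is a minimal sketch, assuming a dense cube stored as one flat array in row-major order. The dimension sizes and the `offset`, `put`, and `get` helpers are hypothetical illustrations and are not taken from any particular MOLAP product.

```python
# Hypothetical 3-dimensional cube: product x region x month.
DIMS = (4, 3, 12)  # assumed dimension sizes, for illustration only

# One flat array holding a pre-summarized value per cell.
cells = [0.0] * (DIMS[0] * DIMS[1] * DIMS[2])


def offset(product: int, region: int, month: int) -> int:
    """Row-major position of a cell: pure arithmetic, no searching."""
    return (product * DIMS[1] + region) * DIMS[2] + month


def put(product: int, region: int, month: int, value: float) -> None:
    """Store a value at the cell addressed by the three coordinates."""
    cells[offset(product, region, month)] = value


def get(product: int, region: int, month: int) -> float:
    """Fetch a value with one index computation and one array access."""
    return cells[offset(product, region, month)]


put(2, 1, 5, 1250.0)
print(get(2, 1, 5))  # prints 1250.0
```

The cost of `get` does not depend on how many cells the cube holds, which is the property the statement refers to. The sketch ignores sparsity, which real cubes usually have to handle.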
11 / 50
To measure or quantify similarity or dissimilarity, different techniques are available. Which of the following options gives the names of the available techniques?
12 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
13 / 50
Kimball's iterative data warehouse development approach drew on decades of experience to develop the _________.
14 / 50
The users of a data warehouse are knowledge workers; in other words, they are _______ in the organization.
15 / 50
The divide & conquer cube partitioning approach helps alleviate the ____________ limitations of MOLAP implementation.
16 / 50
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
17 / 50
If every key in the data file is represented in the index file then index is :
18 / 50
If 'M' rows from table-A match the conditions in the query, then table-B is accessed 'M' times. Suppose table-B has an index on the join column. If 'a' I/Os are required to read the index block for each scan and 'b' I/Os for each data block, then the total cost of accessing table-B is approximately _____________ logical I/Os.
19 / 50
Investing years in architecture and forgetting the primary purpose of solving business problems results in an inefficient application. This is an example of a _________ mistake.
20 / 50
During the ETL process of an organization, suppose you have data which can be transformed using any of the transformation methods. Which of the following strategies will be your choice for least complexity?
21 / 50
NUMA stands for __________
22 / 50
Data mining uses _________ algorithms to discover patterns and regularities in data.
23 / 50
Virtual cube is used to query two similar cubes by creating a third “virtual” cube by a join between two cubes.
24 / 50
Many data warehouse project teams waste enormous amounts of time searching in vain for a _______.
25 / 50
People who design and build the data warehouse must be capable of working across the organization at all levels.
26 / 50
The technique used to perform these feats in data mining is modeling, and this act of model building is something that people have been doing for a long time, certainly before the _______ of computers or data mining technology.
27 / 50
If some error occurs, execution will be terminated abnormally and all transactions will be rolled back. In this case, when we access the database, we will find it in the state it was in before the ____________.
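The all-or-nothing behaviour described in this question can be sketched as follows. This is a minimal illustration using Python's built-in `sqlite3` module rather than the tool the course material discusses; the `sales` table and the `load_batch` helper are hypothetical names chosen for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (id INTEGER PRIMARY KEY, amount REAL)")
conn.execute("INSERT INTO sales VALUES (1, 100.0)")
conn.commit()  # the state that exists before the failing run


def load_batch(connection, rows):
    """Insert every row or none of them (hypothetical helper)."""
    try:
        for row in rows:
            connection.execute("INSERT INTO sales VALUES (?, ?)", row)
        connection.commit()      # success: keep all changes
        return True
    except sqlite3.Error:
        connection.rollback()    # failure: undo everything in this run
        return False


# The second row reuses primary key 1, so the run fails part-way through.
ok = load_batch(conn, [(2, 50.0), (1, 75.0)])
rows = conn.execute("SELECT id, amount FROM sales ORDER BY id").fetchall()
print(ok, rows)  # prints False [(1, 100.0)]
```

Although the first row of the batch was inserted before the error, the rollback removes it, so the table holds only the row that was there before the run started.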
28 / 50
Data mining evolved as a mechanism to cater for the limitations of _____ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.
29 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
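The distinction this question draws between throughput and per-task time can be checked with a small calculation. The sketch below assumes an idealized linear pipeline in which every task passes through the same stages and the slowest stage sets the pace; the function names and the stage times are made up for illustration.

```python
def sequential_time(num_tasks, stage_times):
    """Total time when each task runs all stages before the next task starts."""
    return num_tasks * sum(stage_times)


def pipelined_time(num_tasks, stage_times):
    """Total time in an idealized pipeline.

    The first task needs the full sum of the stage times; every later
    task finishes one bottleneck interval after the previous one.
    """
    return sum(stage_times) + (num_tasks - 1) * max(stage_times)


stages = [2, 3, 1]   # assumed stage durations in arbitrary time units
tasks = 10

print(sequential_time(tasks, stages))  # prints 60
print(pipelined_time(tasks, stages))   # prints 33
print(sum(stages))                     # prints 6, time for a single task either way
```

Under these assumptions the batch finishes in 33 units instead of 60, while a single task still needs all 6 units to pass through the three stages.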
30 / 50
We must try to find the one access tool that will handle all the needs of the users.
31 / 50
It is observed that every year the amount of data recorded in an organization is
32 / 50
________ is the technique in which existing heterogeneous segments are reshuffled and relocated into homogeneous segments.
33 / 50
34 / 50
Normalization affects performance
35 / 50
Relational databases allow you to navigate the data in ____________ that is appropriate using the primary, foreign key structure within the data model.
36 / 50
There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
37 / 50
Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
38 / 50
Data mining derives its name from the similarities between searching for valuable business information in a large database, for example, finding linked products in gigabytes of store scanner data, and mining a mountain for a _________ of valuable ore.
39 / 50
_______________, if it is too big to fit into memory, will be expensive when used to find a record by a given key.
40 / 50
41 / 50
During the application specification activity, we also must give consideration to the organization of the applications.
42 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
43 / 50
The STAR schema used for data design is a __________ consisting of fact and dimension tables.
44 / 50
A dense index, if it fits into memory, costs only ______ disk I/O access to locate a record by a given key.
45 / 50
Grain is the ________ level of data stored in the warehouse.
46 / 50
47 / 50
A data warehouse implementation without an OLAP tool is always possible.
48 / 50
Slice and Dice is changing the view of the data.
49 / 50
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
50 / 50
De-Normalization normally speeds up