CS614-Midterm
1 / 50
Pre-computed _______ can solve performance problems
2 / 50
NUMA stands for __________
3 / 50
Suppose the amount of data recorded in an organization is doubled every year. This increase is __________ .
4 / 50
_____________, if fits into memory , costs only one disk I/O access to locate a record by given key.
5 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
6 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
7 / 50
Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
8 / 50
Cube is a __________ entity containing values of a certain fact at a certain aggregation level at an intersection of a combination of dimensions.
9 / 50
A ________ dimension is a collection of random transactional codes, flags and/text attributes that are unrelated to any particular dimension. The ______ dimension is simply a structure that provides a convenient place to store the ______ attributes.
10 / 50
Virtual cube is used to query two similar cubes by creating a third “virtual” cube by a join between two cubes.
11 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
12 / 50
: An optimized structure which is built primarily for retrieval, with update being only a secondary consideration is
13 / 50
To identify the __________________ required we need to perform data profiling
14 / 50
We must try to find the one access tool that will handle all the needs of their users.
15 / 50
Rearranging the grouping of source data, delivering it to the destination database, and ensuring the quality of data are crucial to the process of loading the data warehouse. Data ____________ is vitally important to the overall health of a warehouse project. 1. Cleansing 2. Cleaning 3. Scrubbing Which of the following options is true?
16 / 50
_______________, if fits into memory, costs only one disk I/O access to locate a record by given key.
17 / 50
_________ breaks a table into multiple tables based upon common column values.
18 / 50
De-Normalization normally speeds up
19 / 50
Which statement is true for De-Normalization?
20 / 50
The input to the data warehouse can come from OLTP or transactional system but not from other third party database.
21 / 50
Non uniform distribution, when the data is distributed across the processors, is called ______.
22 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
23 / 50
Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
24 / 50
25 / 50
The technique that is used to perform these feats in data mining modeling, and this act of model building is something that people have been doing for long time, certainly before the _______ of computers or data mining technology.
26 / 50
During ETL process of an organization, suppose you have data which can be transformed using any of the transformation method. Which of the following strategy will be your choice for least complexity?
27 / 50
The users of data warehouse are knowledge workers in other words they are _______in the organization.
28 / 50
The STAR schema used for data design is a __________ consisting of fact and dimension tables. :
29 / 50
For a DWH project, the key requirement are ________ and product experience.
30 / 50
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
31 / 50
In a traditional MIS system, there is an almost linear sequence of queries.
32 / 50
In DWH project, it is assured that ___________ environment is similar to the production environment
33 / 50
Transactional fact tables do not have records for events that do not occur. These are called
34 / 50
It is observed that every year the amount of data recorded in an organization :
35 / 50
There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
36 / 50
DSS queries do not involve a primary key
37 / 50
The growth of master files and magnetic tapes exploded around the mid- _______. :
38 / 50
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
39 / 50
40 / 50
There are many variants of the traditional nested-loop join, if there is an index is exploited, then it is called……
41 / 50
Ad-hoc access means to run such queries which are known already.
42 / 50
43 / 50
Analytical processing uses ____________ , instead of record level access.
44 / 50
45 / 50
____________ in agriculture extension is that pest population beyond which the benefit of spraying outweighs its cost.
46 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
47 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
48 / 50
_____modeling technique is more appropriate for data warehouses.
49 / 50
Many data warehouse project teams waste enormous amounts of time searching in vain for a _______.
50 / 50
Collapsing tables can be done on the ___________ relationships
Your score is
The average score is 0%
Restart quiz