CS614-Midterm
1 / 50
The users of data warehouse are knowledge workers in other words they are _______in the organization.
2 / 50
For a DWH project, the key requirement are ________ and product experience.
3 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
4 / 50
Virtual cube is used to query two similar cubes by creating a third “virtual” cube by a join between two cubes.
5 / 50
Slice and Dice is changing the view of the data.
6 / 50
in agriculture extension is that pest population beyond which the benefit of spraying outweighs levels
7 / 50
_____________, if fits into memory , costs only one disk I/O access to locate a record by given key.
8 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
9 / 50
Analytical processing uses ____________ , instead of record level access.
10 / 50
DSS queries do not involve a primary key
11 / 50
________ gives total view of an organization
12 / 50
Transactional fact tables do not have records for events that do not occur. These are called
13 / 50
To identify the __________________ required we need to perform data profiling
14 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
15 / 50
Data mining uses _________ algorithms to discover patterns and regularities in data.
16 / 50
: The goal of ___________ is to look at as few blocks as possible to find the matching records(s).
17 / 50
The performance in a MOLAP cube comes from the O(1) look-up time for the array data structure.
18 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
19 / 50
For a smooth DWH implementation we must be a technologist.
20 / 50
21 / 50
Horizontal splitting breaks a table into multiple tables based upon_______
22 / 50
Non uniform distribution, when the data is distributed across the processors, is called ______.
23 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
24 / 50
A single database, couldn‟t serve both operational high performance transaction processing and DSS, analytical processing, all at the same time.
25 / 50
_____ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limitedcapability to provide decision support and analysis.
26 / 50
Pre-computed _______ can solve performance problems
27 / 50
_____________ is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records.
28 / 50
Change Data Capture is one of the challenging technical issues in _____________
29 / 50
_______________, if fits into memory, costs only one disk I/O access to locate a record by given key.
30 / 50
31 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of computation.
32 / 50
It is observed that every year the amount of data recorded in an organization :
33 / 50
A ________ dimension is a collection of random transactional codes, flags and/text attributes that are unrelated to any particular dimension. The ______ dimension is simply a structure that provides a convenient place to store the ______ attributes.
34 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
35 / 50
The automated, prospective analyses offered by data mining move beyond the analysis of past events provided by respective tools typical of ___________.
36 / 50
A data warehouse implementation without an OLAP tool is always possible.
37 / 50
_________ breaks a table into multiple tables based upon common column values.
38 / 50
Which statement is true for De-Normalization?
39 / 50
If „M‟ rows from table-A match the conditions in the query then table-B is accessed „M‟ times. Suppose table-B has an index on the join column. If „a‟ I/Os are required to read the data block for each scan and „b‟ I/Os for each data block then the total cost of accessing table-B is _____________ logical I/Os approximately.
40 / 50
41 / 50
In DWH project, it is assured that ___________ environment is similar to the production environment
42 / 50
De-Normalization normally speeds up
43 / 50
If some error occurs, execution will be terminated abnormally and all transactions will be rolled back. In this case when we will access the database we will find it in the state that was before the ____________.
44 / 50
Data mining derives its name from the similarities between searching for valuable business information in a large database, for example, finding linked products in gigabytes of store scanner data, and mining a mountain for a _________ of valuable ore.
45 / 50
It is observed that every year the amount of data recorded in anorganization is
46 / 50
Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
47 / 50
The input to the data warehouse can come from OLTP or transactional system but not from other third party database.
48 / 50
_______ is an application of information and data.
49 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
50 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
Your score is
The average score is 0%
Restart quiz