CS614-Midterm
1 / 50
If „M‟ rows from table-A match the conditions in the query then table-B is accessed „M‟ times. Suppose table-B has an index on the join column. If „a‟ I/Os are required to read the data block for each scan and „b‟ I/Os for each data block then the total cost of accessing table-B is _____________ logical I/Os approximately.
2 / 50
A dense index, if fits into memory, costs only ______ disk I/O access to locate a record by given key.
3 / 50
The need to synchronize data upon update is called
4 / 50
________ is the technique in which existing heterogeneous segments are reshuffled, relocated into homogeneous segments.
5 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
6 / 50
The goal of ______is to look at as few block as possible to find the matching records.
7 / 50
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
8 / 50
_______________, if too big and does not fit into memory, will be expensive when used to find a record by given key.
9 / 50
The goal of star schema design is to simplify ________
10 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
11 / 50
Horizontal splitting breaks a table into multiple tables based upon_______
12 / 50
Collapsing tables can be done on the ___________ relationships
13 / 50
Analytical processing uses ____________ , instead of record level access.
14 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
15 / 50
_____________, if fits into memory , costs only one disk I/O access to locate a record by given key.
16 / 50
Data mining evolve as mechanism to cater the limitations of _____ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc...
17 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
18 / 50
19 / 50
Change Data Capture is one of the challenging technical issues in _____________
20 / 50
Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
21 / 50
Naturally Evolving architecture occurred when an organization had a _______ approach to handling the whole process of hardware and software architecture.
22 / 50
De-Normalization normally speeds up
23 / 50
_____ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limitedcapability to provide decision support and analysis.
24 / 50
In DWH project, it is assured that ___________ environment is similar to the production environment
25 / 50
Ad-hoc access means to run such queries which are known already.
26 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of computation.
27 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
28 / 50
________ gives total view of an organization
29 / 50
For a smooth DWH implementation we must be a technologist.
30 / 50
Data mining is a/an __________ approach, where browsing through data using data mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
31 / 50
32 / 50
A single database, couldn‟t serve both operational high performance transaction processing and DSS, analytical processing, all at the same time.
33 / 50
A ________ dimension is a collection of random transactional codes, flags and/text attributes that are unrelated to any particular dimension. The ______ dimension is simply a structure that provides a convenient place to store the ______ attributes.
34 / 50
During the application specification activity, we also must give consideration to the organization of the applications.
35 / 50
Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
36 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
37 / 50
Many data warehouse project teams waste enormous amounts of time searching in vain for a _______.
38 / 50
The divide & conquer cube partitioning approach helps alleviate the ____________ limitations of MOLAP implementation.
39 / 50
Grain is the ________ level of data stored in the warehouse.
40 / 50
Normalization effects performance
41 / 50
It is observed that every year the amount of data recorded in an organization :
42 / 50
For a relation to be in 4NF it must be:-
43 / 50
All data is ______________ of something real. I An Abstraction II A Representation Which of the following option is true?
44 / 50
____________ in agriculture extension is that pest population beyond which the benefit of spraying outweighs its cost.
45 / 50
The input to the data warehouse can come from OLTP or transactional system but not from other third party database.
46 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
47 / 50
_____________ is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records.
48 / 50
in agriculture extension is that pest population beyond which the benefit of spraying outweighs levels
49 / 50
Companies collect and record their own operational data, but at the same time they also use reference data obtained from _______ sources such as codes, prices etc.
50 / 50
The key idea behind ___________ is to take a big task and break it into subtasks that can be processed concurrently on a stream of data inputs in multiple, overlapping stages of execution.
Your score is
The average score is 0%
Restart quiz