CS614-Midterm
1 / 50
In a traditional MIS system, there is an almost linear sequence of queries.
2 / 50
The goal of star schema design is to simplify ________
3 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
4 / 50
The need to synchronize data upon update is called
5 / 50
________ is the technique in which existing heterogeneous segments are reshuffled, relocated into homogeneous segments.
6 / 50
It is observed that every year the amount of data recorded in an organization :
7 / 50
The technique that is used to perform these feats in data mining modeling, and this act of model building is something that people have been doing for long time, certainly before the _______ of computers or data mining technology.
8 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
9 / 50
Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
10 / 50
Collapsing tables can be done on the ___________ relationships
11 / 50
_____ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limitedcapability to provide decision support and analysis.
12 / 50
Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
13 / 50
Slice and Dice is changing the view of the data.
14 / 50
_____modeling technique is more appropriate for data warehouses.
15 / 50
_____________, if fits into memory , costs only one disk I/O access to locate a record by given key.
16 / 50
The automated, prospective analyses offered by data mining move beyond the analysis of past events provided by respective tools typical of ___________.
17 / 50
It is observed that every year the amount of data recorded in anorganization is
18 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
19 / 50
For a given data set, to get a global view in un-supervised learning we use
20 / 50
_________ breaks a table into multiple tables based upon common column values.
21 / 50
There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
22 / 50
Investing years in architecture and forgetting the primary purpose of solving business problems, results in inefficient application. This is the example of _________ mistake.
23 / 50
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
24 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of computation.
25 / 50
De-Normalization normally speeds up
26 / 50
If w is the window size and n is the size of data set, then the complexity of merging phase in BSN method is___________
27 / 50
________ gives total view of an organization
28 / 50
Pre-join technique is used to avoid
29 / 50
Normalization effects performance
30 / 50
31 / 50
Pakistan is one of the five major ________ countries in the world.
32 / 50
30.Data Warehouse is about taking / colleting data from different ________ sources:
33 / 50
In _________ system, the contents change with time. :
34 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
35 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
36 / 50
We must try to find the one access tool that will handle all the needs of their users.
37 / 50
_____________ is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records.
38 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
39 / 50
_______ is an application of information and data.
40 / 50
Node of a B-Tree is stored in memory block and traversing a B-Tree involves ______ page faults.
41 / 50
42 / 50
43 / 50
For a DWH project, the key requirement are ________ and product experience.
44 / 50
Pre-computed _______ can solve performance problems
45 / 50
If „M‟ rows from table-A match the conditions in the query then table-B is accessed „M‟ times. Suppose table-B has an index on the join column. If „a‟ I/Os are required to read the data block for each scan and „b‟ I/Os for each data block then the total cost of accessing table-B is _____________ logical I/Os approximately.
46 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
47 / 50
Data mining uses _________ algorithms to discover patterns and regularities in data.
48 / 50
49 / 50
NUMA stands for __________
50 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
Your score is
The average score is 0%
Restart quiz