CS614-Midterm
1 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
2 / 50
: The goal of ___________ is to look at as few blocks as possible to find the matching records(s).
3 / 50
To identify the __________________ required we need to perform data profiling
4 / 50
For a given data set, to get a global view in un-supervised learning we use
5 / 50
The _________ is only a small part in realizing the true business value buried within the mountain of data collected and stored within organizations business systems and operational databases.
6 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
7 / 50
5 million bales.
8 / 50
The growth of master files and magnetic tapes exploded around the mid- _______. :
9 / 50
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
10 / 50
_____________ is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records.
11 / 50
Many data warehouse project teams waste enormous amounts of time searching in vain for a _______.
12 / 50
_____ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limitedcapability to provide decision support and analysis.
13 / 50
Data Transformation Services (DTS) provide a set of _____ that lets you extract, transform, and consolidate data from disparate sources into single or multiple destinations supported by DTS connectivity.
14 / 50
During ETL process of an organization, suppose you have data which can be transformed using any of the transformation method. Which of the following strategy will be your choice for least complexity?
15 / 50
Horizontal splitting breaks a table into multiple tables based upon_______
16 / 50
If w is the window size and n is the size of data set, then the complexity of merging phase in BSN method is___________
17 / 50
The need to synchronize data upon update is called
18 / 50
Pakistan is one of the five major ________ countries in the world.
19 / 50
It is observed that every year the amount of data recorded in anorganization is
20 / 50
Pre-computed _______ can solve performance problems
21 / 50
When performing objective assessments, companies follow a set of principles to develop metrics specific to their needs, there is hard to have “one size fits all” approach. Which of the following statement represents the pervasive functional forms?
22 / 50
Grain is the ________ level of data stored in the warehouse.
23 / 50
The goal of star schema design is to simplify ________
24 / 50
Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
25 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
26 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
27 / 50
There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
28 / 50
B-Tree is used as an index to provide access to records
29 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
30 / 50
The input to the data warehouse can come from OLTP or transactional system but not from other third party database.
31 / 50
For a relation to be in 4NF it must be:-
32 / 50
Data mining is a/an ______ approach , where browsing through data using mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
33 / 50
34 / 50
DSS queries do not involve a primary key
35 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
36 / 50
The goal of ______is to look at as few block as possible to find the matching records.
37 / 50
in agriculture extension is that pest population beyond which the benefit of spraying outweighs levels
38 / 50
Focusing on data warehouse delivery only often end up _________.
39 / 50
In a traditional MIS system, there is an almost linear sequence of queries.
40 / 50
The Kimball s iterative data warehouse development approach drew on decades of experience to develop the _________.
41 / 50
30.Data Warehouse is about taking / colleting data from different ________ sources:
42 / 50
NUMA stands for __________
43 / 50
If „M‟ rows from table-A match the conditions in the query then table-B is accessed „M‟ times. Suppose table-B has an index on the join column. If „a‟ I/Os are required to read the data block for each scan and „b‟ I/Os for each data block then the total cost of accessing table-B is _____________ logical I/Os approximately.
44 / 50
De-Normalization normally speeds up
45 / 50
_____modeling technique is more appropriate for data warehouses.
46 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
47 / 50
A dense index, if fits into memory, costs only ______ disk I/O access to locate a record by given key.
48 / 50
For a DWH project, the key requirement are ________ and product experience.
49 / 50
Companies collect and record their own operational data, but at the same time they also use reference data obtained from _______ sources such as codes, prices etc.
50 / 50
A single database, couldn‟t serve both operational high performance transaction processing and DSS, analytical processing, all at the same time.
Your score is
The average score is 0%
Restart quiz