CS614-Midterm
1 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
2 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
3 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
4 / 50
Analytical processing uses ____________, instead of record-level access.
5 / 50
To judge effectiveness we perform data profiling twice.
6 / 50
_______________, if it fits into memory, costs only one disk I/O access to locate a record by a given key.
7 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
8 / 50
Different techniques are available to measure or quantify similarity or dissimilarity. Which of the following options names the available techniques?
9 / 50
As opposed to the outcome of classification, estimation deals with a ____________ valued outcome.
10 / 50
Data Transformation Services (DTS) provide a set of _____ that lets you extract, transform, and consolidate data from disparate sources into single or multiple destinations supported by DTS connectivity.
11 / 50
If 'M' rows from table-A match the conditions in the query then table-B is accessed 'M' times. Suppose table-B has an index on the join column. If 'a' I/Os are required to read the data block for each scan and 'b' I/Os for each data block then the total cost of accessing table-B is approximately _____________ logical I/Os.
12 / 50
_________ breaks a table into multiple tables based upon common column values.
13 / 50
Transactional fact tables do not have records for events that do not occur. These are called
14 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
15 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
16 / 50
17 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
18 / 50
19 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
20 / 50
Focusing only on data warehouse delivery often ends up _________.
21 / 50
22 / 50
The goal of ______ is to look at as few blocks as possible to find the matching records.
23 / 50
Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
24 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of computation.
25 / 50
Data mining evolved as a mechanism to cater for the limitations of _____ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.
26 / 50
DSS queries do not involve a primary key.
27 / 50
Ad-hoc access means running queries that are already known.
28 / 50
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
29 / 50
People who design and build the data warehouse must be capable of working across the organization at all levels.
30 / 50
Rearranging the grouping of source data, delivering it to the destination database, and ensuring the quality of data are crucial to the process of loading the data warehouse. Data ____________ is vitally important to the overall health of a warehouse project.
1. Cleansing
2. Cleaning
3. Scrubbing
Which of the following options is true?
31 / 50
A node of a B-Tree is stored in a memory block, and traversing a B-Tree involves ______ page faults.
32 / 50
A virtual cube is used to query two similar cubes by creating a third “virtual” cube through a join between the two cubes.
33 / 50
In a DWH project, it is ensured that the ___________ environment is similar to the production environment.
34 / 50
When performing objective assessments, companies follow a set of principles to develop metrics specific to their needs; it is hard to have a “one size fits all” approach. Which of the following statements represents the pervasive functional forms?
35 / 50
_____ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limited capability to provide decision support and analysis.
36 / 50
Data mining evolved as a mechanism to cater for the limitations of ________ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.
37 / 50
Pre-computed _______ can solve performance problems.
38 / 50
Data mining is a/an __________ approach, where browsing through data using data mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
39 / 50
The users of a data warehouse are knowledge workers; in other words, they are _______ in the organization.
40 / 50
The pre-join technique is used to avoid
41 / 50
For a given data set, to get a global view in unsupervised learning we use
42 / 50
During the application specification activity, we also must give consideration to the organization of the applications.
43 / 50
44 / 50
The goal of star schema design is to simplify ________
45 / 50
An optimized structure that is built primarily for retrieval, with update being only a secondary consideration, is
46 / 50
The technique that is used to perform these feats in data mining is called modeling, and this act of model building is something that people have been doing for a long time, certainly before the _______ of computers or data mining technology.
47 / 50
Pakistan is one of the five major ________ countries in the world.
48 / 50
49 / 50
NUMA stands for __________
50 / 50
Non uniform distribution, when the data is distributed across the processors, is called ______.