CS614-Midterm
1 / 50
Data mining uses _________ algorithms to discover patterns and regularities in data.
2 / 50
Suppose the amount of data recorded in an organization is doubled every year. This increase is __________ .
3 / 50
_____________ is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records.
4 / 50
The _________ is only a small part in realizing the true business value buried within the mountain of data collected and stored within organizations business systems and operational databases.
5 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
6 / 50
Grain is the ________ level of data stored in the warehouse.
7 / 50
Node of a B-Tree is stored in memory block and traversing a B-Tree involves ______ page faults.
8 / 50
_____modeling technique is more appropriate for data warehouses.
9 / 50
There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
10 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
11 / 50
Data Transformation Services (DTS) provide a set of _____ that lets you extract, transform, and consolidate data from disparate sources into single or multiple destinations supported by DTS connectivity.
12 / 50
The automated, prospective analyses offered by data mining move beyond the analysis of past events provided by respective tools typical of ___________.
13 / 50
NUMA stands for __________
14 / 50
DOLAP allows download of “cube” structures to a desktop platform with the need for shared relational or cube server.
15 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
16 / 50
Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
17 / 50
_____ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limitedcapability to provide decision support and analysis.
18 / 50
19 / 50
Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
20 / 50
The goal of star schema design is to simplify ________
21 / 50
To judge effectiveness we perform data profiling twice.
22 / 50
: An optimized structure which is built primarily for retrieval, with update being only a secondary consideration is
23 / 50
.______ is class of Decision Support Environment.
24 / 50
Pakistan is one of the five major ________ countries in the world.
25 / 50
The goal of ______is to look at as few block as possible to find the matching records.
26 / 50
Data mining is a/an ______ approach , where browsing through data using mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
27 / 50
28 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
29 / 50
Ad-hoc access means to run such queries which are known already.
30 / 50
A single database, couldn‟t serve both operational high performance transaction processing and DSS, analytical processing, all at the same time.
31 / 50
If every key in the data file is represented in the index file then index is :
32 / 50
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
33 / 50
It is observed that every year the amount of data recorded in anorganization is
34 / 50
The Kimball s iterative data warehouse development approach drew on decades of experience to develop the _________.
35 / 50
36 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of computation.
37 / 50
Relational databases allow you to navigate the data in ____________ that is appropriate using the primary, foreign key structure within the data model.
38 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
39 / 50
Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
40 / 50
A ________ dimension is a collection of random transactional codes, flags and/text attributes that are unrelated to any particular dimension. The ______ dimension is simply a structure that provides a convenient place to store the ______ attributes.
41 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
42 / 50
B-Tree is used as an index to provide access to records
43 / 50
_______________, if fits into memory, costs only one disk I/O access to locate a record by given key.
44 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
45 / 50
________ is the technique in which existing heterogeneous segments are reshuffled, relocated into homogeneous segments.
46 / 50
To identify the __________________ required we need to perform data profiling
47 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
48 / 50
49 / 50
In DWH project, it is assured that ___________ environment is similar to the production environment
50 / 50
For a given data set, to get a global view in un-supervised learning we use
Your score is
The average score is 0%
Restart quiz