CS614-Midterm
1 / 50
The technique that is used to perform these feats in data mining is called modeling, and this act of model building is something that people have been doing for a long time, certainly before the _______ of computers or data mining technology.
2 / 50
Suppose the amount of data recorded in an organization doubles every year. This increase is __________.
3 / 50
Kimball's iterative data warehouse development approach drew on decades of experience to develop the _________.
4 / 50
Data mining derives its name from the similarities between searching for valuable business information in a large database, for example, finding linked products in gigabytes of store scanner data, and mining a mountain for a _________ of valuable ore.
5 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
6 / 50
Focusing only on data warehouse delivery often ends up _________.
7 / 50
Pakistan is one of the five major ________ countries in the world.
8 / 50
The goal of star schema design is to simplify ________
9 / 50
The performance in a MOLAP cube comes from the O(1) look-up time for the array data structure.
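The constant-time look-up mentioned in this statement can be illustrated with a minimal sketch. It assumes a dense cube flattened into a one-dimensional array in row-major order; the cube shape (4 products, 3 regions, 12 months) and the function names are hypothetical and only for illustration. The offset calculation depends on the number of dimensions, not on the number of cells, which is why the look-up is treated as O(1) for a fixed cube.

```python
def cell_offset(coords, dims):
    """Row-major offset of a cell in a dense cube stored as a flat array."""
    offset = 0
    for c, size in zip(coords, dims):
        if not 0 <= c < size:
            raise IndexError("coordinate out of range")
        offset = offset * size + c
    return offset


# Hypothetical cube: 4 products x 3 regions x 12 months, all cells zero.
dims = (4, 3, 12)
cube = [0.0] * (dims[0] * dims[1] * dims[2])

# Store one pre-aggregated value, then read it back by coordinates.
cube[cell_offset((2, 1, 5), dims)] = 150.0


def lookup(coords):
    """Fetch a cell by its coordinates without scanning any other cells."""
    return cube[cell_offset(coords, dims)]
```

No search or index traversal is involved: the coordinates are turned directly into an array position.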
10 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
11 / 50
DOLAP allows download of “cube” structures to a desktop platform with the need for shared relational or cube server.
12 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
13 / 50
Many data warehouse project teams waste enormous amounts of time searching in vain for a _______.
14 / 50
The need to synchronize data upon update is called
15 / 50
_____________ is a process which involves gathering information about a column by executing certain queries, with the intention of identifying erroneous records.
16 / 50
A dense index, if it fits into memory, costs only ______ disk I/O access to locate a record by a given key.
17 / 50
During the application specification activity, we also must give consideration to the organization of the applications.
18 / 50
If 'M' rows from table-A match the conditions in the query, then table-B is accessed 'M' times. Suppose table-B has an index on the join column. If 'a' I/Os are required to read the index block for each scan and 'b' I/Os for each data block, then the total cost of accessing table-B is approximately _____________ logical I/Os.
19 / 50
Data mining evolved as a mechanism to cater for the limitations of ________ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.
20 / 50
The divide & conquer cube partitioning approach helps alleviate the ____________ limitations of MOLAP implementation.
21 / 50
Ad-hoc access means running queries that are already known.
22 / 50
Data Transformation Services (DTS) provide a set of _____ that lets you extract, transform, and consolidate data from disparate sources into single or multiple destinations supported by DTS connectivity.
23 / 50
24 / 50
People who design and build the data warehouse must be capable of working across the organization at all levels.
25 / 50
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
26 / 50
When performing objective assessments, companies follow a set of principles to develop metrics specific to their needs; it is hard to have a “one size fits all” approach. Which of the following statements represents the pervasive functional forms?
27 / 50
_______________, if it is too big and does not fit into memory, will be expensive when used to find a record by a given key.
28 / 50
The key idea behind ___________ is to take a big task and break it into subtasks that can be processed concurrently on a stream of data inputs in multiple, overlapping stages of execution.
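The staged execution described in this question can be sketched as follows. This is a minimal illustration, not a database implementation: it assumes one thread per stage connected by FIFO queues, and the names `stage`, `run_stages` and the three-step job (parse, scale, format) are hypothetical. While a later stage works on one item, an earlier stage can already start on the next, so the stages overlap in time.

```python
import queue
import threading

_DONE = object()  # sentinel marking the end of the input stream


def stage(func, inbox, outbox):
    """Apply func to each item from inbox and pass the result to outbox."""
    while True:
        item = inbox.get()
        if item is _DONE:
            outbox.put(_DONE)  # tell the next stage that the stream has ended
            return
        outbox.put(func(item))


def run_stages(items, funcs):
    """Push items through funcs, one thread per stage, preserving input order."""
    queues = [queue.Queue() for _ in range(len(funcs) + 1)]
    threads = [
        threading.Thread(target=stage, args=(f, queues[i], queues[i + 1]))
        for i, f in enumerate(funcs)
    ]
    for t in threads:
        t.start()
    for item in items:
        queues[0].put(item)
    queues[0].put(_DONE)

    results = []
    while True:
        out = queues[-1].get()
        if out is _DONE:
            break
        results.append(out)
    for t in threads:
        t.join()
    return results


# Hypothetical three-stage job: parse text, scale the value, format a row.
results = run_stages(
    ["1", "2", "3"],
    [int, lambda x: x * 10, lambda x: f"row={x}"],
)
```

Because each stage is a single thread reading from a FIFO queue, the output order matches the input order.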
29 / 50
NUMA stands for __________
30 / 50
Horizontal splitting breaks a table into multiple tables based upon _______
31 / 50
Data mining uses _________ algorithms to discover patterns and regularities in data.
32 / 50
It is observed that every year the amount of data recorded in an organization:
33 / 50
DSS queries do not involve a primary key
34 / 50
To measure or quantify similarity or dissimilarity, different techniques are available. Which of the following options represents the names of the available techniques?
35 / 50
If w is the window size and n is the size of the data set, then the complexity of the merging phase in the BSN method is ___________
36 / 50
37 / 50
A data warehouse is about taking/collecting data from different ________ sources:
38 / 50
An optimized structure that is built primarily for retrieval, with update being only a secondary consideration, is
39 / 50
Normalization affects performance
40 / 50
During the ETL process of an organization, suppose you have data which can be transformed using any of the transformation methods. Which of the following strategies will be your choice for least complexity?
41 / 50
Analytical processing uses ____________, instead of record level access.
42 / 50
43 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of the organization from:
44 / 50
Transactional fact tables do not have records for events that do not occur. These are called
45 / 50
_______________, if it fits into memory, costs only one disk I/O access to locate a record by a given key.
46 / 50
Slice and Dice is changing the view of the data.
47 / 50
______ is a class of Decision Support Environment.
48 / 50
49 / 50
The goal of ___________ is to look at as few blocks as possible to find the matching record(s).
50 / 50
Which statement is true for De-Normalization?