CS614-Midterm
1 / 50
It is observed that every year the amount of data recorded in anorganization is
2 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
3 / 50
To judge effectiveness we perform data profiling twice.
4 / 50
As apposed to the out come of classification, estimation deal with ____________ valued outcome.
5 / 50
The technique that is used to perform these feats in data mining modeling, and this act of model building is something that people have been doing for long time, certainly before the _______ of computers or data mining technology.
6 / 50
Taken jointly, the extract programs or naturally evolving systems formed a spider web, also known as
7 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
8 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
9 / 50
All data is ______________ of something real. I An Abstraction II A Representation Which of the following option is true?
10 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
11 / 50
Data mining uses _________ algorithms to discover patterns and regularities in data.
12 / 50
NUMA stands for __________
13 / 50
_____ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limitedcapability to provide decision support and analysis.
14 / 50
The users of data warehouse are knowledge workers in other words they are _______in the organization.
15 / 50
The need to synchronize data upon update is called
16 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
17 / 50
in agriculture extension is that pest population beyond which the benefit of spraying outweighs levels
18 / 50
A data warehouse implementation without an OLAP tool is always possible.
19 / 50
Transactional fact tables do not have records for events that do not occur. These are called
20 / 50
Normalization effects performance
21 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
22 / 50
23 / 50
For a given data set, to get a global view in un-supervised learning we use
24 / 50
25 / 50
The key idea behind ___________ is to take a big task and break it into subtasks that can be processed concurrently on a stream of data inputs in multiple, overlapping stages of execution.
26 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
27 / 50
28 / 50
For a DWH project, the key requirement are ________ and product experience.
29 / 50
In DWH project, it is assured that ___________ environment is similar to the production environment
30 / 50
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
31 / 50
Rearranging the grouping of source data, delivering it to the destination database, and ensuring the quality of data are crucial to the process of loading the data warehouse. Data ____________ is vitally important to the overall health of a warehouse project. 1. Cleansing 2. Cleaning 3. Scrubbing Which of the following options is true?
32 / 50
Horizontal splitting breaks a table into multiple tables based upon_______
33 / 50
For a relation to be in 4NF it must be:-
34 / 50
In _________ system, the contents change with time. :
35 / 50
In a traditional MIS system, there is an almost linear sequence of queries.
36 / 50
Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
37 / 50
Ad-hoc access means to run such queries which are known already.
38 / 50
Virtual cube is used to query two similar cubes by creating a third “virtual” cube by a join between two cubes.
39 / 50
There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
40 / 50
Which statement is true for De-Normalization?
41 / 50
A dense index, if fits into memory, costs only ______ disk I/O access to locate a record by given key.
42 / 50
The Kimball s iterative data warehouse development approach drew on decades of experience to develop the _________.
43 / 50
: The goal of ___________ is to look at as few blocks as possible to find the matching records(s).
44 / 50
: An optimized structure which is built primarily for retrieval, with update being only a secondary consideration is
45 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
46 / 50
DSS queries do not involve a primary key
47 / 50
_______________, if too big and does not fit into memory, will be expensive when used to find a record by given key.
48 / 50
Pakistan is one of the five major ________ countries in the world.
49 / 50
The goal of star schema design is to simplify ________
50 / 50
Your score is
The average score is 0%
Restart quiz