CS614-Midterm
1 / 50
During the application specification activity, we also must give consideration to the organization of the applications.
2 / 50
30.Data Warehouse is about taking / colleting data from different ________ sources:
3 / 50
Pre-computed _______ can solve performance problems
4 / 50
_____________, if fits into memory , costs only one disk I/O access to locate a record by given key.
5 / 50
The growth of master files and magnetic tapes exploded around the mid- _______. :
6 / 50
To judge effectiveness we perform data profiling twice.
7 / 50
Focusing on data warehouse delivery only often end up _________.
8 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
9 / 50
Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
10 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
11 / 50
The _________ is only a small part in realizing the true business value buried within the mountain of data collected and stored within organizations business systems and operational databases.
12 / 50
The technique that is used to perform these feats in data mining modeling, and this act of model building is something that people have been doing for long time, certainly before the _______ of computers or data mining technology.
13 / 50
: An optimized structure which is built primarily for retrieval, with update being only a secondary consideration is
14 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
15 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
16 / 50
17 / 50
To identify the __________________ required we need to perform data profiling
18 / 50
Data mining is a/an ______ approach , where browsing through data using mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
19 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
20 / 50
The STAR schema used for data design is a __________ consisting of fact and dimension tables. :
21 / 50
22 / 50
Which statement is true for De-Normalization?
23 / 50
The input to the data warehouse can come from OLTP or transactional system but not from other third party database.
24 / 50
Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
25 / 50
Slice and Dice is changing the view of the data.
26 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
27 / 50
28 / 50
29 / 50
It is observed that every year the amount of data recorded in an organization :
30 / 50
The need to synchronize data upon update is called
31 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
32 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
33 / 50
For a DWH project, the key requirement are ________ and product experience.
34 / 50
There are many variants of the traditional nested-loop join, if there is an index is exploited, then it is called……
35 / 50
36 / 50
Data Transformation Services (DTS) provide a set of _____ that lets you extract, transform, and consolidate data from disparate sources into single or multiple destinations supported by DTS connectivity.
37 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
38 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
39 / 50
Normalization effects performance
40 / 50
Many data warehouse project teams waste enormous amounts of time searching in vain for a _______.
41 / 50
For a given data set, to get a global view in un-supervised learning we use
42 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
43 / 50
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
44 / 50
The Kimball s iterative data warehouse development approach drew on decades of experience to develop the _________.
45 / 50
When performing objective assessments, companies follow a set of principles to develop metrics specific to their needs, there is hard to have “one size fits all” approach. Which of the following statement represents the pervasive functional forms?
46 / 50
47 / 50
De-Normalization normally speeds up
48 / 50
_________ breaks a table into multiple tables based upon common column values.
49 / 50
_____modeling technique is more appropriate for data warehouses.
50 / 50
A data warehouse implementation without an OLAP tool is always possible.
Your score is
The average score is 0%
Restart quiz