CS614-Midterm
1 / 50
_____________, if it fits into memory, costs only one disk I/O access to locate a record by a given key.
2 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of the organization from:
3 / 50
The goal of ______ is to look at as few blocks as possible to find the matching records.
4 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
5 / 50
Rearranging the grouping of source data, delivering it to the destination database, and ensuring the quality of data are crucial to the process of loading the data warehouse. Data ____________ is vitally important to the overall health of a warehouse project.
1. Cleansing
2. Cleaning
3. Scrubbing
Which of the following options is true?
6 / 50
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
7 / 50
For a given data set, to get a global view in unsupervised learning, we use
8 / 50
During the application specification activity, we also must give consideration to the organization of the applications.
9 / 50
Naturally Evolving architecture occurred when an organization had a _______ approach to handling the whole process of hardware and software architecture.
10 / 50
To judge effectiveness we perform data profiling twice.
11 / 50
We must try to find the one access tool that will handle all the needs of the users.
12 / 50
The performance in a MOLAP cube comes from the O(1) look-up time for the array data structure.
13 / 50
People who design and build the data warehouse must be capable of working across the organization at all levels.
14 / 50
Analytical processing uses ____________, instead of record level access.
15 / 50
The goal of ___________ is to look at as few blocks as possible to find the matching record(s).
16 / 50
Pakistan is one of the five major ________ countries in the world.
17 / 50
Data mining is a/an __________ approach, where browsing through data using data mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
18 / 50
When performing objective assessments, companies follow a set of principles to develop metrics specific to their needs; it is hard to have a “one size fits all” approach. Which of the following statements represents the pervasive functional forms?
19 / 50
For a DWH project, the key requirements are ________ and product experience.
20 / 50
Data mining evolved as a mechanism to cater for the limitations of ________ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.
21 / 50
_________ breaks a table into multiple tables based upon common column values.
22 / 50
5 million bales.
23 / 50
Data mining is a/an ______ approach, where browsing through data using mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
24 / 50
Horizontal splitting breaks a table into multiple tables based upon _______
25 / 50
The STAR schema used for data design is a __________ consisting of fact and dimension tables.
26 / 50
27 / 50
In a _________ system, the contents change with time.
28 / 50
NUMA stands for __________
29 / 50
It is observed that every year the amount of data recorded in an organization is
30 / 50
The technique that is used to perform these feats in data mining is called modeling, and this act of model building is something that people have been doing for a long time, certainly before the _______ of computers or data mining technology.
31 / 50
If 'M' rows from table-A match the conditions in the query, then table-B is accessed 'M' times. Suppose table-B has an index on the join column. If 'a' I/Os are required to read the index blocks for each scan and 'b' I/Os for each data block, then the total cost of accessing table-B is approximately _____________ logical I/Os.
32 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
33 / 50
The need to synchronize data upon update is called
34 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
35 / 50
_______________, if it fits into memory, costs only one disk I/O access to locate a record by a given key.
36 / 50
To identify the __________________ required, we need to perform data profiling.
37 / 50
_______ is an application of information and data.
38 / 50
Pre-computed _______ can solve performance problems.
39 / 50
To measure or quantify similarity or dissimilarity, different techniques are available. Which of the following options represents the names of the available techniques?
40 / 50
41 / 50
42 / 50
________ gives a total view of an organization.
43 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
44 / 50
Non-uniform distribution, when the data is distributed across the processors, is called ______.
45 / 50
Pre-join technique is used to avoid
46 / 50
If some error occurs, execution will be terminated abnormally and all transactions will be rolled back. In this case, when we access the database, we will find it in the state it was in before the ____________.
47 / 50
There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
48 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
49 / 50
Focusing only on data warehouse delivery often ends up _________.
50 / 50
Data Transformation Services (DTS) provide a set of _____ that lets you extract, transform, and consolidate data from disparate sources into single or multiple destinations supported by DTS connectivity.