CS614-Midterm
1 / 50
People who design and build the data warehouse must be capable of working across the organization at all levels.
2 / 50
Data mining evolved as a mechanism to address the limitations of _____ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.
3 / 50
Data mining is a/an ______ approach, where browsing through data using mining techniques may reveal something that might be of interest to the user as information that was previously unknown.
4 / 50
An optimized structure that is built primarily for retrieval, with update being only a secondary consideration, is ______.
5 / 50
Transactional fact tables do not have records for events that do not occur. These are called ______.
6 / 50
Suppose the amount of data recorded in an organization doubles every year. This increase is __________.
7 / 50
If some error occurs, execution will be terminated abnormally and all transactions will be rolled back. In this case, when we access the database, we will find it in the state it was in before the ____________.
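The rollback behaviour this question describes can be sketched with Python's built-in `sqlite3` module. This is only an illustration under stated assumptions: the `account` table, its `CHECK` constraint, and the amounts are hypothetical and do not come from the course material.

```python
import sqlite3

# Hypothetical in-memory database with a constraint that can be violated.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account ("
    "id INTEGER PRIMARY KEY, "
    "balance INTEGER NOT NULL CHECK (balance >= 0))"
)
conn.execute("INSERT INTO account VALUES (1, 100), (2, 50)")
conn.commit()


def balances(connection):
    """Return all (id, balance) rows in a stable order."""
    return connection.execute(
        "SELECT id, balance FROM account ORDER BY id"
    ).fetchall()


before = balances(conn)

try:
    # Used as a context manager, the connection rolls the open
    # transaction back if an exception escapes the block.
    with conn:
        conn.execute("UPDATE account SET balance = balance + 500 WHERE id = 2")
        # This statement violates the CHECK constraint and raises an error.
        conn.execute("UPDATE account SET balance = balance - 500 WHERE id = 1")
except sqlite3.IntegrityError:
    pass

after = balances(conn)
print(before, after)
```

The first update succeeds on its own, but because the second one fails inside the same transaction, neither change is visible afterwards.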
8 / 50
Grain is the ________ level of data stored in the warehouse.
9 / 50
_________ breaks a table into multiple tables based upon common column values.
10 / 50
The goal of star schema design is to simplify ________.
11 / 50
During the ETL process of an organization, suppose you have data that can be transformed using any of the transformation methods. Which of the following strategies will be your choice for the least complexity?
12 / 50
Kimball's iterative data warehouse development approach drew on decades of experience to develop the _________.
13 / 50
_____________, if it fits into memory, costs only one disk I/O access to locate a record by a given key.
14 / 50
For a smooth DWH implementation we must be a technologist.
15 / 50
For a given data set, to get a global view in unsupervised learning we use ______.
16 / 50
17 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution, all the transactions will be ___________.
18 / 50
Pakistan is one of the five major ________ countries in the world.
19 / 50
Data mining is a/an __________ approach, where browsing through data using data mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
20 / 50
________ is the technique in which existing heterogeneous segments are reshuffled and relocated into homogeneous segments.
21 / 50
The _________ is only a small part in realizing the true business value buried within the mountain of data collected and stored within organizations' business systems and operational databases.
22 / 50
Ad hoc access means running queries that are already known.
23 / 50
Non-uniform distribution, when data is distributed across the processors, is called ______.
24 / 50
25 / 50
Change Data Capture is one of the challenging technical issues in _____________.
26 / 50
______ is a class of Decision Support Environment.
27 / 50
Investing years in architecture and forgetting the primary purpose of solving business problems results in an inefficient application. This is an example of a _________ mistake.
28 / 50
_______________, if it fits into memory, costs only one disk I/O access to locate a record by a given key.
29 / 50
To judge effectiveness we perform data profiling twice.
30 / 50
____________ in agriculture extension is that pest population beyond which the benefit of spraying outweighs its cost.
31 / 50
The automated, prospective analyses offered by data mining move beyond the analysis of past events provided by retrospective tools typical of ___________.
32 / 50
Data mining uses _________ algorithms to discover patterns and regularities in data.
33 / 50
5 million bales.
34 / 50
To identify the __________________ required, we need to perform data profiling.
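As a rough illustration of what profiling a column involves, the sketch below summarises one column and flags out-of-range values in another. The sample rows, the `profile_column` helper, and the 0 to 120 age range are all assumptions made up for this example, not part of the course material.

```python
from collections import Counter

# Hypothetical sample records; in practice these would come from a source table.
rows = [
    {"id": 1, "gender": "M", "age": 34},
    {"id": 2, "gender": "F", "age": 29},
    {"id": 3, "gender": "male", "age": 41},
    {"id": 4, "gender": None, "age": -5},
    {"id": 5, "gender": "F", "age": 230},
]


def profile_column(records, column):
    """Summarise one column: row count, nulls, distinct values, frequencies."""
    values = [record[column] for record in records]
    non_null = [value for value in values if value is not None]
    return {
        "rows": len(values),
        "nulls": len(values) - len(non_null),
        "distinct": len(set(non_null)),
        "frequencies": Counter(non_null),
    }


gender_profile = profile_column(rows, "gender")

# Assumed validity rule: ages outside 0..120 are treated as suspect.
suspect_ages = [record["id"] for record in rows if not 0 <= record["age"] <= 120]

print(gender_profile)
print(suspect_ages)
```

The frequency counts expose inconsistent encodings such as "M" versus "male", and the range check points at the specific records that need attention.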
35 / 50
The input to the data warehouse can come from OLTP or transactional systems, but not from other third-party databases.
36 / 50
Data mining evolved as a mechanism to address the limitations of ________ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.
37 / 50
Companies collect and record their own operational data, but at the same time they also use reference data obtained from _______ sources, such as codes, prices, etc.
38 / 50
_______ is an application of information and data.
39 / 50
The growth of master files and magnetic tapes exploded around the mid-_______.
40 / 50
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
41 / 50
It is observed that every year the amount of data recorded in an organization:
42 / 50
Analytical processing uses ____________, instead of record-level access.
43 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
44 / 50
To measure or quantify similarity or dissimilarity, different techniques are available. Which of the following options represents the names of the available techniques?
45 / 50
_____ modeling technique is more appropriate for data warehouses.
46 / 50
Rearranging the grouping of source data, delivering it to the destination database, and ensuring the quality of data are crucial to the process of loading the data warehouse. Data ____________ is vitally important to the overall health of a warehouse project.
1. Cleansing
2. Cleaning
3. Scrubbing
Which of the following options is true?
47 / 50
NUMA stands for __________.
48 / 50
DOLAP allows download of “cube” structures to a desktop platform with the need for a shared relational or cube server.
49 / 50
____________ in agriculture extension is that pest population beyond which the benefit of spraying outweighs its cost.
50 / 50
_____________ is a process that involves gathering information about a column through the execution of certain queries, with the intention of identifying erroneous records.