CS614-Midterm
1 / 50
Focusing on data warehouse delivery only often end up _________.
2 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
3 / 50
The technique that is used to perform these feats in data mining modeling, and this act of model building is something that people have been doing for long time, certainly before the _______ of computers or data mining technology.
4 / 50
The STAR schema used for data design is a __________ consisting of fact and dimension tables. :
5 / 50
A dense index, if fits into memory, costs only ______ disk I/O access to locate a record by given key.
6 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
7 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
8 / 50
Normalization effects performance
9 / 50
It is observed that every year the amount of data recorded in an organization :
10 / 50
11 / 50
During ETL process of an organization, suppose you have data which can be transformed using any of the transformation method. Which of the following strategy will be your choice for least complexity?
12 / 50
Many data warehouse project teams waste enormous amounts of time searching in vain for a _______.
13 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
14 / 50
In a traditional MIS system, there is an almost linear sequence of queries.
15 / 50
16 / 50
Pre-computed _______ can solve performance problems
17 / 50
30.Data Warehouse is about taking / colleting data from different ________ sources:
18 / 50
It is observed that every year the amount of data recorded in anorganization is
19 / 50
Transactional fact tables do not have records for events that do not occur. These are called
20 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
21 / 50
_________ breaks a table into multiple tables based upon common column values.
22 / 50
Ad-hoc access means to run such queries which are known already.
23 / 50
Data mining is a/an ______ approach , where browsing through data using mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
24 / 50
: An optimized structure which is built primarily for retrieval, with update being only a secondary consideration is
25 / 50
Data mining evolve as mechanism to cater the limitations of _____ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc...
26 / 50
In DWH project, it is assured that ___________ environment is similar to the production environment
27 / 50
If some error occurs, execution will be terminated abnormally and all transactions will be rolled back. In this case when we will access the database we will find it in the state that was before the ____________.
28 / 50
________ gives total view of an organization
29 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
30 / 50
5 million bales.
31 / 50
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
32 / 50
During the application specification activity, we also must give consideration to the organization of the applications.
33 / 50
DSS queries do not involve a primary key
34 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
35 / 50
Data Transformation Services (DTS) provide a set of _____ that lets you extract, transform, and consolidate data from disparate sources into single or multiple destinations supported by DTS connectivity.
36 / 50
Collapsing tables can be done on the ___________ relationships
37 / 50
To identify the __________________ required we need to perform data profiling
38 / 50
.______ is class of Decision Support Environment.
39 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
40 / 50
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
41 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
42 / 50
Taken jointly, the extract programs or naturally evolving systems formed a spider web, also known as
43 / 50
_____________ is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records.
44 / 50
NUMA stands for __________
45 / 50
For a DWH project, the key requirement are ________ and product experience.
46 / 50
If „M‟ rows from table-A match the conditions in the query then table-B is accessed „M‟ times. Suppose table-B has an index on the join column. If „a‟ I/Os are required to read the data block for each scan and „b‟ I/Os for each data block then the total cost of accessing table-B is _____________ logical I/Os approximately.
47 / 50
48 / 50
Pakistan is one of the five major ________ countries in the world.
49 / 50
Companies collect and record their own operational data, but at the same time they also use reference data obtained from _______ sources such as codes, prices etc.
50 / 50
_______________, if too big and does not fit into memory, will be expensive when used to find a record by given key.
Your score is
The average score is 0%
Restart quiz