CS614-Midterm
1 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
2 / 50
_____modeling technique is more appropriate for data warehouses.
3 / 50
4 / 50
Grain is the ________ level of data stored in the warehouse.
5 / 50
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
6 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
7 / 50
Pre-join technique is used to avoid
8 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
9 / 50
_______ is an application of information and data.
10 / 50
Ad-hoc access means to run such queries which are known already.
11 / 50
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
12 / 50
13 / 50
For a relation to be in 4NF it must be:-
14 / 50
Companies collect and record their own operational data, but at the same time they also use reference data obtained from _______ sources such as codes, prices etc.
15 / 50
De-Normalization normally speeds up
16 / 50
5 million bales.
17 / 50
30.Data Warehouse is about taking / colleting data from different ________ sources:
18 / 50
Slice and Dice is changing the view of the data.
19 / 50
A single database, couldn‟t serve both operational high performance transaction processing and DSS, analytical processing, all at the same time.
20 / 50
Normalization effects performance
21 / 50
_________ breaks a table into multiple tables based upon common column values.
22 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
23 / 50
Taken jointly, the extract programs or naturally evolving systems formed a spider web, also known as
24 / 50
The divide & conquer cube partitioning approach helps alleviate the ____________ limitations of MOLAP implementation.
25 / 50
A ________ dimension is a collection of random transactional codes, flags and/text attributes that are unrelated to any particular dimension. The ______ dimension is simply a structure that provides a convenient place to store the ______ attributes.
26 / 50
The need to synchronize data upon update is called
27 / 50
Data mining derives its name from the similarities between searching for valuable business information in a large database, for example, finding linked products in gigabytes of store scanner data, and mining a mountain for a _________ of valuable ore.
28 / 50
In DWH project, it is assured that ___________ environment is similar to the production environment
29 / 50
Data Transformation Services (DTS) provide a set of _____ that lets you extract, transform, and consolidate data from disparate sources into single or multiple destinations supported by DTS connectivity.
30 / 50
Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
31 / 50
________ gives total view of an organization
32 / 50
NUMA stands for __________
33 / 50
B-Tree is used as an index to provide access to records
34 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
35 / 50
People that design and build the data warehouse must be capable of working across the organization at all levels
36 / 50
Relational databases allow you to navigate the data in ____________ that is appropriate using the primary, foreign key structure within the data model.
37 / 50
_______________, if too big and does not fit into memory, will be expensive when used to find a record by given key.
38 / 50
_______________, if fits into memory, costs only one disk I/O access to locate a record by given key.
39 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
40 / 50
Analytical processing uses ____________ , instead of record level access.
41 / 50
The input to the data warehouse can come from OLTP or transactional system but not from other third party database.
42 / 50
There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
43 / 50
Horizontal splitting breaks a table into multiple tables based upon_______
44 / 50
For a DWH project, the key requirement are ________ and product experience.
45 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
46 / 50
To identify the __________________ required we need to perform data profiling
47 / 50
A data warehouse implementation without an OLAP tool is always possible.
48 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
49 / 50
Pakistan is one of the five major ________ countries in the world.
50 / 50
Your score is
The average score is 0%
Restart quiz