CS614-Midterm
1 / 50
If every key in the data file is represented in the index file, then the index is:
2 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
3 / 50
The key idea behind ___________ is to take a big task and break it into subtasks that can be processed concurrently on a stream of data inputs in multiple, overlapping stages of execution.
4 / 50
To judge effectiveness we perform data profiling twice.
5 / 50
_________ breaks a table into multiple tables based upon common column values.
6 / 50
Data mining evolved as a mechanism to cater for the limitations of ________ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.
7 / 50
Collapsing tables can be done on the ___________ relationships
8 / 50
With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.
9 / 50
Data mining evolved as a mechanism to cater for the limitations of _____ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.
10 / 50
An optimized structure that is built primarily for retrieval, with update being only a secondary consideration, is:
11 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
12 / 50
Relational databases allow you to navigate the data in ____________ that is appropriate using the primary, foreign key structure within the data model.
13 / 50
Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
14 / 50
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
15 / 50
Focusing on data warehouse delivery only often ends up _________.
16 / 50
For a smooth DWH implementation we must be a technologist.
17 / 50
The STAR schema used for data design is a __________ consisting of fact and dimension tables.
18 / 50
The need to synchronize data upon update is called
19 / 50
Which statement is true for De-Normalization?
20 / 50
NUMA stands for __________
21 / 50
Kimball's iterative data warehouse development approach drew on decades of experience to develop the _________.
22 / 50
Horizontal splitting breaks a table into multiple tables based upon _______
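As an illustration of the mechanics only, the following minimal sketch splits one relation into several smaller tables. The `split_by_column` helper, the sample `sales` rows, and the column names are all hypothetical and are not taken from the course material.

```python
from collections import defaultdict


def split_by_column(rows, column):
    """Partition rows into separate tables, one per distinct value of `column`."""
    tables = defaultdict(list)
    for row in rows:
        tables[row[column]].append(row)
    return dict(tables)


# Hypothetical sample relation; the schema is illustrative only.
sales = [
    {"id": 1, "region": "North", "amount": 120},
    {"id": 2, "region": "South", "amount": 75},
    {"id": 3, "region": "North", "amount": 60},
]

# Each resulting table keeps every column but only a subset of the rows.
partitions = split_by_column(sales, "region")
```

Every row lands in exactly one partition, so the original relation can be rebuilt by taking the union of the partitions.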
23 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:
24 / 50
We must try to find the one access tool that will handle all the needs of the users.
25 / 50
________ gives total view of an organization
26 / 50
The technique that is used to perform these feats in data mining is called modeling, and this act of model building is something that people have been doing for a long time, certainly before the _______ of computers or data mining technology.
27 / 50
_______ is a class of Decision Support Environment.
28 / 50
29 / 50
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
30 / 50
____________ in agriculture extension is that pest population beyond which the benefit of spraying outweighs its cost.
31 / 50
Pre-computed _______ can solve performance problems
32 / 50
_______ is an application of information and data.
33 / 50
34 / 50
A node of a B-Tree is stored in a memory block, and traversing a B-Tree involves ______ page faults.
35 / 50
36 / 50
A ________ dimension is a collection of random transactional codes, flags and/or text attributes that are unrelated to any particular dimension. The ______ dimension is simply a structure that provides a convenient place to store the ______ attributes.
37 / 50
_____ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limited capability to provide decision support and analysis.
38 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of computation.
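The relationship between the sequential portion of a program and its scalability can be explored numerically. The sketch below assumes Amdahl's law as the model, which the question itself does not name; the function name `amdahl_speedup` and the sample fractions are illustrative choices.

```python
def amdahl_speedup(sequential_fraction, processors):
    """Theoretical speedup under Amdahl's law: 1 / (s + (1 - s) / p)."""
    if not 0.0 <= sequential_fraction <= 1.0:
        raise ValueError("sequential_fraction must be between 0 and 1")
    if processors < 1:
        raise ValueError("processors must be at least 1")
    parallel_fraction = 1.0 - sequential_fraction
    return 1.0 / (sequential_fraction + parallel_fraction / processors)


# Compare a few assumed sequential fractions on a fixed number of processors.
for s in (0.5, 0.1, 0.01):
    print(f"sequential fraction {s:.2f}: speedup on 16 processors = "
          f"{amdahl_speedup(s, 16):.2f}")
```

Running the loop with different fractions shows how strongly the sequential part bounds the achievable speedup, regardless of how many processors are added.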
39 / 50
Naturally Evolving architecture occurred when an organization had a _______ approach to handling the whole process of hardware and software architecture.
40 / 50
For a DWH project, the key requirements are ________ and product experience.
41 / 50
Data mining derives its name from the similarities between searching for valuable business information in a large database, for example, finding linked products in gigabytes of store scanner data, and mining a mountain for a _________ of valuable ore.
42 / 50
____________ in agriculture extension is that pest population beyond which the benefit of spraying outweighs its cost.
43 / 50
Normalization affects performance
44 / 50
45 / 50
In a _________ system, the contents change with time.
46 / 50
DSS queries do not involve a primary key
47 / 50
A dense index, if it fits into memory, costs only ______ disk I/O access to locate a record by a given key.
48 / 50
Change Data Capture is one of the challenging technical issues in _____________
49 / 50
50 / 50
Slice and Dice is changing the view of the data.