CS614-Midterm
1 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
2 / 50
There are many variants of the traditional nested-loop join. If an index is exploited, it is called _______
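As an illustration of the variant this question describes, here is a minimal Python sketch in which a lookup structure on the inner table replaces the inner scan. The function name `join_with_index`, the sample tables, and the use of an in-memory dictionary as a stand-in for a database index are all assumptions made for the example; they do not come from the course material.

```python
from collections import defaultdict


def join_with_index(outer, inner, key):
    """Join two lists of dicts on `key`, probing an index built on `inner`."""
    # Build the index once: join-key value -> list of matching inner rows.
    # (A dict stands in for a real database index here.)
    index = defaultdict(list)
    for row in inner:
        index[row[key]].append(row)

    joined = []
    for outer_row in outer:                              # one pass over the outer table
        for inner_row in index.get(outer_row[key], []):  # index probe, no inner scan
            joined.append({**inner_row, **outer_row})
    return joined


# Hypothetical sample data.
customers = [{"cust_id": 1, "name": "Ali"}, {"cust_id": 2, "name": "Sara"}]
orders = [
    {"cust_id": 1, "amount": 250},
    {"cust_id": 1, "amount": 90},
    {"cust_id": 3, "amount": 40},
]
result = join_with_index(customers, orders, "cust_id")
```

Each outer row costs one index probe instead of a full pass over the inner table, which is the point of exploiting the index.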
3 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
4 / 50
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
5 / 50
For a smooth DWH implementation we must be a technologist.
6 / 50
Data mining derives its name from the similarities between searching for valuable business information in a large database, for example, finding linked products in gigabytes of store scanner data, and mining a mountain for a _________ of valuable ore.
7 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of the organization from:
8 / 50
Normalization affects performance
9 / 50
_______ is an application of information and data.
10 / 50
A virtual cube is used to query two similar cubes by creating a third “virtual” cube through a join between the two cubes.
11 / 50
The goal of star schema design is to simplify ________
12 / 50
If every key in the data file is represented in the index file, then the index is:
13 / 50
Pakistan is one of the five major ________ countries in the world.
14 / 50
Naturally Evolving architecture occurred when an organization had a _______ approach to handling the whole process of hardware and software architecture.
15 / 50
If w is the window size and n is the size of the data set, then the complexity of the merging phase in the BSN method is ___________
16 / 50
When performing objective assessments, companies follow a set of principles to develop metrics specific to their needs; it is hard to have a “one size fits all” approach. Which of the following statements represents the pervasive functional forms?
17 / 50
To judge effectiveness we perform data profiling twice.
18 / 50
If 'M' rows from table-A match the conditions in the query then table-B is accessed 'M' times. Suppose table-B has an index on the join column. If 'a' I/Os are required to read the data block for each scan and 'b' I/Os for each data block then the total cost of accessing table-B is _____________ logical I/Os approximately.
19 / 50
Horizontal splitting breaks a table into multiple tables based upon _______
20 / 50
An optimized structure which is built primarily for retrieval, with update being only a secondary consideration, is:
21 / 50
The goal of ___________ is to look at as few blocks as possible to find the matching record(s).
22 / 50
It is observed that every year the amount of data recorded in an organization:
23 / 50
24 / 50
25 / 50
If some error occurs, execution will be terminated abnormally and all transactions will be rolled back. In this case, when we access the database we will find it in the state it was in before the ____________.
26 / 50
27 / 50
_________ breaks a table into multiple tables based upon common column values.
28 / 50
The goal of ______ is to look at as few blocks as possible to find the matching records.
29 / 50
Execution can be completed successfully or it may be stopped due to some error. In case of successful completion of execution all the transactions will be ___________
30 / 50
During the ETL process of an organization, suppose you have data which can be transformed using any of the transformation methods. Which of the following strategies will be your choice for least complexity?
31 / 50
_______________, if too big and does not fit into memory, will be expensive when used to find a record by given key.
32 / 50
________ is the technique in which existing heterogeneous segments are reshuffled and relocated into homogeneous segments.
33 / 50
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
34 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
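For readers who want to see horizontal splitting in action, the sketch below partitions one table into several smaller tables that all keep the same columns. The helper name `split_horizontally` and the sample `sales` rows are hypothetical and exist only for illustration.

```python
from collections import defaultdict


def split_horizontally(rows, column):
    """Partition rows into separate tables, one per distinct value of `column`."""
    tables = defaultdict(list)
    for row in rows:
        # Every row moves whole: all of its columns go to the same table.
        tables[row[column]].append(row)
    return dict(tables)


# Hypothetical sample data.
sales = [
    {"region": "North", "year": 2001, "amount": 120},
    {"region": "South", "year": 2001, "amount": 75},
    {"region": "North", "year": 2002, "amount": 140},
]
by_region = split_horizontally(sales, "region")
```

A query restricted to one region then reads only that region's table, while the union of all the tables still reproduces the original relation.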
35 / 50
For a DWH project, the key requirements are ________ and product experience.
36 / 50
Pre-computed _______ can solve performance problems
37 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
38 / 50
It is observed that every year the amount of data recorded in an organization is
39 / 50
The divide & conquer cube partitioning approach helps alleviate the ____________ limitations of MOLAP implementation.
40 / 50
41 / 50
In a traditional MIS system, there is an almost linear sequence of queries.
42 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
43 / 50
DSS queries do not involve a primary key
44 / 50
Rearranging the grouping of source data, delivering it to the destination database, and ensuring the quality of data are crucial to the process of loading the data warehouse. Data ____________ is vitally important to the overall health of a warehouse project.
1. Cleansing
2. Cleaning
3. Scrubbing
Which of the following options is true?
45 / 50
A cube is a __________ entity containing values of a certain fact at a certain aggregation level at an intersection of a combination of dimensions.
46 / 50
The _________ is only a small part in realizing the true business value buried within the mountain of data collected and stored within organizations' business systems and operational databases.
47 / 50
All data is ______________ of something real.
I. An Abstraction
II. A Representation
Which of the following options is true?
48 / 50
Relational databases allow you to navigate the data in ____________ that is appropriate using the primary, foreign key structure within the data model.
49 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of computation.
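The relationship between the sequential portion and scalability can be checked numerically. The sketch below uses Amdahl's law, which the quiz does not name but which is the standard formula for this effect: speedup = 1 / (s + (1 - s) / N), where s is the sequential fraction and N the number of processors. The function name `speedup` and the sample values are assumptions made for the example.

```python
def speedup(sequential_fraction, processors):
    """Amdahl's law: best-case speedup when part of the work cannot be parallelized."""
    if not 0.0 <= sequential_fraction <= 1.0:
        raise ValueError("sequential_fraction must be between 0 and 1")
    if processors < 1:
        raise ValueError("processors must be at least 1")
    parallel_fraction = 1.0 - sequential_fraction
    return 1.0 / (sequential_fraction + parallel_fraction / processors)


# Compare several sequential fractions on the same 16 processors.
for s in (0.5, 0.1, 0.01):
    print(f"sequential fraction {s:.2f}: speedup {speedup(s, 16):.2f}x")
```

With 16 processors, cutting the sequential fraction from 0.5 to 0.1 raises the best-case speedup from under 2x to 6.4x.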
50 / 50
Data mining evolved as a mechanism to cater for the limitations of ________ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data resources, etc.