CS614 Midterm Online Quiz

0%

CS614-Midterm

1 / 50

Data mining uses _________ algorithms to discover patterns and regularities in data.

2 / 50

Suppose the amount of data recorded in an organization is doubled every year. This increase is __________ .

3 / 50

_____________ is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records.

4 / 50

The _________ is only a small part in realizing the true business value buried within the mountain of data collected and stored within organizations business systems and operational databases.

5 / 50

People that design and build the data warehouse must be capable of working across the organization at all levels

6 / 50

Grain is the ________ level of data stored in the warehouse.

7 / 50

Node of a B-Tree is stored in memory block and traversing a B-Tree involves ______ page faults.

8 / 50

_____modeling technique is more appropriate for data warehouses.

9 / 50

There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called

10 / 50

Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.

11 / 50

Data Transformation Services (DTS) provide a set of _____ that lets you extract, transform, and consolidate data from disparate sources into single or multiple destinations supported by DTS connectivity.

12 / 50

The automated, prospective analyses offered by data mining move beyond the analysis of past events provided by respective tools typical of ___________.

13 / 50

NUMA stands for __________

14 / 50

DOLAP allows download of “cube” structures to a desktop platform with the need for shared relational or cube server.

15 / 50

For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give the total view of organization from:

16 / 50

Data mining evolve as a mechanism to cater the limitations of ________ systems to deal massive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.

17 / 50

_____ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limitedcapability to provide decision support and analysis.

18 / 50

Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.

19 / 50

Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.

20 / 50

The goal of star schema design is to simplify ________

21 / 50

To judge effectiveness we perform data profiling twice.

22 / 50

: An optimized structure which is built primarily for retrieval, with update being only a secondary consideration is

23 / 50

.______ is class of Decision Support Environment.

24 / 50

Pakistan is one of the five major ________ countries in the world.

25 / 50

The goal of ______is to look at as few block as possible to find the matching records.

26 / 50

Data mining is a/an ______ approach , where browsing through data using mining techniques may reveal something that might be of interest to the user as information that was unknown previously.

27 / 50

: An optimized structure which is built primarily for retrieval, with update being only a secondary consideration is

28 / 50

In horizontal splitting, we split a relation into multiple tables on the basis of

29 / 50

Ad-hoc access means to run such queries which are known already.

30 / 50

A single database, couldn‟t serve both operational high performance transaction processing and DSS, analytical processing, all at the same time.

31 / 50

If every key in the data file is represented in the index file then index is :

32 / 50

With data mining, the best way to accomplish this is by setting aside some of your data in a vault to isolate it from the mining process; once the mining is complete, the results can be tested against the isolated data to confirm the model's _______.

33 / 50

It is observed that every year the amount of data recorded in anorganization is

34 / 50

The Kimball s iterative data warehouse development approach drew on decades of experience to develop the _________.

35 / 50

Pakistan is one of the five major ________ countries in the world.

36 / 50

The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of computation.

37 / 50

Relational databases allow you to navigate the data in ____________ that is appropriate using the primary, foreign key structure within the data model.

38 / 50

DTS allows us to connect through any data source or destination that is supported by ____________

39 / 50

Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.

40 / 50

A ________ dimension is a collection of random transactional codes, flags and/text attributes that are unrelated to any particular dimension. The ______ dimension is simply a structure that provides a convenient place to store the ______ attributes.

41 / 50

If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.

42 / 50

B-Tree is used as an index to provide access to records

43 / 50

_______________, if fits into memory, costs only one disk I/O access to locate a record by given key.

44 / 50

The purpose of the House of Quality technique is to reduce ______ types of risk.

45 / 50

________ is the technique in which existing heterogeneous segments are reshuffled, relocated into homogeneous segments.

46 / 50

To identify the __________________ required we need to perform data profiling

47 / 50

The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.

48 / 50

DTS allows us to connect through any data source or destination that is supported by ____________

49 / 50

In DWH project, it is assured that ___________ environment is similar to the production environment

50 / 50

For a given data set, to get a global view in un-supervised learning we use

Your score is

The average score is 0%

0%

Qunoot e Nazilah
Dua e Hajat
4 Qul
6 Kalma
Dua-e-Akasha
Darood Akbar
Surah Fatiha
Dua-e-Ganj Ul Arsh
Dua-e-Jamilah
Ayat-ul-Kursi