CS614-Midterm
1 / 50
The _________ is only a small part in realizing the true business value buried within the mountain of data collected and stored within an organization's business systems and operational databases.
2 / 50
To identify the __________________ required, we need to perform data profiling.
3 / 50
_________ breaks a table into multiple tables based upon common column values.
4 / 50
If w is the window size and n is the size of the data set, then the complexity of the merging phase in the BSN method is ___________
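A brief worked note, assuming BSN here is the basic sorted-neighborhood method for duplicate detection: after sorting, a window of w records slides over the n records, and each record entering the window is compared with the w - 1 records already in it, so the merging phase performs roughly

    (w - 1)\,n = O(wn)

comparisons; the sorting phase, at O(n \log n), is counted separately.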
5 / 50
Horizontal splitting breaks a table into multiple tables based upon _______
6 / 50
If every key in the data file is represented in the index file, then the index is:
7 / 50
_____ modeling technique is more appropriate for data warehouses.
8 / 50
Data mining is a/an ______ approach, where browsing through data using mining techniques may reveal something that might be of interest to the user as information that was previously unknown.
9 / 50
Suppose the amount of data recorded in an organization is doubled every year. This increase is __________.
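A one-line worked equation, where D_0 is a hypothetical starting volume: doubling every year means the data recorded after t years is

    D(t) = D_0 \cdot 2^{t},

i.e. the increase is exponential (geometric) rather than linear.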
10 / 50
Multidimensional databases typically use proprietary __________ format to store pre-summarized cube structures.
11 / 50
A virtual cube is used to query two similar cubes by creating a third “virtual” cube via a join between the two cubes.
12 / 50
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following options represents the names of the available techniques?
13 / 50
Grain is the ________ level of data stored in the warehouse.
14 / 50
Data Transformation Services (DTS) provide a set of _____ that lets you extract, transform, and consolidate data from disparate sources into single or multiple destinations supported by DTS connectivity.
15 / 50
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
16 / 50
The need to synchronize data upon update is called
17 / 50
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
18 / 50
De-Normalization normally speeds up
19 / 50
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The __________ the portion of the program that must be executed sequentially, the greater the scalability of the computation.
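A short note, assuming the question alludes to the Amdahl's-law relationship: if s is the fraction of the computation that must run sequentially, the speedup on N processors is bounded by

    S(N) = \frac{1}{s + (1 - s)/N} \le \frac{1}{s},

so the smaller the sequential portion, the greater the achievable scalability.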
20 / 50
If 'M' rows from table-A match the conditions in the query, then table-B is accessed 'M' times. Suppose table-B has an index on the join column. If 'a' I/Os are required to traverse the index for each scan and 'b' I/Os to read each data block, then the total cost of accessing table-B is _____________ logical I/Os approximately.
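A short worked example, reading 'a' as the per-probe index I/Os and 'b' as the data-block I/Os, with hypothetical values M = 1000, a = 3, b = 1 (these numbers are not from the question): each of the M probes of table-B costs a + b I/Os, so the total is approximately

    M \times (a + b) = 1000 \times (3 + 1) = 4000

logical I/Os.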
21 / 50
Pakistan is one of the five major ________ countries in the world.
22 / 50
A single database couldn't serve both operational, high-performance transaction processing and DSS analytical processing at the same time.
23 / 50
A dense index, if it fits into memory, costs only ______ disk I/O access to locate a record by a given key.
24 / 50
Companies collect and record their own operational data, but at the same time they also use reference data obtained from _______ sources, such as codes, prices, etc.
25 / 50
The technique that is used to perform these feats in data mining is called modeling, and this act of model building is something that people have been doing for a long time, certainly before the _______ of computers or data mining technology.
26 / 50
For a DWH project, the key requirements are ________ and product experience.
27 / 50
For good decision making, data should be integrated across the organization to cross the LoB (Line of Business). This is to give a total view of the organization from:
28 / 50
29 / 50
The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by _____________ tools typical of decision support systems.
30 / 50
It is observed that every year the amount of data recorded in an organization is
31 / 50
During the application specification activity, we must also give consideration to the organization of the applications.
32 / 50
The input to the data warehouse can come from OLTP or transactional systems but not from other third-party databases.
33 / 50
There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
34 / 50
In a DWH project, it is ensured that the ___________ environment is similar to the production environment.
35 / 50
To judge effectiveness, we perform data profiling twice.
36 / 50
The purpose of the House of Quality technique is to reduce ______ types of risk.
37 / 50
The pre-join technique is used to avoid
38 / 50
In horizontal splitting, we split a relation into multiple tables on the basis of
39 / 50
DTS allows us to connect through any data source or destination that is supported by ____________
40 / 50
During the ETL process of an organization, suppose you have data that can be transformed using any of the transformation methods. Which of the following strategies would be your choice for the least complexity?
41 / 50
Analytical processing uses ____________ instead of record-level access.
42 / 50
Data mining evolved as a mechanism to cater for the limitations of ________ systems in dealing with massive data sets with high dimensionality, new data types, multiple heterogeneous data sources, etc.
43 / 50
The STAR schema used for data design is a __________ consisting of fact and dimension tables.
44 / 50
The goal of star schema design is to simplify ________
45 / 50
For a smooth DWH implementation, we must be a technologist.
46 / 50
A B-Tree is used as an index to provide access to records
47 / 50
48 / 50
If someone told you that he had a good model to predict customer usage, the first thing you might try would be to ask him to apply his model to your customer _______, where you already knew the answer.
49 / 50
Kimball's iterative data warehouse development approach drew on decades of experience to develop the _________.
50 / 50
The key idea behind ___________ is to take a big task and break it into subtasks that can be processed concurrently on a stream of data inputs in multiple, overlapping stages of execution.
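A minimal Python sketch of this idea, with illustrative stage names (extract/transform/load) and dummy records that are assumptions for the example, not part of the question: three stages run concurrently, connected by queues, so a downstream stage starts working on early records while upstream stages are still producing later ones.

    import queue
    import threading

    SENTINEL = None  # marks the end of the record stream

    def extract(out_q, n=10):
        # Stage 1: pretend to read n source records.
        for i in range(n):
            out_q.put({"id": i})
        out_q.put(SENTINEL)

    def transform(in_q, out_q):
        # Stage 2: transform each record as soon as it arrives.
        while True:
            rec = in_q.get()
            if rec is SENTINEL:
                out_q.put(SENTINEL)
                break
            rec["id_squared"] = rec["id"] ** 2
            out_q.put(rec)

    def load(in_q):
        # Stage 3: pretend to write each transformed record to the warehouse.
        while True:
            rec = in_q.get()
            if rec is SENTINEL:
                break
            print("loaded", rec)

    q1, q2 = queue.Queue(), queue.Queue()
    stages = [
        threading.Thread(target=extract, args=(q1,)),
        threading.Thread(target=transform, args=(q1, q2)),
        threading.Thread(target=load, args=(q2,)),
    ]
    for t in stages:
        t.start()
    for t in stages:
        t.join()

Because the queues decouple the stages, overall throughput improves even though the time spent on any single record (the sub-task execution time) is unchanged.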