Virtual University MCQs BANK - MCQs Collection from Online Quizzes
CS614 Data Warehousing Solved MCQs from Quiz # 2
- Details
- Category: CS614 - Data Warehousing MCQs
- Published on Thursday, 07 June 2012 16:00
- Written by Bonfire
CS614 Data Warehousing Solved MCQs from Quiz # 2
Solved and shared by Gulshan
Many data warehouse project teams waste enormous amounts of time searching in vain for a ___________________.
Silver Bullet
Golden Bullet
Suitable Hardware
Compatible Product
A dense index, if fits into memory, costs only ______ disk I/O access to locate a record by given key.
One
Two
lg (n)
n
All data is ______________ of something real.
I An Abstraction
II A Representation
Which of the following option is true?
I Only
II Only
Both I & II
None of I & II
The key idea behind ___________ is to take a big task and break it into subtasks that can be processed concurrently on a stream of data inputs in multiple, overlapping stages of execution.
Pipeline Parallelism
Overlapped Parallelism
Massive Parallelism
Distributed Parallelism
Non uniform distribution, when the data is distributed across the processors, is called ______.
Skew in Partition
Pipeline Distribution
Distributed Distribution
Uncontrolled Distribution
The goal of ideal parallel execution is to completely parallelize those parts of a computation that are not constrained by data dependencies. The smaller the portion of the program that must be executed __________, the greater the scalability of the computation.
None of these
Sequentially
In Parallel
Distributed
Data mining is a/an __________ approach, where browsing through data using data mining techniques may reveal something that might be of interest to the user as information that was unknown previously.
Exploratory
Non-Exploratory
Computer Science
Data mining evolve as a mechanism to cater the limitations of ________ systems to dealmassive data sets with high dimensionality, new data types, multiple heterogeneous data resources etc.
OLTP
OLAP
DSS
DWH
________ is the technique in which existing heterogeneous segments are reshuffled, relocated into homogeneous segments.
Clustering
Aggregation
Segmentation
Partitioning
To measure or quantify the similarity or dissimilarity, different techniques are available. Which of the following option represent the name of available techniques?
Pearson correlation is the only technique
Euclidean distance is the only technique
Both Pearson correlation and Euclidean distance
None of these
For a DWH project, the key requirement are ________ and product experience.
Tools
Industry
Software
None of these
Pipeline parallelism focuses on increasing throughput of task execution, NOT on __________ sub-task execution time.
Increasing
Decreasing
Maintaining
None of these
Focusing on data warehouse delivery only often end up _________.
Rebuilding
Success
Good Stable Product
None of these
Pakistan is one of the five major ________ countries in the world.
Cotton-growing
Rice-growing
Weapon Producing
_____________ is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records.
Data profiling
Data Anomaly Detection
Record Duplicate Detection
None of these
Relational databases allow you to navigate the data in ____________ that is appropriate using the primary, foreign key structure within the data model.
Only One Direction
Any Direction
Two Direction
None of these
DSS queries do not involve a primary key
True
False
__________________ contributes to an under-utilization of valuable and expensive historical data, and inevitably results in a limited capability to provide decision support and analysis.
The lack of data integration and standardization
Missing Data
Data Stored in Heterogeneous Sources
DTS allows us to connect through any data source or destination that is supported by ____________
OLE DB
OLAP
OLTP
Data Warehouse
If some error occurs, execution will be terminated abnormally and all transactions will be rolled back. In this case when we will access the database we will find it in the state that was before the ____________.
Execution of package
Creation of package
Connection of package
The need to synchronize data upon update is called
Data Manipulation
Data Replication
Data Coherency
Data Imitation
Taken jointly, the extract programs or naturally evolving systems formed a spider web, also known as
Distributed Systems Architecture
Legacy Systems Architecture
Online Systems Architecture
Intranet Systems Architecture
It is observed that every year the amount of data recorded in an organization is
Doubles
Triples
Quartiles
Remains same as previous year
Pre-computed _______ can solve performance problems
Aggregates
Facts
Dimensions
The degree of similarity between two records, often measured by a numerical value between _______, usually depends on application characteristics.
0 and 1
0 and 10
0 and 100
0 and 99
The purpose of the House of Quality technique is to reduce ______ types of risk.
Two
Three
Four
All
NUMA stands for __________
Non-uniform Memory Access
Non-updateable Memory Architecture
New Universal Memory Architecture
There are many variants of the traditional nested-loop join. If the index is built as part of the query plan and subsequently dropped, it is called
Naive nested-loop join
Index nested-loop join
Temporary index nested-loop join
None of these
The Kimball s iterative data warehouse development approach drew on decades of experience to develop the _____________.
Business Dimensional Lifecycle
Data Warehouse Dimension
Business Definition Lifecycle
OLAP Dimension
During the application specification activity, we also must give consideration to the organization of the applications.
True
False
The most recent attack is the ________ attack on the cotton crop during 2003- 04, resulting in a loss of nearly 0.5 million bales.
Boll Worm
Purple Worm
Blue Worm
Cotton Worm
The users of data warehouse are knowledge workers in other words they are_________ in the organization.
Decision maker
Manager
Database Administrator
DWH Analyst
_________ breaks a table into multiple tables based upon common column values.
Horizontal splitting
Vertical splitting
_____modeling technique is more appropriate for data warehouses.
entity-relationship
dimensional
physical
None of the given
Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
SQL
proprietary file
Object oriented
Non- proprietary file
Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
Unusual
Essential
Optional
None of the given
Analytical processing uses ____________ , instead of record level access.
multi-level aggregates
Single-level aggregates
Single-level hierarchy
None of the Given
The divide&conquer cube partitioning approach helps alleviate the ____________ limitations of MOLAP implementation.
Flexibility
Maintainability
Security
Scalability
Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
Mandatory
Whole
Analysis
Prediction
Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
Mandatory
Whole
Analysis
Prediction
Virtual cube is used to query two similar cubes by creating a third “virtual” cube by a join between two cubes.
True
False
Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
Select correct option:
Unusual
Essential
Optional
None of the given
