Skip to Main Content
Frequently Asked Questions
Submit an ETD
Global Search Box
Need Help?
Keyword Search
Participating Institutions
Advanced Search
School Logo
Files
File List
toledo1104773641.pdf (738.46 KB)
ETD Abstract Container
Abstract Header
Data Warehouse Operational Design: View Selection and Performance Simulation
Author Info
AGRAWAL, VIKAS R.
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=toledo1104773641
Abstract Details
Year and Degree
2005, Doctor of Manufacturing Management, University of Toledo, Manufacturing Management.
Abstract
Decision support systems are a key to gaining competitive advantage. Many corporations have built or are building unified decision-support databases called data warehouses on which decision makers can carry out their analysis. A data warehouse is a very large data base that integrates information extracted from multiple, independent, heterogeneous data sources to support business analysis activities and decision-making tasks. The data that is likely to be in demand is generally pre-computed and stored ahead of time at the data warehouse in the form of materialized views. This dramatically reduces execution time of decision support queries from hours or days to minutes or even seconds. There are many architectural issues concerning the efficient design of a data warehouse. This dissertation studies in depth three important issues. The first issue addressed is the Materialized View Selection (MVS) problem, which is the problem of choosing an optimal set of views to materialize under resource constraints. We have formulated interesting bottleneck versions of this problem and presented the 0-1 Integer Programming models as well as the heuristic procedures. Performance analysis of the heuristic procedures is also presented. Formulation of the MVS problem requires knowledge of the number of rows in each view in a given lattice structure, which refers to views and their interrelationships for a given set of dimensions. Counting actual number of rows present in each view takes considerable time. The second issue addressed in this dissertation focuses on the statistical sampling techniques applied to data warehouses to estimate number of rows in each view in a given lattice structure. We have shown that the application of sampling techniques results in significant time savings without compromising on accuracy. The third issue deals with modeling the behavior and performance of a data warehouse system using simulation. We implemented the model in ARENA. The model enables a data warehouse manager to walk through various scenarios to investigate the synergy among various system components and to identify areas of inefficiencies in the system. This could also help improve overall performance of the data warehouse system.
Committee
Mesbah Ahmed (Advisor)
Pages
129 p.
Subject Headings
Business Administration, Management
Keywords
Data Cube
;
Data Warehouse
;
Simulation of Data Warehouse
;
Heuristics
;
View Materialization
;
View Selection
;
Sampling in Data Warehouse
Recommended Citations
Refworks
EndNote
RIS
Mendeley
Citations
AGRAWAL, V. R. (2005).
Data Warehouse Operational Design: View Selection and Performance Simulation
[Doctoral dissertation, University of Toledo]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=toledo1104773641
APA Style (7th edition)
AGRAWAL, VIKAS.
Data Warehouse Operational Design: View Selection and Performance Simulation.
2005. University of Toledo, Doctoral dissertation.
OhioLINK Electronic Theses and Dissertations Center
, http://rave.ohiolink.edu/etdc/view?acc_num=toledo1104773641.
MLA Style (8th edition)
AGRAWAL, VIKAS. "Data Warehouse Operational Design: View Selection and Performance Simulation." Doctoral dissertation, University of Toledo, 2005. http://rave.ohiolink.edu/etdc/view?acc_num=toledo1104773641
Chicago Manual of Style (17th edition)
Abstract Footer
Document number:
toledo1104773641
Download Count:
4,063
Copyright Info
© 2005, all rights reserved.
This open access ETD is published by University of Toledo and OhioLINK.