Data Warehouse Testing

The term "data warehouse" was first coined by Bill Inmon in 1990. A data warehouse is a subject-oriented, integrated, time-variant and non-volatile collection of data. It provides generalised and consolidated data in a multidimensional view, along with online analytical processing (OLAP) tools that support interactive and effective analysis of data in a multidimensional space.

Generally, a data warehouse is a database that is kept separate from the organisation's operational database. It is not updated frequently. It contains consolidated historical data, which helps the organisation analyse its business. Data warehouse systems also help to integrate a diversity of application systems.

Features of a data warehouse:

  1. Subject oriented- It provides information about a subject rather than about the organisation's ongoing operations. The subjects can be product, customer, supplier, sales and revenue.
  2. Integrated- A data warehouse is constructed by integrating data from many different sources, such as relational databases and flat files.
  3. Time variant- The data gathered in a data warehouse is identified with a particular time period.
  4. Non-volatile- Previous data is not erased when new data is added. A data warehouse is kept separate from the operational database, so frequent changes in the operational database are not reflected in the data warehouse.
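
The time-variant and non-volatile features can be illustrated with a small sketch. It is not taken from any particular warehouse product: the table `customer_history`, its columns, and the helper names are hypothetical, and an in-memory SQLite database stands in for the warehouse. The point it shows is that a load only appends rows stamped with a time period and never overwrites earlier ones.

```python
import sqlite3


def create_warehouse() -> sqlite3.Connection:
    """Create an in-memory stand-in for a warehouse table (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE customer_history ("
        " customer_id INTEGER, city TEXT, load_date TEXT)"
    )
    return conn


def load_snapshot(conn: sqlite3.Connection, customer_id: int,
                  city: str, load_date: str) -> None:
    """Append a row for one time period.

    Non-volatile: the load only INSERTs; it never UPDATEs or DELETEs
    rows written by earlier loads.
    """
    conn.execute(
        "INSERT INTO customer_history VALUES (?, ?, ?)",
        (customer_id, city, load_date),
    )
    conn.commit()


def history(conn: sqlite3.Connection, customer_id: int) -> list:
    """Return every recorded state of a customer, oldest first (time variant)."""
    cursor = conn.execute(
        "SELECT city, load_date FROM customer_history"
        " WHERE customer_id = ? ORDER BY load_date",
        (customer_id,),
    )
    return cursor.fetchall()


warehouse = create_warehouse()
load_snapshot(warehouse, 1, "Pune", "2023-01-01")
load_snapshot(warehouse, 1, "Delhi", "2024-01-01")  # customer moved; old row is kept
print(history(warehouse, 1))
# [('Pune', '2023-01-01'), ('Delhi', '2024-01-01')]
```

An operational database would typically overwrite the customer's city in place; here both states remain available for analysis.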

Example of a data warehouse:

Consider an e-commerce website that has a warehouse holding all the data about its products. It tracks the demand for the products, procures them from the suppliers and stores them in the warehouse, so that when an order is placed the goods are dispatched immediately. Now suppose there is no data warehouse behind the website. Whenever a customer searches for product details, the website has to contact the supplier directly, which puts a strain on the manufacturer to supply those products. It also delays the delivery of products to customers. This reduces demand and hurts the business.

The primary reason to have a data warehouse for any company is to get an edge over its competitors.

The Data Warehouse Architecture

A data warehouse has a three-tier architecture:

  1. Bottom tier- The bottom tier consists of the database server, which is a relational database system. Back-end tools and utilities feed data into it, performing functions such as extract, clean, load and refresh.
  2. Middle tier- This tier holds the OLAP server, which can be implemented in one of two ways:
     - Relational OLAP (ROLAP), an extended relational database management system that maps operations on multidimensional data to standard relational operations.
     - Multidimensional OLAP (MOLAP), which implements multidimensional data and operations directly.
  3. Top tier- This tier is the front-end client layer, which holds the query tools, reporting tools, analysis tools and data mining tools.
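
To make the relational OLAP idea in the middle tier more concrete, here is a minimal sketch of how a multidimensional "roll-up" can be mapped to a standard relational operation (a SQL `GROUP BY`). The fact table `sales_fact`, its dimensions and the sample rows are all assumptions made for illustration, and SQLite stands in for the bottom-tier database server; a real OLAP server is far more elaborate.

```python
import sqlite3

# Hypothetical dimensions of the illustrative fact table.
DIMENSIONS = ("product", "region", "year")


def build_sales_fact() -> sqlite3.Connection:
    """Create and fill a tiny in-memory fact table (made-up sample data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sales_fact ("
        " product TEXT, region TEXT, year INTEGER, amount REAL)"
    )
    conn.executemany(
        "INSERT INTO sales_fact VALUES (?, ?, ?, ?)",
        [
            ("laptop", "north", 2023, 100.0),
            ("laptop", "south", 2023, 150.0),
            ("phone", "north", 2023, 80.0),
            ("laptop", "north", 2024, 120.0),
        ],
    )
    return conn


def roll_up(conn: sqlite3.Connection, dimensions: list) -> list:
    """Aggregate sales over the given dimensions.

    The multidimensional operation is translated into plain SQL:
    the kept dimensions become the GROUP BY columns.
    """
    if not dimensions:
        raise ValueError("at least one dimension is required")
    for name in dimensions:
        # Only known column names are allowed into the SQL text.
        if name not in DIMENSIONS:
            raise ValueError(f"unknown dimension: {name}")
    columns = ", ".join(dimensions)
    sql = (
        f"SELECT {columns}, SUM(amount) FROM sales_fact"
        f" GROUP BY {columns} ORDER BY {columns}"
    )
    return conn.execute(sql).fetchall()


conn = build_sales_fact()
print(roll_up(conn, ["product"]))
# [('laptop', 370.0), ('phone', 80.0)]
print(roll_up(conn, ["product", "year"]))
# [('laptop', 2023, 250.0), ('laptop', 2024, 120.0), ('phone', 2023, 80.0)]
```

A multidimensional OLAP server would instead keep the data in a multidimensional structure and aggregate it there, without translating to SQL.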

From the architecture point of view, there are three data warehouse models:

  1. Virtual warehouse- A set of views over operational databases is called a virtual warehouse.
  2. Data mart- It contains a subset of the organisation-wide data.
  3. Enterprise warehouse- It collects all the information about the subjects spanning the entire organisation.
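
The difference between a virtual warehouse and a data mart can be sketched as follows. This is only an illustration under stated assumptions: the `orders` table, the `virtual_warehouse` view and the `sales_mart` table are hypothetical names, and SQLite stands in for both the operational database and the warehouse.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Hypothetical operational table.
    CREATE TABLE orders (order_id INTEGER, department TEXT, amount REAL);
    INSERT INTO orders VALUES (1, 'sales', 200.0), (2, 'hr', 50.0), (3, 'sales', 75.0);

    -- Virtual warehouse: a view computed from operational data on demand.
    CREATE VIEW virtual_warehouse AS
        SELECT department, SUM(amount) AS total
        FROM orders
        GROUP BY department;

    -- Data mart: a stored copy of one subset of the data.
    CREATE TABLE sales_mart AS
        SELECT * FROM orders WHERE department = 'sales';
    """
)

# A new operational row arrives after the mart was built.
conn.execute("INSERT INTO orders VALUES (4, 'sales', 25.0)")

view_total = conn.execute(
    "SELECT total FROM virtual_warehouse WHERE department = 'sales'"
).fetchone()[0]
mart_rows = conn.execute("SELECT COUNT(*) FROM sales_mart").fetchone()[0]

print(view_total)  # 300.0 (the view sees the new row immediately)
print(mart_rows)   # 2 (the mart keeps its contents until it is refreshed)
```

The view costs nothing to store but puts query load on the operational database, while the mart is a separate copy that has to be refreshed.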

3 Comments

  1. Data warehouse is a subject oriented, integrated and time variant and non-volatile collection of data.
    A data warehouse provides generalized and consolidated data in a multidimensional view.
    A data warehouse also provides online analytical processing tools. There are tools that help in interactive and effective analysis of
    data in a multidimensional space.
    Data warehouse systems help to integrate the diversity of system applications.
    Data warehouse is a three tier architecture
    1. Top tier – This tier consists of the front-end client layer, which holds the query tools, reporting tools, analysis tools and data mining tools.
    2. Middle tier – OLAP
    3. Bottom tier – Consists of the database server. It is the relational database system. Back-end tools and some utilities are used to feed the data. They perform functions like extract, clean, load and refresh.

  2. A data warehouse provides generalised and consolidated data in a multidimensional view. A data warehouse also provides online analytical processing tools. There are tools that help in interactive and effective analysis of data in a multidimensional space.
    Information stored in a data warehouse is:
    1. Subject oriented
    2. Integrated
    3. Time variant
    4. Non-volatile

    The data warehouse has a three-layered architecture
    1. Bottom layer: which is actually the server. The back-end tools and utilities do the loading functions.
    2. Middle layer: Has the OLAP server, which maps the operations on multidimensional data to standard relational operations.
    3. Top layer: Is the front-end client layer, which holds the tools for query, reporting, analysis and mining.

  3. Data Warehouse Testing
    A data warehouse provides generalised and consolidated data in a multidimensional view, and also provides online analytical processing tools. There are tools that help in interactive and effective analysis of data in a multidimensional space. Generally, a data warehouse is a database which is kept separate from the organisation's operational database.
    – Frequent updating is not done
    – Contains the consolidated historical data which helps the organisation to analyse its business.
    – Helps to integrate the diversity of system applications.

    4 Features of Data warehouse:
    1. Subject Oriented – Provides information about the subject rather than the organisation’s ongoing operations.
    2. Integrated – Constructed by integrating data from many different sources like relational databases, flat files etc.
    3. Time variant – The data gathered is identified with a particular time period.
    4. Non-volatile – The previous data is not erased when new data is added to it. A data warehouse is kept separate from the operational database.

    The Data warehouse is a three tier architecture:
    1. Bottom tier – Consists of the "Database Server", where the back-end tools and utilities are used to perform the loading functions
    2. Middle tier – OLAP server which maps the operations on multidimensional data to standard relational operations
    3. Top tier – Consists of the front-end client layer, which holds the query tools, reporting tools, analysis tools and data mining tools.
