Database Management Systems - Prof. Holowczak

Zicklin School of Business - Baruch College
City University of New York

Database Management Systems II


Data Warehousing

What You'll Learn This Week

Elmasri/Navathe (3rd ed.) Connolly and Begg (3rd ed.) McFadden (5th ed.)
Chapter 26 Chapter 30, 31 Chapter 14

Data Warehousing

Components of a Data Warehousing System


Data Warehouse Design

Star Schema


Store Dimension

StoreDimID Store District State Region Country
SD101 101 North New Jersey NJ NorthEast US
SD102 102 North New Jersey NJ NorthEast US
SD103 103 South New Jersey NJ NorthEast US
SD104 104 South New Jersey NJ NorthEast US
SD501 NULL North New Jersey NJ NorthEast US
SD502 NULL South New Jersey NJ NorthEast US
SD551 NULL NULL NJ NorthEast US
SD552 NULL NULL PA NorthEast US
SD553 NULL NULL NY NorthEast US
SD562 NULL NULL FL SouthEast US
SD901 NULL NULL NULL NorthEast US
SD902 NULL NULL NULL SouthEast US
SD951 NULL NULL NULL NULL US

TimeDimension
TimeDimID Day Week Month Quarter Year
TD10001 NULL NULL NULL NULL 1998
TD10002 NULL NULL NULL 1st 1998
TD10003 NULL NULL NULL 2nd 1998
TD10004 NULL NULL NULL 3rd 1998
TD10005 NULL NULL NULL 4th 1998
TD10006 NULL NULL Jan. 1st 1998
TD10007 NULL NULL Feb. 1st 1998
TD10008 NULL NULL Mar. 1st 1998
TD10009 NULL NULL Apr. 1st 1998
TD10010 NULL NULL May. 2nd 1998
...
TD10101 NULL 1/4 Jan. 1st 1998
TD10102 NULL 1/11 Jan. 1st 1998
TD10103 NULL 1/18 Jan. 1st 1998
TD10104 NULL 1/25 Jan. 1st 1998
...
TD12101 1/4 1/4 Jan. 1st 1998
TD12102 1/5 1/4 Jan. 1st 1998
TD12103 1/6 1/4 Jan. 1st 1998
TD12104 1/7 1/4 Jan. 1st 1998
TD12105 1/8 1/4 Jan. 1st 1998
TD12106 1/9 1/4 Jan. 1st 1998
TD12107 1/10 1/4 Jan. 1st 1998
TD12108 1/11 1/11 Jan. 1st 1998
TD12109 1/12 1/11 Jan. 1st 1998
...

Product Dimension
ProdDimId SKU Package Brand SubCategoryCategory
PD100001 NULL NULL NULL NULL Canned Food
PD100002 NULL NULL NULL Soups Canned Food
PD100003 NULL NULL NULL Vegetables Canned Food
PD100101 NULL NULL Campbells Soups Canned Food
PD100102 NULL NULL ShopRite Soups Canned Food
PD100104 NULL NULL ShopRite Vegetables Canned Food
PD100301 NULL 12 oz. Campbells Soups Canned Food
PD100901 99998 12 oz. Campbells Soups Canned Food
PD100902 99997 12 oz. Campbells Soups Canned Food
PD100903 99996 12 oz. Campbells Soups Canned Food

Fact Table
TimeDimID ProdDimID StoreDimID DollarsSold UnitsSold DollarsCost
TD10001 PD100001 SD951 $4,300,000 8,000,000 $2,110,000
TD10001 PD100803 SD101 $ 90,000 170,000 $45,000
TD10103 PD100104 SD901 etc.

Snowflake Schema

Constellation Schema

Some Issues in DW Systems

Data Marts

Data Mart - Same DW technologies applied at the divisional/departmental level.

Two approaches to Data Marts:

  1. Departmental operational systems summarized into departmental data mart. Then abstract data from each data mart and load into Data warehouse.
    • Advantages: Data marts can be built more quickly in this fashion due to local autonomy. Also, not as data/resource intensive as the full data warehouse.
    • Disadvantages: Data marts may not be compatible with one another. This might cause even more integration problems at the warehouse level. Organization-wide metadata standards can help in these cases.
  2. Departmental operational systems summarized into organizational data warehouse. Subsets of data are copied from the data warehouse into departmental data marts.
    • Advantages: Compatible data models for data marts are easy to define - just choose dimensions and facts from the DW that deal with the department.
    • Disadvantages: The organizational DW must be constructed first.

Further Reading


File: dw_index.html Date: Mon Dec 2 10:18:41 EST 2002
All materials Copyright, 1997-2002 Richard Holowczak