Note that there are some explanatory texts on larger screens.

plurals
  1. POStructure within staging area of data warehouse
    text
    copied!<p>We are working on a datawarehouse for a bank and have pretty much followed the standard Kimball model of staging tables, a star schema and an ETL to pull the data through the process.</p> <blockquote> <p>Kimball talks about using the staging area for import, cleaning, processing and everything until you are ready to put the data into the star schema. In practice this typically means uploading data from the sources into a set of tables with little or no modification, followed by taking data optionally through intermediate tables until it is ready to go into the star schema. That's a lot of work for a single entity, no single responsibility here.</p> </blockquote> <p>Previous systems I have worked on have made a distinction between the different sets of tables, to the extent of having:</p> <ul> <li><strong>Upload tables</strong>: raw source system data, unmodified</li> <li><strong>Staging tables</strong>: intermediate processing, typed and cleansed</li> <li><strong>Warehouse tables</strong></li> </ul> <p>You can stick these in separate schemas and then apply differing policies for archive/backup/security etc. One of the other guys has worked on a warehouse where there is a <em>StagingInput</em> and a <em>StagingOutput</em>, similar story. The team as a whole has a lot of experience, both datawarehouse and otherwise.</p> <p>However, despite all this, looking through Kimball and the web there seems to be absolutely nothing in writing about giving any kind of structure to the staging database. One would be forgiven for believing that Mr Kimball would have us all work with staging as this big deep dark unstructured pool of data.</p> <p>Whilst of course it is pretty obvious how to go about it if we want to add some more structure to the staging area, it seems very odd that there seems to be nothing written about it.</p> <p>So, what is everyone else out there doing? Is staging just this big unstructured mess or do folk have some interesting designs on it?</p>
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload