What is the Kimball Architecture?

On every job interview I go I am asked if I know what Kimbal Architecture is?
Here what Kimball group say their architecture is on their web site:

The Kimball Architecture separates the data and processes comprising the DW/BI system into the backroom extract, transformation and load (ETL) environment and the front room presentation area, as illustrated in the following diagram.
Figure 1: Kimball architecture diagram.

The Kimball Architecture focuses on the following components:

  • Backroom ETL system: The Kimball Group has identified 34 subsystems in the ETL process flow, grouped into four major operations:extracting the data from the sources, performing cleansing and conforming transformations, delivering it to the presentation server, and managing the ETL process and back room environment.
  • Front room presentation area: The Kimball Architecture presumes the data utilized by the BI applications is dimensionally-structured, organized by business process, atomically-grained (complemented by aggregated summaries for performance tuning), and tied together by the enterprise data warehouse bus architecture, as described earlier on this page.
  • Front room BI applications: The front room is the public face of the DW/BI system; it’s what business users see and work with day-to-day. There’s a broad range of BI applications supported by BI management services in the front room, including ad hoc queries, standardized reports, dashboards and scorecards, and more powerful analytic or mining/modeling applications.
  • Metadata: Metadata is all the information that defines and describes the structures, operations, and contents of the DW/BI system.Technical metadata defines the objects and processes which comprise the DW/BI system. Business metadata describes the data warehouse contents in user terms, including what data is available, where did it come from, what does it mean, and how does it relate to other data. Finally, process metadata describes the warehouse’s operational results.

Some organizations adopt an alternative data warehouse architecture that includes a third normal form (3NF) relational data warehouse. This hub-and-spoke architecture, often called the Corporate Information Factory (CIF), includes a data acquisition ETL process to gather, clean and integrate data similar to the backroom ETL system described above. With the CIF, atomic data is loaded into third normal form structures, typically called the enterprise data warehouse (EDW). Another ETL data delivery process then populates downstream reporting and analytic environments supporting the business users; these environments are typically structured dimensionally.
A modification to the Kimball Architecture, sometimes referred to as a hybrid architecture, leverages an existing 3NF data warehouse as the source of clean, integrated data to feed the front room presentation area described above.


Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.