Data Warehousing

How it works

A data warehouse is useful for automating reporting workflows, and essential for running sophisticated analytics on larger datasets

Your data warehouse should contain a copy of all key data in your various business systems, such as CRM (e.g. Salesforce), ERP (e.g. Netsuite), and WFM (e.g. Kronos). Data is replicated daily, or sometimes twice daily. Your business intelligence tools will point to the data warehouse as their data source to run analytics and generate reports

We recommend having a data warehouse in place before scaling up reporting and dashboarding

Engagement process

The deployment process typically follows six steps and can range from 4 weeks to 3+ months depending on the quantity and complexity of your systems

  1. We run discovery sessions with the business users to identify the key use cases, goals, and overall requirements
  2. Concurrent with step 1, we align with the IT team on the technical architecture, technology stack, naming conventions, and data governance
  3. Working with both the business and IT, we define the list of data sources and objects to replicate, as well as the ETL method for each object
  4. Our data engineering team spins up the data infrastructure, then develops and tests the replication pipelines
  5. While most tables will be in the Silver layer (meaning they will exactly match the sources), we also create a set of Gold layer denormalized tables for the frequently-used business concepts like sales
  6. We provide documentation and training to both the business users and the IT team  
Technology stack

We use the Microsoft Azure data stack, as mid-market companies are usually already in the Microsoft ecosystem, making for a seamless user experience. The specific products within the stack will be recommended after we understand your needs. We find that a simple setup with Azure SQL Server and Azure Data Factory is sufficient, cost-effective, and simple to manage for most SMEs, but if complex data transformations are required on extremely large volumes of data, we utilize Databricks and Python

Ongoing support

Replication pipelines may occasionally break due to issues in the source systems, such as expired credentials, connectivity issues, or schema changes. We will be available to fix any issue that comes up after project completion, either on a time-and-material basis when an issue is spotted, or with an on-going monitoring and maintenance service

Benefits
Why have a data warehouse?
Works with any analytics tool

Unlike business systems built for daily operations and limited to vendor-specific reporting tools, data warehouses are designed for analytics and can connect to any tool for reporting, analytics, and data science needs

Enables powerful insights

A data warehouse allows you to run algorithms on your data, combine data from different business systems, snapshot data that gets overwritten (e.g., current CRM opportunities), and more

Aggregates subsidiaries' data

Many SMEs have subsidiaries running on different ERPs, which are costly to change. A data warehouse offers a cost-effective way to combine data from these subsidiaries for reporting and analytics, while maintaining separate ERPs for operations

Pricing

Pricing varies significantly depending on the number of source systems, quantity of tables to replicate, volume of data, and complexity of data transformations required. Below is a range of typical pricing

Initial deployment
$15,000-$80,000

Discovery, implementation, documentation, training

Optional monthly maintenance
$500-$2,000

Monitoring, troubleshooting, minor data additions

Get in touch
other services

If you already have a data warehouse, here are our additional services that can help you make the most out of your data

We’d love
to work with you

Get in touch