MSSQL to Snowflake Integration for a Retail Giants: Accelerating Real-Time Analytics and Driving Growth

Client Overview: A Retail Giant

The client is a leading US-based retail chain specializing in a wide range of food supplies and services. Known for their high-quality products, they have built a strong reputation in the retail industry for exceptional customer service, supply chain efficiency, and seamless order fulfillment.

Client Requirement to Integrate Snowflake in Place of MSSQ

Replace the existing SSIS process with a Python-based ETL pipeline for data integration from MSSQL to Snowflake.

Data is transferred from the MSSQL Server to an Amazon S3 bucket in the form of CSV files.

An ETL pipeline, utilizing a Python script, is to be established to replace the current SSIS process and move data from the MSSQL Server to the Amazon S3 bucket.

The data from the S3 bucket needs to be loaded into Snowflake.

How GetOnData Helped Implement Snowflake?

Bullet point

Created reconfiguration tables containing select queries for all necessary fields, source table names, target table names, header names, process names, an S3 key, and a destination folder in AWS.

Bullet point

Developed a Python script that runs daily at a specific time using a cron job on an EC2 instance.

Bullet point

The script retrieves metadata from the configuration tables and connects to the source MSSQL Server to fetch all records.

Bullet point

Delimited data is then written to CSV files.

Bullet point

The CSV files are then transferred to the specified destination folder in the S3 bucket.

Bullet point

Processes are logged in Snowflake tables.

Bullet point

Then the data is moved from the AWS S3 bucket to the Snowflake Data Warehouse using the lambda function.

Bullet point

GitLab is being used as a version control tool for managing and collaborating on code development.

Our Solution

How the Retail Giant Benefited by Integrating Snowflake?

Reusable Python Script

The new design allows the reuse of the generic Python script for future data imports.

Eliminates SSIS Package Creation

The need to create a new SSIS package or SQL Server Agent job for each table import is eliminated.

Reduces SSIS Developer Team

This new design architecture allows customer to reduce the SSIS developer team.

Cost Saving

The result in cost savings on investing in SSIS licenses.

Case Studies

Start your journey towards data-driven excellence.