Data Migration
Cloud Data Migration from Netezza to Google BigQuery
A major payment processing firm based in the USA wanted to build a cloud-based big data strategy that the enterprise and its external users could leverage to build better analytics and make better business decisions. One of the key components, and a major challenge, in the build process was the migration of on-premises data to the cloud.
Client Challenges and Requirements
- Migration scope included 23TB of data, 1,500 tables and more than 1,200 jobs running on Netezza
- Maintaining Netezza batch process performance (in compliance with the defined SLA) by balancing the data migration load against the regular batch process load
Bitwise Solution
Tools & Technologies We Used
Netezza
Ab Initio
Shell Script
Python
Control-M
GCS
Dataflow
BigQuery
Cloud SQL
Airflow
Key Results
Seamless migration of the data warehouse (DW) onto GCP
Successful migration of 23TB of historical data while ensuring data integrity
Automated data validation, resulting in 25% effort savings
Auto-generation of BQ YAML from Netezza DDL, resulting in 85% savings in the effort associated with DDL creation
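To illustrate the kind of DDL-to-YAML generation mentioned above, here is a minimal sketch in Python. It is not the tooling used on the project: the YAML layout, the `TYPE_MAP` type mapping, and the function names `column_block`, `split_columns` and `ddl_to_yaml` are all assumptions made for illustration. It handles only simple `CREATE TABLE` statements and skips table-level constraints.

```python
import re

# Assumed Netezza -> BigQuery type mapping (illustrative, not the project's rules).
TYPE_MAP = {
    "BYTEINT": "INT64", "SMALLINT": "INT64", "INTEGER": "INT64",
    "INT": "INT64", "BIGINT": "INT64",
    "NUMERIC": "NUMERIC", "DECIMAL": "NUMERIC",
    "REAL": "FLOAT64", "FLOAT": "FLOAT64", "DOUBLE PRECISION": "FLOAT64",
    "CHAR": "STRING", "VARCHAR": "STRING", "NCHAR": "STRING", "NVARCHAR": "STRING",
    "BOOLEAN": "BOOL", "DATE": "DATE", "TIME": "TIME",
    # Assumption: the source TIMESTAMP has no time zone, so DATETIME is used.
    "TIMESTAMP": "DATETIME",
}


def column_block(ddl):
    """Return (table_name, column_list_text) for the first CREATE TABLE."""
    match = re.search(r"CREATE\s+TABLE\s+([\w.]+)\s*\(", ddl, re.IGNORECASE)
    if not match:
        raise ValueError("no CREATE TABLE statement found")
    depth, start = 1, match.end()
    # Walk forward to the parenthesis that closes the column list.
    for i in range(start, len(ddl)):
        if ddl[i] == "(":
            depth += 1
        elif ddl[i] == ")":
            depth -= 1
            if depth == 0:
                return match.group(1), ddl[start:i]
    raise ValueError("unbalanced parentheses in DDL")


def split_columns(body):
    """Split on commas that are not nested inside parentheses."""
    parts, depth, current = [], 0, []
    for ch in body:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
        if ch == "," and depth == 0:
            parts.append("".join(current).strip())
            current = []
        else:
            current.append(ch)
    tail = "".join(current).strip()
    if tail:
        parts.append(tail)
    return parts


def ddl_to_yaml(ddl):
    """Render a simple YAML schema description from a CREATE TABLE statement."""
    table, body = column_block(ddl)
    lines = [f"table: {table}", "schema:"]
    for column in split_columns(body):
        # Skip table-level constraints; this sketch only maps columns.
        if re.match(r"(CONSTRAINT|PRIMARY|FOREIGN|UNIQUE)\b", column, re.IGNORECASE):
            continue
        col = re.match(r"(\w+)\s+([A-Za-z]+(?:\s+PRECISION)?)", column, re.IGNORECASE)
        if not col:
            raise ValueError(f"cannot parse column definition: {column!r}")
        source_type = " ".join(col.group(2).upper().split())
        if source_type not in TYPE_MAP:
            raise ValueError(f"unmapped source type: {source_type}")
        required = re.search(r"\bNOT\s+NULL\b", column, re.IGNORECASE)
        lines.append(f"  - name: {col.group(1)}")
        lines.append(f"    type: {TYPE_MAP[source_type]}")
        lines.append(f"    mode: {'REQUIRED' if required else 'NULLABLE'}")
    return "\n".join(lines)


# Hypothetical table, used only to exercise the sketch.
SAMPLE_DDL = """
CREATE TABLE PAYMENTS (
    TXN_ID BIGINT NOT NULL,
    AMOUNT NUMERIC(18,2),
    MERCHANT VARCHAR(64),
    CREATED_AT TIMESTAMP NOT NULL
) DISTRIBUTE ON (TXN_ID);
"""
print(ddl_to_yaml(SAMPLE_DDL))
```

Raising an error on an unmapped type, rather than defaulting to `STRING`, is a deliberate choice in this sketch: across some 1,500 tables, a silent fallback would be harder to catch during validation than an explicit failure at generation time.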