This repository contains a collection of scripts and utilities that assist with migrating various workloads to Microsoft Fabric.
New! Complete migration guide with Python/Bash scripts (no PowerShell required):
- 📘 Migration Guide - Comprehensive step-by-step guide
- 📚 ETL Library Documentation - Complete API reference for Migration Scripts ETL Library
- 🔄 Data Type Mapping - Handle datatype differences between platforms
- 🔐 Permissions Guide - Set up all required permissions
- ⚡ Quick Start - Get started in 15 minutes
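The Data Type Mapping guide above covers datatype differences between the two platforms. As a rough illustration of the idea, the sketch below substitutes a few types commonly cited as unsupported in Fabric Warehouse. The specific mappings and the `map_type` helper are assumptions for demonstration only; the repository's Data Type Mapping guide is the authoritative reference.

```python
# Illustrative sketch only: a handful of assumed Synapse -> Fabric Warehouse
# type substitutions. Consult the Data Type Mapping guide for the real list.
TYPE_MAP = {
    "money": "decimal(19, 4)",
    "smallmoney": "decimal(10, 4)",
    "datetime": "datetime2(6)",
    "nvarchar": "varchar",
    "nchar": "char",
}

def map_type(synapse_type: str) -> str:
    """Return a Fabric Warehouse equivalent for a Synapse column type.

    Types without a listed substitution pass through unchanged. Length
    specifiers such as nvarchar(100) are not preserved in this sketch.
    """
    base = synapse_type.split("(")[0].strip().lower()
    return TYPE_MAP.get(base, synapse_type)
```

For example, `map_type("money")` yields `decimal(19, 4)`, while a type with no entry, such as `int`, is returned as-is.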
All migration scripts are located in the /scripts directory:
```bash
# 1. Set up the environment
cd scripts
./setup_environment.sh

# 2. Run pre-migration checks
./pre_migration_checks.sh

# 3. Extract data from the Azure Synapse dedicated SQL pool
python3 extract_data.py \
    --server mysynapse.sql.azuresynapse.net \
    --database mydatabase \
    --storage-account mystorageaccount \
    --container migration-staging \
    --parallel-jobs 6

# 4. Load data into the Fabric Warehouse
python3 load_data.py \
    --workspace myworkspace \
    --warehouse mywarehouse \
    --storage-account mystorageaccount \
    --container migration-staging \
    --parallel-jobs 8 \
    --validate-rows

# 5. Validate the migration
python3 validate_migration.py \
    --source-server mysynapse.sql.azuresynapse.net \
    --source-database mydatabase \
    --target-workspace myworkspace \
    --target-warehouse mywarehouse \
    --generate-report
```

See scripts/README.md for detailed documentation.
New! Interactive PySpark notebooks for running migration steps in Fabric:
All migration notebooks are located in the /notebooks directory:
- 01_extract_data.ipynb - Extract data from Azure Synapse to ADLS
- 02_load_data.ipynb - Load data from ADLS to Fabric Warehouse
- 03_validate_migration.ipynb - Validate migration completeness
- Helper Functions - Shared utilities for connections and operations
See notebooks/README.md for detailed documentation on running notebooks in Fabric.
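The helper functions above centralize connection setup for the notebooks. The notebooks' actual ConnectionHelper API is documented in notebooks/README.md; purely as an illustration of the idea, a stand-in that assembles an ODBC connection string for a warehouse SQL endpoint might look like this. The driver name and authentication option are assumptions, not the notebooks' confirmed settings.

```python
def build_connection_string(server: str, database: str) -> str:
    """Assemble an ODBC connection string for a warehouse SQL endpoint.

    Hypothetical sketch of what a connection helper might produce;
    the driver and Authentication values here are assumptions.
    """
    parts = {
        "Driver": "{ODBC Driver 18 for SQL Server}",
        "Server": server,
        "Database": database,
        "Encrypt": "yes",
        "Authentication": "ActiveDirectoryInteractive",
    }
    return ";".join(f"{key}={value}" for key, value in parts.items())
```

A caller would pass the workspace's SQL endpoint hostname and warehouse name, then hand the resulting string to an ODBC client such as pyodbc.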
New! Comprehensive documentation for the Python ETL library powering the migration scripts:
📚 ETL Library Documentation - Complete API reference covering:
- DataExtractor - Extract data from Synapse to ADLS Gen2 using CETAS
- DataLoader - Load data from ADLS Gen2 to Fabric Warehouse using COPY INTO
- MigrationValidator - Validate row counts and data integrity
- ConnectionHelper - Database connection utilities for PySpark notebooks
- MigrationUtils - Common migration operations and helpers
- StorageHelper - Azure Data Lake Storage operations
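To make the MigrationValidator's role concrete, the sketch below shows the core of a row-count comparison in plain Python. It is illustrative only: the real class queries both systems itself, whereas this stand-in takes pre-fetched counts as dictionaries.

```python
def compare_row_counts(source: dict[str, int], target: dict[str, int]) -> list[str]:
    """Report tables whose row counts differ or that are missing on the target.

    Illustrative stand-in for the validation logic; the actual
    MigrationValidator API is described in the ETL Library Documentation.
    """
    issues = []
    for table, src_rows in sorted(source.items()):
        tgt_rows = target.get(table)
        if tgt_rows is None:
            issues.append(f"{table}: missing on target")
        elif tgt_rows != src_rows:
            issues.append(f"{table}: source={src_rows}, target={tgt_rows}")
    return issues
```

An empty result list means every source table was found on the target with a matching count; anything else is a per-table discrepancy report.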
The documentation includes:
- Architecture and design patterns
- Complete API reference for all classes and methods
- Usage examples (CLI, Python scripts, PySpark notebooks)
- Best practices for performance, security, and monitoring
- Comprehensive troubleshooting guide
- [NEW] Comprehensive Migration Guide - Complete guide with scripts
- [NEW] PySpark Notebooks - Interactive notebooks for Fabric
- [NEW] Data Type Mapping Guide - Datatype compatibility reference
- [NEW] Permissions Guide - Security and access setup
- Official Microsoft documentation
- Existing PowerShell scripts and utils
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.
When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.
This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.