I'm a data engineer and Microsoft Data Platform MVP. I specialise in building and automating analytics platforms using Microsoft technologies from the SQL Server stack to the Azure cloud. I'm a fervent advocate of DataOps, with a technical focus on bringing automation to both analytics development and operations. An active member of the data community and keen knowledge-sharer, I'm a volunteer, organiser, speaker, blogger, open-source contributor and author – the second edition of my book Azure Data Factory by Example will be published by Apress in April 2024. I have a PhD in Computer Science and participate enthusiastically in choral singing, craft ale and cocktails, most successfully in that order.

adf-by-example-2nd-ed.jpg

The new edition of Azure Data Factory by Example will be published on April 7th 2024. If you're in a terrible hurry, the first edition is still available 😊.

Hit the ground running in Azure Data Factory v2 (ADF) with my book Azure Data Factory by Example. The tutorial-first approach gets you working from the first chapter, explaining key ideas naturally as you encounter them. From creating your first data factory to building complex, metadata-driven nested pipelines, the book guides you through essential concepts in Microsoft's cloud-based ETL/ELT platform. It introduces components indispensable for the movement and transformation of data in the cloud. Then it demonstrates the tools necessary to orchestrate, monitor, and manage those components.

The hands-on introduction to ADF found in this book is equally well-suited to data engineers embracing their first ETL/ELT toolset as it is to seasoned veterans of Microsoft's SQL Server Integration Services (SSIS). The example-driven approach leads you through ADF pipeline construction from the ground up, introducing important ideas and making learning natural and engaging. SSIS users will find concepts with familiar parallels, while ADF-first readers will quickly master those concepts through the book's steady building up of knowledge in successive chapters. Summaries of key concepts at the end of each chapter provide a ready reference that you can return to again and again.

Power BI Azure Pipelines A multi-part series on creating a pro dev experience for Power BI, aimed at bringing DevOps or DataOps approaches to Power BI report development. In the series I'll be assembling an approach to creating developer workflows that provide meaningful version control and automate deployment to multiple environments. I'll be writing a new post every couple of weeks and will link them here when they're done:

  1. Deploy TMDL datasets to Power BI
  2. Better version control for Power BI reports
  3. Automated testing for Power BI datasets

A six-part series on test automation for Azure Data Factory pipelines, now referenced in Microsoft's documentation for DataOps in ADF. Automated testing is a key component of CI/CD development pipelines: it allows you to run more tests, more often, more quickly, with guaranteed repeatability. In the series I build up an approach to writing tests for ADF and triggering them automatically.

If you're wondering if all this testing is worth the trouble, I have some thoughts on the subject here 😃.

Sprockit is a powerful but lightweight ETL process controller for managing SSIS package and T-SQL stored procedure execution. It's free, open-source and written purely in T-SQL. You can download it here.