Introduction Databricks offers great functionality in the form of table CLONING, which works not only for Delta Tables but also for Parquet Table and Iceberg Tables. In this article we will focuse on Delta Tabels. Please note that in this article, whenever I mention: Source Table: This refers to the original table used as the […]
Latest Posts
Azure Policy – an underrated component of a scalable data platform (part1)
Recently, we’ve been encountering an increasing number of projects where a complete data platform has to be designed and built almost from scratch. In such projects, in addition to the typical duties and tasks in the data area, such as data modeling, designing and implementing pipelines, or the final reporting layer, there is a growing […]
Why should I ‘refresh’ Direct Lake models in Microsoft Fabric?
You’ve probably heard about a new mode for datasets/semantic models called Direct Lake. This mode combines the advantages of existing modes: the performance of Import mode and the lack of need to refresh or cache data from Direct Query. It works by directly accessing files from a one lake without requiring extra steps. This means […]
Notebook orchestration in Microsoft Fabric made easy
When I meet with customers, we discuss how to efficiently organize notebook execution via orchestration mechanism. They often mention managing it manually using tools like Azure Data Factory. On the other hand, they sometimes express that Data Factory is too simplistic and they require other tools to address their needs. Similar questions arise regarding Microsoft […]
Last comments