Category Archives: Fabric

Microsoft Fabric is an advanced data platform designed for efficient management, integration, and analysis of enterprise data. It enables the centralization of data from various sources, ensures data quality, and makes it accessible for both analytical and operational purposes.

Fabric provides multiple data components within a single solution – including a data lakehouse, data warehouse, ETL/ELT processing tools, metadata management, data cataloging, access control, and support for building data models and analytical workspace environments. Functions included:

  • Data pipelines and dataflows for ELT orchestrations
  • Notebooks (PySpark, SparkR, SparkSQL, Python)
  • Data lakehouses
  • Dara warehouses
  • SQL Databases – something like Azure SQL
  • Native support for all Azure resources within the same tenant using managed identity (Storage, ADF, Container apps,..)
  • and more

Latest posts from the category:

Microsoft Fabric – Intro, Core Concepts – Why It’s so OP?

This article serves as an introductory overview of MS Fabric and as a resource for a basic evaluation of this tool. It also functions as a hub for additional articles, where individual concepts and topics are covered in more detail. The target audience consists of individuals considering trying, learning or implementing Fabric in their organization.… Read More »

Fabric | Getting Started with Data Factory, Pipelines, and Connectors

This article is intended primarily for managers, IT specialists, and technical decision-makers who are evaluating Microsoft Fabric and are considering its implementation. It is also intended for the broader professional audience that is exploring Fabric, as well as for educational purposes. The text focuses on the principles of Fabric Data Factory, key technical concepts, and… Read More »

Bulk Table Automated Ingestion in Microsoft Fabric Data Factory Using a Single Pipeline and JSON Configuration

This article is a technical guide on how to implement a bulk table import using a single pipeline in Fabric Data Factory (or Azure Data Factory) by using a ForEach container and an external JSON configuration file. This approach enables easy addition, removal, and modification of tables without changing the pipeline itself. The JSON file… Read More »

Fabric | dbt – How I Build Gold Layer Dimensional Tables (SCD2) in Data Projects

In the previous article Slowly Changing Dimension (SCD 2) – Snapshots I demonstrated how to implement historical tracking of dimensional data in the Silver layer within Fabric using dbt snapshots. These snapshots are already prepared and provide a complete history of dimensional changes (using the valid_from and valid_to attributes). In the Gold layer, however, we… Read More »

Fabric – Bronze: Data ingestion into Delta tables using pipeline (notebook)

In several previous articles, I discussed how to configure ADLS Gen2 for storing source .parquet data and also how to connect ADLS Gen2 directly to the Fabric Lakehouse using a shortcut. To recap – in the Fabric environment, we have a prepared Bronze Lakehouse, and within this lakehouse, we have a connection to the parquet… Read More »

Fabric | dbt – Shortcuts and how to connect ADLS Gen2 with Fabric Lakehouse

In the previous article – Fabric – ADLS Gen2 and Parquet – Storage Configuration and Bronze Data Format – I described how Azure ADLS Gen2 is set up for storing and archiving our source parquet files. In order to upload our source data into the Fabric Bronze layer and subsequently process it using dbt, we… Read More »

Fabric – ADLS Gen2 and Parquet – Configuring Storage and the Bronze Data Format

In a data architecture based on the Medallion Architecture approach, the Bronze layer represents the first stage of data processing – this is where raw, minimally transformed data from various source systems arrive. In the Fabric article series, we implement a data solution where the Bronze layer is implemented as a Lakehouse in Fabric in… Read More »

Fabric – Workspace and Task Flow – Organizing Processes, Artifacts, and Domains

As part of the series of articles on the Microsoft Fabric platform, we focus on various features and artifacts of this data workspace environment, covering data acquisition (ingest), transformation, analytics, and data product management for efficient and secure development, orchestration, and operation of data solutions within a single integrated ecosystem. The primary artifacts that we… Read More »

Fabric | dbt – Slowly Changing Dimension (SCD 2) – Snapshots and Check Strategy in dbt

Slowly Changing Dimensions (SCD) represent a way to store and manage historical changes of dimensions over time in a data warehouse. To just recap the theory of what facts and dimensions are, I recommend to check the article – Facts and Dimensions – Tables in a Data Warehouse before continuing. In the context of a… Read More »

Fabric – Pipelines and SecureString Exposure Risk – Key Vault for Secure Secret Handling

In Fabric, Pipelines are commonly used to automate data workflows. These pipelines often need to pass authentication credentials, such as API keys or passwords (so-called secrets), to notebooks that execute some code. Although the platform allows “secure” storage of these values within the Pipeline as SecureString parameters, the actual transfer to the Notebook runtime environment… Read More »