Understanding the True Cost of a Query in Microsoft Fabric
How to Calculate POC Costs in Microsoft Fabric
When choosing a data platform, cost and performance are crucial factors to consider, typically evaluated through a Proof of Concept (POC). In this blog, we will provide guidance and a tool to help you calculate costs from a POC using Microsoft Fabric and estimate your Total Cost of Ownership (TCO).
Evaluating Cost and Performance
As noted above, cost and performance are typically evaluated through a POC. In most cloud data warehouses, you can calculate Cost per Query as:
Cost per Query = # of Compute Units x Price per Compute Unit per Hour x Duration in Hours
As stated in Microsoft’s Synapse POC Playbook, using a Price/Performance metric like Cost per Query “provides a single comparable number for each performance test”.
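In code form, the generic formula looks like this (a minimal sketch in Python with illustrative numbers, not real pricing):

```python
# Generic cloud DW cost-per-query calculation (illustrative numbers only).
def cost_per_query(compute_units: float, price_per_unit_hour: float, duration_hours: float) -> float:
    """Cost per Query = # of Compute Units x Price per Compute Unit per Hour x Duration in Hours."""
    return compute_units * price_per_unit_hour * duration_hours

# Example: 4 compute units at a hypothetical $1.50/unit/hour for a 30-second query.
print(cost_per_query(compute_units=4, price_per_unit_hour=1.50, duration_hours=30 / 3600))  # -> 0.05
```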
Understanding Microsoft Fabric’s Pricing Structure
However, the equation is not as straightforward in Microsoft Fabric. Microsoft Fabric introduced a new pricing structure for data and AI workloads built on capacity licensing and bursting. With the capacity licensing model, you select a fixed capacity size corresponding to a number of Capacity Units (CUs). Each CU corresponds to a different number of v-cores depending on the service (0.125 for Power BI, 0.5 for DW, or 2 for Spark). For example, an F64 capacity should provide up to 32 DW v-cores. However, with Bursting in Fabric, a Fabric warehouse can use up to 12X that capacity for queries, meaning up to 384 DW v-cores.

While this may sound great in isolation, in production you will likely be running queries across many warehouses and lakehouses, with a mix of ad-hoc, BI, and ETL workloads and concurrency across hundreds or thousands of analysts, all of which can burst. While the cost gets smoothed over a 24-hour period, you still need to pay it back over time. It's like using Buy Now, Pay Later: it helps you make a big purchase you can't afford right now (Bursting), but you still need to pay it off in regular installments over time (Smoothing).
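To make the conversions concrete, here is a small sketch using the ratios quoted above (the 12X factor is the warehouse bursting ceiling just described):

```python
# CU-to-v-core ratios quoted above, plus the warehouse bursting ceiling.
VCORES_PER_CU = {"Power BI": 0.125, "Warehouse": 0.5, "Spark": 2.0}
MAX_BURST_FACTOR = 12  # a warehouse query can burst up to ~12X the purchased capacity

def baseline_vcores(capacity_cus: int, workload: str) -> float:
    """v-cores available at the purchased capacity, before bursting."""
    return capacity_cus * VCORES_PER_CU[workload]

f64_dw = baseline_vcores(64, "Warehouse")    # 32.0 DW v-cores at baseline
f64_dw_burst = f64_dw * MAX_BURST_FACTOR     # up to 384.0 DW v-cores when bursting
print(f64_dw, f64_dw_burst)
```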
To understand the true cost including bursting, you will instead need to calculate the Actual CUs used per query. According to Bradley Schacht from the Fabric CAT team, figuring this out today is "far more cumbersome than you'd like". The Query Insights DMV provides query text and duration but not CUs consumed. The Capacity Metrics App provides CUs consumed but cannot be queried programmatically or drilled down to the query level easily.
Fabric POC Cost Analyzer Notebook
To address this, Xorbix Technologies built on the work of Bradley and Microsoft CSA Kristian Bubalo, who both used Semantic Link to query the Capacity Metrics semantic model. We developed a cost analyzer notebook that joins this capacity data with Query Insights to extract query-level duration, CUs, and query text, and then calculates actual vs. expected CUs, the bursting multiplier, and actual vs. expected query cost. The tool also recommends the Fabric capacity size needed to execute queries without bursting, allowing you to estimate TCO for production scenarios.
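As a flavor of the approach, Semantic Link can pull operation-level CU data out of the Capacity Metrics semantic model from a Fabric notebook. The sketch below is hedged: the workspace, dataset, table, and column names are assumptions based on the Capacity Metrics app and may differ in your tenant, so verify them before running:

```python
# Hedged sketch: querying the Capacity Metrics semantic model with Semantic Link,
# following the approach described by Bradley Schacht and Kristian Bubalo.
# Workspace, dataset, table, and column names below are ASSUMPTIONS -- adjust to your tenant.
import sempy.fabric as fabric

CAPACITY_METRICS_WORKSPACE = "Microsoft Fabric Capacity Metrics"  # assumed workspace name
CAPACITY_METRICS_DATASET = "Fabric Capacity Metrics"              # assumed semantic model name

dax = """
EVALUATE
SELECTCOLUMNS(
    'TimePointBackgroundDetail',  -- assumed table name
    "OperationId", 'TimePointBackgroundDetail'[Operation Id],
    "Timestamp",   'TimePointBackgroundDetail'[Timestamp],
    "CUs",         'TimePointBackgroundDetail'[Total CU (s)]
)
"""

cu_by_operation = fabric.evaluate_dax(
    dataset=CAPACITY_METRICS_DATASET,
    dax_string=dax,
    workspace=CAPACITY_METRICS_WORKSPACE,
)
# The next step (not shown) joins this against Query Insights on the statement/operation
# identifier to line up CU consumption with query text and duration.
print(cu_by_operation.head())
```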
POC Cost Analysis Example
To illustrate this, we tested using the Taxi Cab Azure Open Dataset, combining Yellow, Green, and For-Hire taxi datasets into a single ~50 GB nyc_taxi Lakehouse table. This dataset is suitable for analytics and BI use cases, representing a small data warehouse size typical for teams like HR or Finance.
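For reference, a table like this can be assembled in a Fabric notebook from the NYC Taxi Azure Open Datasets. The sketch below is an assumption about paths and schemas rather than the exact code we used; the yellow, green, and FHV feeds have different columns, so a careful load needs per-feed column mapping:

```python
# Hedged sketch: combining the NYC Taxi Azure Open Datasets into one Lakehouse table.
# Storage paths and the simple union are assumptions; verify paths/schemas before running.
# `spark` is the session that Fabric notebooks provide out of the box.
from pyspark.sql import functions as F

base = "wasbs://nyctlc@azureopendatastorage.blob.core.windows.net"  # public Azure Open Datasets container

frames = []
for feed in ["yellow", "green", "fhv"]:
    df = spark.read.parquet(f"{base}/{feed}/").withColumn("taxi_type", F.lit(feed))
    frames.append(df)

# unionByName(allowMissingColumns=True) papers over schema differences for a rough POC table.
nyc_taxi = frames[0]
for df in frames[1:]:
    nyc_taxi = nyc_taxi.unionByName(df, allowMissingColumns=True)

nyc_taxi.write.mode("overwrite").format("delta").saveAsTable("nyc_taxi")
```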
Using an F8 capacity (4 DW v-cores before bursting), we ran 10 realistic SQL queries five times each, analyzing revenue patterns, trip durations, and location data for operational optimization and strategic planning. The results and cost analysis are here and below.
In the first chart, we see the average burst multiplier for each query. Running typical analytical queries against an F8 capacity on a 50 GB dataset resulted in burst multipliers from 2.7X to 8.1X, meaning Fabric allocated between 10.8 and 32.4 v-cores on average.
The second chart shows the real-world TCO of Microsoft Fabric. Because all components (DW, ETL, BI) as well as OneLake data access require a running capacity, keeping a Fabric capacity running 24x7 is practically a requirement. With that in mind, the baseline F8 capacity costs ~$13K/year in West US 2. To avoid throttling in production, the tool recommended the capacities above based on each query's burst multiplier. For example, Query 3's 5.9X burst multiplier means it required 23.6 v-cores. Because there is no F47.2 capacity that would provide 23.6 v-cores, this is rounded up to an F64 capacity (32 v-cores).
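The Query 3 rounding logic can be reproduced in a few lines (a small sketch; the SKU list and the 0.5 DW v-cores per CU ratio are the figures quoted earlier in this article):

```python
# Reproducing the Query 3 example: a 5.9X burst on an F8 implies ~47.2 CUs (23.6 DW v-cores),
# which rounds up to the next real SKU, F64.
FABRIC_SKUS = [2, 4, 8, 16, 32, 64, 128, 256, 512, 1024, 2048]  # F2 .. F2048, sized in CUs
DW_VCORES_PER_CU = 0.5

def recommended_sku(baseline_cus: int, burst_multiplier: float) -> str:
    """Smallest SKU whose CUs cover the query's peak (baseline x burst multiplier)."""
    needed_cus = baseline_cus * burst_multiplier
    return f"F{next(sku for sku in FABRIC_SKUS if sku >= needed_cus)}"

baseline_cus = 8                                 # F8 capacity
burst = 5.9                                      # Query 3's average burst multiplier
needed_cus = baseline_cus * burst                # 47.2 CUs
needed_vcores = needed_cus * DW_VCORES_PER_CU    # 23.6 DW v-cores
print(needed_cus, needed_vcores, recommended_sku(baseline_cus, burst))  # 47.2 23.6 F64
```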
Across the 10 queries, the recommended capacity size would be F64, representing a TCO of ~$101K/year, significantly higher than the ~$13K/year expected for an F8 capacity. This example shows that the true cost of maintaining a small 50 GB warehouse can be over 7.5X higher than expected when bursting is not accounted for. We hope this helps with your testing!
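A quick way to sanity-check these numbers: at the West US 2 pay-as-you-go rate implied by the figures above (roughly $0.18 per CU per hour, which you should verify against current pricing for your region), running a capacity 24x7 works out as follows:

```python
# 24x7 annual cost for F8 vs. F64. The $0.18/CU/hour rate is inferred from the article's
# ~$13K (F8) and ~$101K (F64) figures for West US 2 pay-as-you-go; check current pricing.
HOURS_PER_YEAR = 24 * 365
PRICE_PER_CU_HOUR = 0.18  # USD (assumed)

def annual_cost(capacity_cus: int) -> float:
    return capacity_cus * PRICE_PER_CU_HOUR * HOURS_PER_YEAR

f8, f64 = annual_cost(8), annual_cost(64)
print(f"F8:  ${f8:,.0f}/year")    # ~$12,614
print(f"F64: ${f64:,.0f}/year")   # ~$100,915
print(f"Ratio: {f64 / f8:.1f}X")  # 8.0X
```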
Other Considerations
Based on this, if I were part of a team evaluating Microsoft Fabric, would I find the platform straightforward to try out, configure, and secure? What should I know about the simplification of compute into shared Capacity Units (CUs) and its impact on enterprise governance, particularly regarding process visibility and cost planning for team or product workloads?
Do I need to factor in workload isolation from noisy neighbors and create multiple capacities? And what about features like Power BI, Private Link, and Copilot that require at least an F64 for each of those capacities?
Considering the different consumption rates of capabilities like Power BI, Spark, Data Warehouse, and OneLake, whether used inside Fabric or from outside it, how challenging is it to break processes into trackable units at any grain between the whole capacity and the individual query?
Test out the cost analyzer notebook today in your Fabric evaluation, and let the Xorbix Technologies team know what you learned! For organizations seeking expert guidance on data warehouse optimization and cost management, explore our Data Warehousing & Data Analytics services. Our team can help you make informed cloud platform decisions and optimize your data infrastructure costs.
Comments

EMEA Global Black Belt, Analytics and DW Specialist at Microsoft (1w):
You could have run this on an F4 or F2 and been even more dramatic, claiming 15 or 30 times hidden costs, LOL :-) To be clear for all: the fact that a query bursts (for example, 4X) does NOT MEAN you NEED a capacity 4 times bigger than the current one to run it WITHOUT getting throttled. If you run your 4X-bursting query in a loop for 10 minutes and then do nothing (which is close to the test described in the blog), the debt you accumulated because of bursting will be paid back over the next 30 (4x10 - 10) minutes, and (WITHOUT changing the capacity size) NO THROTTLING will happen, because throttling of background operations happens only if you have consumed all your CUs for the next 24 hours. So all this reasoning about hidden costs is highly questionable, to say the least (a bit like saying Databricks autoscale is "bad" because it may result in unpredictable costs if it kicks in).
What I touch multiplies in value. Serial confirmed hits. (1w):
We gave Fabric our best shot. It just wasn't viable on cost/performance for ETL compared to Databricks. Not even in the same ballpark. I'm sure they are working on it, but meanwhile Databricks is the undisputed king of this hill.