Articles tagged Microsoft Fabric

29 May 2024

Purview and Microsoft Fabric: Better Together

Microsoft created Purview, a data governance solution. Ever since Microsoft Fabric was in preview, there has been a promise of deeper integration between Fabric and this governance solution. This integration is finally available. Purview is a complete world by itself, so this is only a small summary of what’s available. First access to Purview Purview … Read more
08 May 2024

Edit The JSON of a Fabric Pipeline

A Fabric Pipeline uses JSON as source code. Pipelines are also saved in repositories as JSON. The first idea we get is editing the pipeline in JSON format. We can copy the JSON and create new pipelines with small variations, making changes directly in the JSON. However, at first sight we get disappointed, because the … Read more
30 April 2024

PySpark Secrets to use with Fabric

PySpark is a powerful language for data manipulation and it’s full of tricks. Let’s discover some of them. Control the Type of a NULL column If you are creating a PySpark DataFrame, but one of the columns contains only null values (None), how could you control the type of the column? There is an interesting … Read more
23 April 2024

4 Reasons to use KustoDB for Real-Time

When implementing real-time ingestion, we usually implement an architecture called lambda. Using the lambda architecture, KustoDB in Microsoft Fabric is always recommended for the speed layer. Do you know why? Let’s analyze it in detail. 1 – KustoDB uses SSD KustoDB uses an internal SSD storage. Lakehouses use ADLS as their backend. In this way, Kusto … Read more
03 April 2024

Dynamic Partitioning and a Simple Incremental Load

Let’s consider a simple statement to partition and save a table in a lakehouse: df.write.mode("overwrite").format("delta").partitionBy("Year","Month","Day").save("Tables/" + table_name) Let’s assume we load the data daily, with all the transactions from the day. The table will save the transactions for each day in different partitions. We can expect the table to keep the partitions from the previous day, … Read more
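The excerpt above is truncated, so the following is only a hedged sketch of the general idea it points at, not the article's own solution. It is a toy, pure-Python model (no Spark involved) of how an overwrite can treat existing partitions: a full overwrite replaces the whole table, while a dynamic partition overwrite replaces only the partitions present in the incoming data. The function name `overwrite` and the sample rows are hypothetical; in Spark and Delta Lake the second behaviour is usually requested through the `partitionOverwriteMode` option set to `dynamic`.

```python
from typing import Dict, List, Tuple

# A partition key mirrors partitionBy("Year", "Month", "Day").
Partition = Tuple[int, int, int]
Table = Dict[Partition, List[dict]]


def overwrite(table: Table, rows: List[dict], dynamic: bool) -> Table:
    """Toy model of an overwrite on a partitioned table (hypothetical helper).

    dynamic=False: the whole table is replaced by the incoming rows.
    dynamic=True:  only the partitions present in the incoming rows are replaced.
    """
    # Group the incoming rows by their partition key.
    incoming: Table = {}
    for row in rows:
        key = (row["Year"], row["Month"], row["Day"])
        incoming.setdefault(key, []).append(row)

    if dynamic:
        # Keep untouched partitions, replace the ones that arrive again.
        result = dict(table)
        result.update(incoming)
        return result

    # Full overwrite: earlier partitions are dropped.
    return incoming


# Sample data (made up for illustration): yesterday's load is already stored.
table: Table = {(2024, 4, 1): [{"Year": 2024, "Month": 4, "Day": 1, "amount": 10}]}
todays_rows = [{"Year": 2024, "Month": 4, "Day": 2, "amount": 25}]

static_result = overwrite(table, todays_rows, dynamic=False)
dynamic_result = overwrite(table, todays_rows, dynamic=True)

print(sorted(static_result))   # [(2024, 4, 2)]
print(sorted(dynamic_result))  # [(2024, 4, 1), (2024, 4, 2)]
```

In this model, the daily load only keeps the previous day's partition when the overwrite is dynamic, which is why the overwrite mode matters for a simple incremental load.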
13 March 2024

5 Secrets about Dataflows Gen 2

Dataflows Gen 2 are the new version of Power BI dataflows. There are so many changes compared to the previous version that they are considered a new feature. The main difference is the possibility to set a target for the result of each query in the dataflow. In this way, it can be used as … Read more
06 March 2024

Reverse Creating the PBIP file from Source Control

In my article about Source Control with Git, Power BI and Microsoft Fabric, I illustrate how to use the PBIP file format to include Power BI reports and semantic models in a source control process and establish an SDLC (Software Development Lifecycle) for Power BI. However, the complete explanation is based on saving the development using … Read more
28 February 2024

Customizing Kusto (Data Explorer) Connections in Deployment pipeline

When organizing our SDLC (Software Development Lifecycle) in Power BI/Fabric, we use Deployment Pipelines and create rules to change connection configurations every time we promote an object from one environment (dev, for example) to another (test, for example). Kusto connections, on the other hand, are not so simple. You can check more about Deployment Pipelines … Read more
21 February 2024

Eventstream and the new KQL Processing

Eventstream has many differences in relation to the technologies it proposes to replace: Event Hub, Stream Analytics, Streaming Dataflows and more. We can compare these technologies, but Eventstream in Microsoft Fabric has some specific differences from all of them. One of the differences is how the transformation of the input data is linked to the … Read more
14 February 2024

4 Rules to follow on Microsoft Fabric Source Control

Power BI and Fabric are implementing source control support. It’s a long-awaited feature for Power BI. However, it’s important to highlight some basic principles which should be followed as source control best practices. Some of them apply to any project in source control, some are specific for this environment, and some are specific for this … Read more
23 January 2024

Power BI and Resource Graph: Why you should care

Recently, Azure Resource Graph was announced as a new connector in Power BI. Azure Resource Graph provides access to almost all resources inside the Azure environment of a company. Why is this important? Resource Graph by itself is a very important tool to analyze the provisioned resources in an Azure environment without losing control of … Read more
17 January 2024

Fabric and Self-Service: Control who can use

We can say Fabric is the evolution of the Power BI environment. Power BI is a self-service environment, and so is Fabric. This allows the implementation of very interesting architectures, which will be the subject of future videos and articles. However, it’s not something free-and-easy, and it shouldn’t be. Using Fabric Admin Portal (or Power … Read more