A first introduction to Microsoft Fabric

–  by Martijn van 't Leengoed

What exactly is Microsoft Fabric? It is powerful tool that brings together all the strengths of various Microsoft data components to help you take control of your data and turn it into actionable insights.

Imagine this: instead of struggling with different data tools, you have one all-in-one platform that has all the tools available to you. From collecting and storing data to performing complex analysis and visualizing results.

What makes Microsoft Fabric so special? Let's look at the unique features that set it apart from the rest:

Workloads for every need: Whether you are a data scientist looking for advanced analytics capabilities or a business analyst looking to gain quick insights, Microsoft Fabric can meet these different needs.

The following different work items are at your disposal in Microsoft Fabric;

Data integration:

This is your hub for data transformation and integration. With more than 150 connectors, you can pull data from anywhere and transform it exactly the way you want.

Data Engineering:

Lakehouse, a new concept within Fabric, makes it possible to work with big data and offers great collaboration opportunities.

Data Warehouse:

Here you have access to the most powerful SQL engine available. No matter how large your dataset is, Synapse Data Warehouse can handle it!

Real-time Analytics:

For those of us who like to look ahead, this workload offers the ability to analyze real-time data. Whether it's IoT data or logs, you get the insights you need, quickly and accurately.

Data Science:

Last but not least, we have this workload for real data scientists. Here you will find everything you need to build, train and implement advanced AI models.

 

The OneLake

This is the central layer of Microsoft Fabric. This is where all your data is brought together and organized, so you can easily discover insights that would otherwise remain hidden.

It's like the OneDrive for all your data where you can keep all the data in one central place. However, you do have the ability to make this data available through separate data warehouses and lakehouses in different workspaces with specific security and policies.

You get one OneLake per tenant, with data in different containers. Each OneLake can be split into multiple workspaces with their own access rules, so each team can manage its own data. And best of all? You can host and explore all kinds of files with the different workload tools.

A useful extension within OneLake is that you don't have to store all of your source data in OneLake; you can use shortcuts to point to other storage locations. These shortcuts allow you to work with data without hosting it in Azure, reducing the risks of copying data.

 

Warehouse or a Lakehouse

Within the Microsoft Fabric ecosystem, there are two powerful storage options: Lakehouse and Warehouse.

Lakehouse is a versatile repository for all kinds of data, including structured, semi-structured and unstructured, and supports various formats such as CSV, Parquet, JSON and Delta. It offers flexibility in programming languages such as PySpark and Scala.

Warehouse is focused on structured data storage and processing in the Delta format, primarily with T-SQL. It is ideal for managing and updating data using SQL objects.

In short, the main difference between Lakehouse and Warehouse lies in the language chosen for data management by your pro-code developers. Choose Lakehouse if your team leans toward Scala/Python, and Warehouse if T-SQL is preferred. The analytic language, T-SQL, remains consistent across both items, ensuring consistency in analytic performance.

 

Get the Best from Microsoft Fabric for your Business

Microsoft Fabric is not just a tool; it helps you optimize your data processes. But how do you get the most out of it? It all starts with a clear goal in mind. Whether you want to improve your sales process, increase the quality of your work or increase the efficiency of your team, Microsoft Fabric provides the solution you need.

A crucial first step is to start small and expand gradually. It is tempting to be overwhelmed by a sea of data, but by focusing on the core you can avoid drowning.

A solid foundation of data management is invaluable. Therefore, it is essential to establish clear standards and principles based on the Mediallion architecture, a proven best practice within the Microsoft data platforms for setting up data warehouses and lakehouses.

To accomplish all this, forming a dedicated team is essential. Make sure they have the right skills by attending training in the required work items. This team will focus on developing a Minimum Viable Product (MVP) to take the first step toward integrating Microsoft Fabric into the organization. The knowledge gained from this will then be disseminated within the organization to drive and support multiple initiatives.

With the right approach, Microsoft Fabric can take your business to unprecedented heights. Don't wait, take the first step to success today!

Recent posts