Microsoft Fabric Updates Blog

Creating a shortcut to a VPC-protected Amazon S3 bucket

This guide will show you how to create a OneLake shortcut to a VPC-protected Amazon S3 bucket.

Why use the On-premises-data gateway?

Today, organizations are protecting data by leveraging network security capabilities like virtual networks, firewalls and virtual protected clouds (VPC). To access data securely and to provide a bridge between protected environments and Microsoft Fabric, an on-premises data gateway can be used.

Although the name might suggest that the on-premises data gateway can only be used to access your data that is on-premises, it can actually be used to access any data that is protected by any type of firewall or virtual network, including the virtual protected clouds on AWS. More information about the on premises data gateway is available here.

Setting up a gateway is an easy process. You need to provision an EC2 instance within your virtual private cloud; and configure (or open) appropriate ports to securely communicate with Microsoft Fabric. In this tutorial, we will walk you through the steps to complete end-to-end setup.

If you are already using the on-premises data gateway within Fabric for other items, like Pipelines, dataflows or Power BI, you can use the same instance of on-premises data gateway, as long as it also has access to your S3 bucket inside the VPC as shown in the diagram below.

At a high-level, the setup process consists of the following steps:

  1. Create a public subnet in your VPC environment and assign security groups to the S3 bucket subnet
  2. Create an EC2 instance within the public subnet
  3. Install the on-premises data gateway on the EC2 instance
  4. Open the right ports to Fabric service
  5. Create a shortcut using the on-premises data gateway

Prerequisites

Step-by-step set up

1.     Create a public subnet in your VPC environment and assign NSG’s.

  • If you don’t have a public subnet follow this guide to create an internet gateway for a subnet in your VPC.

2.     Create an EC2 instance within the public subnet

  • Launch EC2 instance in the public subnet of your VPC. Be sure to save the private key file in a secure place. You will need this in the next step.

3. Install the on-premises data gateway on the EC2 instance

4. Open the right ports to the Fabric service

  • If a firewall blocks outbound connections, configure the firewall to allow outbound connections from the gateway to its associated Azure region. The firewall rules on the gateway need to be updated to allow outbound traffic from the gateway server to the following endpoints.

5. Create a shortcut to S3

Related blog posts

Creating a shortcut to a VPC-protected Amazon S3 bucket

September 25, 2024 by Idris Motiwala

Overview This blog will walk thru the new capabilities in Mirroring Azure SQLDB in Fabric since our public preview announcement earlier in March 2024. Today, we also announced general availability of Mirroring for Snowflake in Microsoft Fabric. To recap, the 3 key benefits of Mirroring are: Over the past few months, we’ve removed limitations to … Continue reading “Mirroring Azure SQLDB – new features and what’s coming up?”

September 25, 2024 by Trevor Olson

GCS shortcuts and S3 Compatible shortcuts are now generally available. Utilize shortcuts in OneLake to quickly and easily make data accessible in Fabric. No need to set up pipelines or copy jobs, just create a shortcut and your data is immediately available in Fabric.    From your Lakehouse, select new shortcut. Choose your shortcuts type (GCS, … Continue reading “Google Cloud Storage shortcuts and S3 Compatible shortcuts generally available”