Azure Blob Storage is a Microsoft-managed service providing cloud storage for a variety of use cases. You can use Azure Blob Storage with Flink for reading and writing data as well in conjunction with the streaming state backends
You can use Azure Blob Storage objects like regular files by specifying paths in the following format:
See below for how to use Azure Blob Storage in a Flink job:
flink-azure-fs-hadoop, copy the respective JAR file from the
opt directory to the
plugins directory of your Flink distribution before starting Flink, e.g.
flink-azure-fs-hadoop registers default FileSystem wrappers for URIs with the wasb:// and wasbs:// (SSL encrypted access) scheme.
Hadoop’s Azure Filesystem supports configuration of credentials via the Hadoop configuration as
outlined in the Hadoop Azure Blob Storage documentation.
For convenience Flink forwards all Flink configurations with a key prefix of
fs.azure to the
Hadoop configuration of the filesystem. Consequentially, the azure blob storage key can be configured
Alternatively, the the filesystem can be configured to read the Azure Blob Storage key from an
AZURE_STORAGE_KEY by setting the following configuration keys in