Skip to main content

Posts

Showing posts from August, 2021

How to handle data load failure in SSIS?

 There are different ways to handle data load failure.  1. On dataflow you can redirect failure rows to another location and later retrieve it.  2. You can also set ignore failure rows on destination settings.  3. Easiest way is configure destination columns to easily convertible datatype like Varchar/strong with proper length.  4. If failure can happen due to network issue or connection issues then you can set up retry mechanism.  5. If it's happening due to memory then you can review the batch size and go for sequential load instead of parallel loads. 

Accessing azure blob storage in azure databricks

 Accessing azure blob storage using azure databricks There are two different ways to access azure blob storage. Mounting Directly accessing storage  Mounting azure storage  When we mount azure storage in databricks it behaves like DBFS. We can run all the DBFS commands on the mount point. The mount point persist till we unmount it. If a Blob storage container is mounted using a storage account access key, DBFS uses temporary SAS tokens derived from the storage account key when it accesses this mount point. Below is the code to mount azure storage. Python code dbutils.fs.mount(   source = "wasbs://<container-name>@<storage-account-name>.blob.core.windows.net",   mount_point = "/mnt/<mount-name>",   extra_configs = {"<conf-key>":dbutils.secrets.get(scope = "<scope-name>", key = "<key-name>")})  <conf-key> can be either fs.azure.account.key.<storage-account-name>.blob.core.windows.net or fs.a...