How to dynamically map columns in Azure Data Factory to adjust the match between source and target columns
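Azure Data Factory (ADF) is a cloud data orchestration service that lets you create, schedule, and manage data pipelines. The steps below cover reading the source data, defining variables and parameters, configuring the Copy activity, and mapping source columns to sink columns.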
https://dadosabertos.mec.gov.br/pnp?start=0
Step-by-step:
Example:
{ "name": "ArrayOfValues", "type": "Array", "value": [ "Value1", "Value2", "Value3" ] }
{ "name": "fileName", "type": "String" }
Walkthrough:
"source": { "type": "AzureDataLakeStoreSource", "storeSettings": { "type": "AzureDataLakeStoreReadSettings", "recursive": true } }
"activities": [ { "name": "ReadJSONActivity", "type": "Copy", "inputs": [ { "referenceName": "AzureDataLakeStorageLinkedService", "type": "DatasetReference", "parameters": { "fileName": "example.json" } } ], "outputs": [ { "referenceName": "OutputDataset", "type": "DatasetReference" } ], "typeProperties": { "source": { "type": "JsonSource" }, "sink": { "type": "AzureBlobSink" }, "enableStaging": false } } ]
"sink": { "type": "SqlServerSink", "writeBatchSize": 10000, "writeBatchTimeout": "00:05:00" }
{"type": "TabularTranslator", "mappings": [ {"source": {"name": "Year"},"sink": {"name": "Year"}}, {"source": {"name": "OU"},"sink": {"name": "OU"}}, {"source": {"name": "Action"},"sink": {"name": "Government Action"}}, {"source": {"name": "GND"},"sink": {"name": "GND"}}, {"source": {"name": "Settled Expenses"},"sink": {"name": "Total Settlements"}} ]}
These steps and examples illustrate how to perform various operations with Azure Data Factory, from storing and extracting data to mapping columns and using specific activities.
Data Scientist and Consultant for Digital and Analytics Solutions
@fabioms