Free cookie consent management tool by TermsFeed Policy Generator

#061 How to read data in JSON format in Azure Synapse Analytics

Apresentamos nesse vídeo como obter os munícipios do Brasil utilizando o Azure Synapse Analytics com extração dos dados em formato JSON na internet e armazenando no Azure Datalake em format Parquet.

We'll learn the techniques

:
  1. Create pipeline with data

    copy activity:
    • In Azure Synapse Studio, create a new pipeline and add a data copy activity.
  2. Configure the source data (HTTP,
  3. JSON):

    • In the pipeline, add an HTTP data source and configure to read data in JSON format.
  4. Get data URL

    from IBGE website:

  5. Map the schema of the source data
  6. (IMPORT SCHEMA):

    • Use the copy data activity to import the schema from the source data.
  7. Configure the target
  8. data (DATALAKE, PARQUET):

    • Add a target step in the pipeline by configuring the destination as a Data Lake and choosing the Parquet format.
  9. Some reasons to use the Parquet format:

    • The Parquet format is storage-efficient and offers compression, schema support, and efficient reading for analytical processing.
  10. Run and monitor trigger
  11. (TRIGGER):

    • Set up and run a trigger to start the pipeline and monitor the status of the run.
  12. Identify the costs of running the pipeline
  13. (RUN CONSUMPTION):

    • Analyze the costs associated with running the pipeline using Azure monitoring and consumption tools.
  14. Generate script to read the
  15. PARQUET format file:

    • If necessary, generate a script to read data from the Parquet file in the desired environment (for example, using Apache Spark).
  16. Map the target columns
  17. (MAPPING):

    • In the pipeline, map the columns from the source data to the target columns in Parquet format in the Data Lake.

This content contains
  • Content Video
  • Language Portuguese
  • Duration 10m 1s
  • Subtitles Não

  • Reading time 1 min 13 seg

avatar
Fabio Santos

Data Scientist and Consultant for Digital and Analytics Solutions


  • Share

Youtube Channel

@fabioms

Subscribe now