I previously created a copy activity to copy all columns in sql table into ADLS in parquet format. I am new to Azure data factory. So my question is how would copy only a specific set of columns into Azure data lake storage as Parquet?
CodePudding user response:
- In copy activity source settings, click the Query radio button in Use query option.
- In Query text box, Type the query which selects only the required column.
In sink, use the Parquet dataset.
When copy activity is executed, only selected columns from SQL table gets copied to data lake.
Preview Data of Source
- When table is clicked in source dataset settings, all columns are selected.
- When Query option is clicked in source settings, selected columns from the query are displayed.
CodePudding user response:
You can even restrict via mapping table and delete unnecessary fields, so that only the selected fields will get copied to the destination file or table of your interest.