In this step you will create a Data Pipeline to execute the previously imported Spark Notebooks
-
Open 01-FDA-Download-DrugEvent-Dataset Notebook
-
Open the Run options by clicking Run button in the toolbar
-
Clck Add to Pipeline button and select New Pipeline option
-
Specify appropriate name for the Pipeline example fda_ds_silver_pipeline and click Create button to open Data Pipeline canvas
-
Select the Notebook Activity on canvas to give appropriate name like DownloadFDADataset to the Activity as shown in the screenshot below, also review the Settings Tab for activity (Workspace and Notebook settings are automatically set appropriately because Pipeline was created from Notebook)
-
Open the Activity menu by clicking Activity button in the Toolbar
-
Click Notebook button which will add a new Notebook activity on the canvas
-
Select the new Notebook Activity on the canvas and open Settings section
-
Set the Workspace value to the current Workspace
-
Set the Notebook value to 02-FDA-DrugEvent-CreateSilverTables
-
Connect the two Activities by dragging from the Download Activity On Success icon to the Transform Activity added for second Notebook
-
Make sure to save changes to the Pipeline