Description
I want to create a pipeline from scratch in ODN/UV canvas, so that it can be associated with a dataset in ODN/IC.
Preconditions
- ODN is installed
How to do it:
- DataProcessingUnits (building blocks of a pipeline) are available on the left side in the tree structure
- the DPUs are orgaqnized in E, T, L subtrees in accordance with the purpose it serves in the transformation process
- E - extractors retrieve source data from different sources (remote/local file storage, relational database, etc...)
- T - transformers modify the structure and content of data input, transform data from one datatype to another, etc...)
- L - loaders load the transformed data to selected data storage, publish it in the ODN/IC as resource of a dataset
- use Drag & drop method to put a selected DPU from tree to canvas workspace
- use doubleclick or hover over a DPU in workspace and select the gearwheel icon to open the configuration dialog of a DPU
- each configuration dialog contains 'About' section which describes the DPU and provides help on how to configure
- hover over a DPU in workspace and select the arrow icon to create a conection between two DPUs
- if the connection created is displayed in red color, it is either incorrectly configured or it is not possible to interconnect the selected DPUs at all
- once the connection of two DPUs is created it can be configured by clicking the connection and selecting the gearwheel icon
- configuration dialog is opened where the output data unit of the source DPU can be mapped to input data unit of the target DPU
- hover over a DPU in workspace and select the basket icon to delete the DPU from workspace (all connections from and to this DPU are deleted as well)
- click the connection and select the basket icon to delete the selected connection between two DPUs
- '+' '-' icons on the upper right corner of the screen for zooming the workspace in and out
- the green arrow icon can be used for undo the last step performed
- use 'Save' button to save the pipeline created
- before the pipeline is executed, it needs to be associated with a dataset in ODN/IC