Lineage refers to a directed acyclic graph (DAG) generated by DataForge describing how data is processed, tracked at the rule, column, and input level. Lineage displays the dataflow between nodes. Origin starts from the left and goes to destination on the right. Nodes can be a variety of datatypes within DataForge
Starting a Session
Lineage starts with an object. Users can kick off a lineage graph by navigating to a DataForge object with lineage a lineage icon. Click on the lineage icon to start a new lineage graph.
Users have the option of starting a new lineage graph or adding to an existing graph when the icon is selected.
Lineage can be started from the following list of objects: sources, relations, outputs, output source channels, raw attributes (raw schema), rules, relations, and output columns.
Expanding the Lineage Graph
To trace the lineage graph further upstream or downstream, right-click on any box or object and use the menu options to add a destination or origin. Both can be traced recursively to see the full lineage path.
Remove object nodes from the lineage graph by using the right-click menu and Remove options.
Directly open any object through the same menu.
Legend
In Lineage, different types of objects are represented by combinations of symbols and attribute categories.
Users will find the legend key by clicking the map icon in the upper right corner of the lineage screen. It can be toggled on/off at any time.
Attribute and object references are designated via arrows.
A simple flow (black arrow) represents a direct reference between objects with no intermediate tables or relations involved.
A relation flow (blue arrow) represents a reference between objects where a relation is used. Click the relation arrow to see the relation name used.
Updated