Sources

  • Sources Overview

    A Source represents a single schema of data. Sources are the logical grouping for Inputs, Relations, and Ru...

  • Source Settings

    The Settings tab for a Source allows a user to specify key information about the Source including input types ...

  • Raw Schema

    The Raw Schema tab for Sources allows users to view the raw database attributes as well as raw metadata. ...

  • Dependencies

    Dependencies allow configurators to modify the workflow engine to introduce waits to the processing queues ...

  • Relations

    Relations define inter-source connections and enable users to configure lookups and cross-source aggregates. ...

  • Rules

    Rules allow DataForge to modify and transform data. The Rules tab allows users to select, ...

  • Complex Data Types

    DataForge supports the use of complex data types such as array and struct, common in semi-structured datasets...

  • Inputs

    The Source Inputs screen shows the status of an individual Source's processing and allows users to restart pro...

  • Process

    The Process page provides an operational dashboard of the processes completed or currently active for this sou...

  • Viewing Source Data

    Every source has a hub table behind the scenes in Databricks that represents all of the input data brought int...

  • Custom Refresh

    DataForge provides users the flexibility to define a Custom Refresh process that determi...

  • Sub-Sources

    The sub-source feature in DataForge enables developers to easily work with and transform complex nest...

  • Unmanaged External Source

    DataForge allows users to connect to external tables that are set up in Databricks as Delta or Hive tables.  T...