Connections

A connection holds the credentials, network locations, and any other parameters required to access data in the location where it is either generated or staged for ingestion by DataForge.


Connections

The connections home page enables users to quickly search and access connections already configured in the DataForge platform.
 
Only Connections marked as Active are shown here, unless the Active Only toggle is set to off.
 
 

Connection Settings

The connection settings page allows users to provide the parameters DataForge needs to access the system.
 
  • Name*: A unique name
  • Description*: A description
  • Active: Allows users to disable the connection without deleting the configuration
  • Group: Allows users to include the connection in a group (requires a Connection Template to also be selected)
  • Connection Template: Allows users to select a connection template to normalize the Name (requires a Group to also be selected)
  • Connection Direction: Specifies if this connection is used to ingest or output data
  • Connection Type: Specifies the format or location style of the source or target data. Depending on the Type selected, the remaining parameters will change
  • Uses Agent: A visual indicator showing whether or not this connection will use an Agent (only available if Source Connection Direction is selected)
  • Agent*: When an Agent is required, selects the Agent to be used (only available if Source Connection Direction is selected)

The Duplicate button near Save will create a copy of the configuration in a new tab with the same settings and a name of "<configuration name> COPY". The duplicated configuration is not attached to any objects automatically.


API Connection Type

Options available:


Custom Connection Type

Used in the SDK as part of Custom Ingestion.
 
Parameters here are optional, as not all custom ingestion notebooks require parameterization.
 
These parameters should be a JSON object in the format {"key1": "value1", "key2": "value2"}.
 
For connections that do not need any parameters, enter an empty JSON object: {}
  • Public Connection Parameters*: Passed as plain text to the custom ingestion session
  • Private Connection Parameters*: Stored in the Databricks Secrets of the respective cloud service when the connection is saved (see the sketch below)
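
As an illustration, a custom ingestion notebook might read these parameters roughly as follows. This is a minimal sketch only: the parameter keys, secret scope, and key names are hypothetical placeholders, and the exact way DataForge passes the values into the session is defined by the SDK rather than shown here.

    import json

    # Public Connection Parameters are passed as plain text, e.g. a JSON string
    # like the following (keys and values are hypothetical placeholders):
    public_params_json = '{"base_url": "https://example.com/api", "page_size": "500"}'
    public_params = json.loads(public_params_json)
    base_url = public_params["base_url"]

    # Private Connection Parameters are stored in Databricks Secrets on save.
    # In a Databricks notebook, dbutils is available as a built-in object;
    # the scope and key names below are placeholders, not DataForge-defined names.
    api_token = dbutils.secrets.get(scope="my-secret-scope", key="api_token")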

Event Connection Type

Options available:


File Connection Type

  • Storage Technology*: Specifies the type of file storage the Agent or Databricks will attempt to access
  • File Path*: The folder/container path for DataForge to access when pulling or generating files (see the example paths below)
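
For example, depending on the Storage Technology selected, the File Path typically takes a form like the following (the bucket, container, and folder names are placeholders):

    s3://my-bucket/landing/orders/                                  (AWS S3)
    abfss://landing@mystorageaccount.dfs.core.windows.net/orders/   (Azure Data Lake Storage Gen2)

An Agent-based connection may instead point at a local or network folder path accessible from the machine where the Agent runs.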

Table Connection Type

  • Driver*: Which JDBC driver should be used

When using the Generic JDBC driver option, users need to enter the connection string, driver class path, and any sensitive parameters. The driver library also needs to be added to the libraries setting of the cluster configuration parameters that will be used for ingestion on any sources. For more information and examples, refer to the Generic JDBC Connection documentation.
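
As an illustration, a Generic JDBC configuration for a PostgreSQL database might use values along these lines (the host, port, database name, and library version are placeholders; the exact parameter names and library setup are covered in the Generic JDBC Connection documentation):

    Connection string:  jdbc:postgresql://db-host.example.com:5432/sales_db
    Driver class path:  org.postgresql.Driver
    Cluster library:    org.postgresql:postgresql:42.7.3 (Maven coordinate added to the libraries setting)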


Parameters

The parameters section changes dynamically based on the selections above. These parameters are typically optional to configure and are used for advanced configuration or specifications.
To utilize Connection Metadata for database connections, use the Metadata Refresh and Metadata Schema Pattern parameters. See Connection Metadata below for more detail.

Connection Metadata

Connection Metadata shows an optional list of all tables, referenced tables, and primary/foreign keys. To populate Connection Metadata, use the Metadata Refresh and Metadata Schema Pattern parameters on the Connection Settings page.

  • Metadata Refresh (includes four options):
    • Tables, Columns, and Keys collects the most granular information for each table, pulling the list of columns and keys defined in each table. This is the default and enables Talos AI to directly search for specific fields within connections for users.
    • Tables and Keys collects the table names and key column identifiers for each table
    • Tables collects only table/view names
    • None disables metadata collection for the connection
  • Metadata Schema Pattern (optional):
    • Specifies a SQL LIKE pattern used to filter the schemas included in metadata collection (see the example below)
Note: Connection Metadata only works with Table/Database connections at this time.
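
For example, a Metadata Schema Pattern of sales% limits metadata collection to schemas whose names start with "sales". Standard SQL LIKE syntax applies, where % matches any sequence of characters and _ matches a single character.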

With Connection Metadata, users have the option to directly create sources from connection tables, including referenced tables recursively. The Metadata Refresh parameter must be set to any option other than None to use this feature.

After selecting the checkbox next to a table the user wants, options will appear in the triple dot menu to also add referenced tables or referenced tables recursively. Add referenced tables recursively will include all tables referenced in the chain of tables.

Create new sources directly from Connection Metadata by selecting the Create Source(s) button in the triple dot menu at the top right above the table.

A source creation modal appears to set the source naming pattern for DataForge to use when creating the sources. If the connection's Metadata Refresh parameter is set to Tables and Keys, an option will appear to automatically Create Relations between all the sources. This option only works the first time the sources are created or when the user changes the naming pattern to recreate all of the sources as new with relations.

Toggle Initiate Data Pull on to start a new ingestion for each source when it is created, rather than manually running a new data pull on each source.

The Sources column of the Connection Metadata tab will provide a number and hyperlink to any existing sources set up to pull data from the specific table/view.  Click the hyperlink to view and/or open the existing sources.

The Refresh button will launch the Connection Test cluster to retest the connection and do a new scan of the database tables and views.
