Represents a data source configuration within the system, including connection details, status, chunking configurations, and record counts.

connectionId
string
required

The identifier of the connection associated with the datasource.

createdDate
string
required

The date and time when the datasource was created.

destinationId
string
required

The identifier of the data destination.

name
string
required

The name of the datasource.

originalName
string
required

The original name of the datasource.

sourceId
string
required

The identifier of the data source.

sourceType
string
required

The type of source for the datasource.

workspaceId
string
required

The identifier of the workspace associated with the datasource.

_id
string

Unique identifier for the datasource.

chunkingConfig
object

Configuration settings for chunking unstructured data, including partitioning and chunking strategies, character limits, and similarity thresholds.

connectionSettings
object

Configuration settings for the datasource connection.

description
string

Optional description of the datasource.

discoveredSchema
object | null

Schema discovered during the data source connection. The structure depends on the source type.

embeddingField
string

The field used for embedding within the datasource.

filename
string | null

The name of the file associated with the datasource, if applicable.

hidden
boolean

Indicates whether the datasource is hidden from standard views.

lastSyncedDate
string | null

The date and time when the datasource was last synced. Null indicates it has never been synced.

modelId
string

Identifier of the embedding model used, if applicable.

orgId
string

Identifier of the organization to which the datasource belongs.

recordCount
object

The record count details for the datasource, including total, successful, and failed records.

status
enum<string>

The current status of the datasource.

Available options:
draft,
processing,
embedding,
ready
streamConfig
object

Configuration settings for processing streams of data, breaking them into smaller chunks for more manageable processing.

teamId
string

Identifier of the team to which the datasource belongs.

timeUnit
string

A temporary field to limit CRON frequency based on the plan. This will be replaced with a more robust solution in the future.

timeWeightField
string

The field used to apply time weighting within the datasource.