Reference
Retrieve A Datasource
Retrieve a single datasource within the specified team by its ID. The datasource is returned as a JSON object.
Path Parameters
The resourceSlug is a URL path parameter that holds the teamId of the team associated with the user.
The ID of the datasource to retrieve.
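The two path parameters above are both substituted into the request URL. The following is a minimal sketch of that substitution. The base URL and the path layout in `PATH_TEMPLATE` are placeholders assumed for illustration, not the documented route, and `build_datasource_url` is a hypothetical helper name.

```python
from urllib.parse import quote

# ASSUMPTION: the base URL and path layout are placeholders for illustration only.
# Replace them with the route given by the actual API specification.
BASE_URL = "https://example.invalid/api"
PATH_TEMPLATE = "/{resource_slug}/datasource/{datasource_id}"


def build_datasource_url(resource_slug: str, datasource_id: str,
                         base_url: str = BASE_URL) -> str:
    """Build the request URL, percent-encoding both path parameters."""
    if not resource_slug or not datasource_id:
        raise ValueError("resource_slug and datasource_id are both required")
    path = PATH_TEMPLATE.format(
        # safe="" also encodes "/" so a parameter cannot add path segments
        resource_slug=quote(resource_slug, safe=""),
        datasource_id=quote(datasource_id, safe=""),
    )
    return base_url.rstrip("/") + path
```

Encoding each parameter separately keeps an unexpected character in an ID from changing the shape of the path.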
Response
A JSON object representing the retrieved datasource.
The identifier of the connection associated with the datasource.
The date and time when the datasource was created.
The identifier of the data destination.
The name of the datasource.
The original name of the datasource.
The identifier of the data source.
The type of source for the datasource.
The identifier of the workspace associated with the datasource.
Unique identifier for the datasource.
Configuration settings for chunking unstructured data, including partitioning and chunking strategies, character limits, and similarity thresholds.
The maximum number of characters allowed per chunk.
The number of characters after which a new chunk is created.
The number of characters to overlap between chunks.
Indicates whether to apply overlap to all chunks or only between adjacent chunks.
The partitioning strategy used for unstructured data.
Allowed values: auto, fast, hi_res, ocr_only
The similarity threshold used when chunking by similarity. The value must be strictly between 0.0 and 1.0.
Required range: 0 < x < 1
The chunking strategy used for unstructured data.
Allowed values: basic, by_title, by_page, by_similarity
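The constraints on the chunking configuration (two enumerations and an open interval for the similarity threshold) can be checked on the client side. The sketch below assumes hypothetical key names (`partitioning`, `strategy`, `similarity_threshold`), since this page does not show the actual field names. Only the allowed values and the range come from the descriptions above.

```python
from __future__ import annotations

# Allowed values as listed in this reference.
PARTITIONING_STRATEGIES = {"auto", "fast", "hi_res", "ocr_only"}
CHUNKING_STRATEGIES = {"basic", "by_title", "by_page", "by_similarity"}


def validate_chunking_config(config: dict) -> list[str]:
    """Return a list of human-readable problems; an empty list means valid.

    ASSUMPTION: the keys "partitioning", "strategy" and
    "similarity_threshold" are hypothetical names used for illustration.
    """
    errors: list[str] = []

    partitioning = config.get("partitioning")
    if partitioning is not None and partitioning not in PARTITIONING_STRATEGIES:
        errors.append(f"unknown partitioning strategy: {partitioning!r}")

    strategy = config.get("strategy")
    if strategy is not None and strategy not in CHUNKING_STRATEGIES:
        errors.append(f"unknown chunking strategy: {strategy!r}")

    threshold = config.get("similarity_threshold")
    # The documented range is exclusive at both ends: 0 < x < 1.
    if threshold is not None and not (0 < threshold < 1):
        errors.append(f"similarity threshold out of range: {threshold!r}")

    return errors
```

Returning a list of problems rather than raising on the first one lets a caller report every invalid setting at once.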
Configuration settings for the datasource connection.
Configuration settings for the datasource connection. The structure depends on the datasource type.
The identifier of the data destination.
The name of the datasource connection.
Specifies the behavior for handling non-breaking schema updates.
Optional prefix to be added to the destination's namespace. Can be null.
The identifier of the data source.
The status of the datasource connection. This should match the enum values defined by the Airbyte API and should allow creation in a paused state.
Specifies where the data should be stored geographically.
Defines how the namespace should be determined for the data.
The format of the namespace. Null if not applicable.
Scheduling information for the datasource connection.
Optional description of the datasource.
Schema discovered during the data source connection. The structure depends on the source type.
The field used for embedding within the datasource.
The name of the file associated with the datasource, if applicable.
Indicates whether the datasource is hidden from standard views.
The date and time when the datasource was last synced. Null indicates it has never been synced.
Identifier of the embedding model used, if applicable.
Identifier of the organization to which the datasource belongs.
The record count details for the datasource, including total, successful, and failed records.
The current status of the datasource.
Allowed values: draft, processing, embedding, ready
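A client that receives this response often needs to decide whether the datasource can be used yet. As a rough sketch, the four documented status values can be modeled as an enumeration. The `status` key name and the `is_ready` helper are assumptions for illustration; the page lists the values but not the field name.

```python
from enum import Enum


class DatasourceStatus(str, Enum):
    """The four status values listed in this reference."""
    DRAFT = "draft"
    PROCESSING = "processing"
    EMBEDDING = "embedding"
    READY = "ready"


def is_ready(datasource: dict) -> bool:
    """Return True only when the datasource reports the 'ready' status.

    ASSUMPTION: the status is stored under a key named "status".
    An unrecognized status value raises ValueError instead of being ignored.
    """
    return DatasourceStatus(datasource["status"]) is DatasourceStatus.READY
```

Parsing the raw string through the enumeration means an undocumented status fails loudly instead of being silently treated as "not ready".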
Configuration settings for processing streams of data, breaking them into smaller chunks for more manageable processing.
Configuration settings for a specific stream, used to break down large volumes of data into smaller, manageable chunks for processing.
List of child stream identifiers that are checked for inclusion in the sync.
List of fields that act as the cursor for incremental syncs.
A map of field names to their descriptions.
Provides details about a specific field, including its description and type.
List of fields that make up the primary key for the stream.
The synchronization mode used for the stream.
Identifier of the team to which the datasource belongs.
A temporary field to limit CRON frequency based on the plan. This will be replaced with a more robust solution in the future.
The field used to apply time weighting within the datasource.
An array of all models the user has access to. It is used to identify the embedding model of the datasource and to list all available embedding models, so that the selected embedding model can be changed easily.
Unique identifier for the model.
Configuration settings for the model.
API key for accessing the model.
The base URL for the model's API.
API key for accessing Cohere services.
API key for accessing Groq services.
The model configuration setting.
The length of the embeddings generated by the model.
The specific AI model used.
The type of the model.
The name of the model.
Identifier of the organization to which the model belongs.
Identifier of the team to which the model belongs.
The general type of the model (e.g., embedding, language model).
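The response carries both the identifier of the datasource's embedding model and the array of models the user can access, so the two can be joined to find the model in use. The sketch below shows one way to do that lookup. The key names `modelId` and `_id` and the function name are hypothetical, because the field names are not shown on this page.

```python
from __future__ import annotations

from typing import Optional


def find_embedding_model(datasource: dict, models: list[dict]) -> Optional[dict]:
    """Return the model whose identifier matches the datasource's embedding model.

    ASSUMPTION: the datasource stores the identifier under "modelId" and each
    model stores its own identifier under "_id". Both names are illustrative.
    Returns None when no embedding model is set or no match is found.
    """
    model_id = datasource.get("modelId")
    if model_id is None:
        return None
    for model in models:
        if model.get("_id") == model_id:
            return model
    return None
```

For example, with a datasource of `{"modelId": "m2"}` and models `[{"_id": "m1"}, {"_id": "m2"}]`, the helper returns the second model.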