UnstructuredChunkingConfig
partitioning
enum<string>
required
The partitioning strategy used for unstructured data.
Available options: auto, fast, hi_res, ocr_only
strategy
enum<string>
required
The chunking strategy used for unstructured data.
Available options: basic, by_title, by_page, by_similarity
max_characters
integer
required
The maximum number of characters allowed per chunk.
new_after_n_chars
integer
required
The number of characters after which a new chunk is created.
overlap
integer
required
The number of characters to overlap between chunks.
similarity_threshold
number
required
The similarity threshold used when chunking by similarity. Must be a value between 0.0 and 1.0.
overlap_all
boolean
required
Indicates whether to apply overlap to all chunks or only between adjacent chunks.
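
Example
The fields above can be assembled into a single object. The following is a minimal Python sketch, not an official SDK class: the dataclass, the constant names, and all sample values (1500, 1000, 100, 0.5) are assumptions chosen for illustration. Only the field names, types, enum options, and the 0.0 to 1.0 range come from the reference above.

```python
from dataclasses import dataclass, asdict

# Allowed values copied from the field descriptions above.
PARTITIONING_OPTIONS = ("auto", "fast", "hi_res", "ocr_only")
STRATEGY_OPTIONS = ("basic", "by_title", "by_page", "by_similarity")


@dataclass
class UnstructuredChunkingConfig:
    """Hypothetical client-side mirror of the documented object."""

    partitioning: str
    strategy: str
    max_characters: int
    new_after_n_chars: int
    overlap: int
    similarity_threshold: float
    overlap_all: bool

    def __post_init__(self) -> None:
        # Reject values outside the documented enum options.
        if self.partitioning not in PARTITIONING_OPTIONS:
            raise ValueError(f"unknown partitioning: {self.partitioning!r}")
        if self.strategy not in STRATEGY_OPTIONS:
            raise ValueError(f"unknown strategy: {self.strategy!r}")
        # The reference states the threshold lies between 0.0 and 1.0.
        if not 0.0 <= self.similarity_threshold <= 1.0:
            raise ValueError("similarity_threshold must be between 0.0 and 1.0")


# Illustrative values only; every field is required, so all seven are set.
config = UnstructuredChunkingConfig(
    partitioning="auto",
    strategy="by_title",
    max_characters=1500,
    new_after_n_chars=1000,
    overlap=100,
    similarity_threshold=0.5,
    overlap_all=False,
)

# Plain dict with the documented field names, ready for JSON serialization.
payload = asdict(config)
```

Because every field is marked required, the sketch gives none of them a default, so omitting one fails at construction time rather than when the object is sent.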