Get started
Install the CLI, ingest your first corpus, and export chunks to a vector store in under five minutes.
- Quickstart
- Install the CLI
- Your first pipeline
- Conceptual overview
Quickstart, concepts, API reference, and integration guides — written by the engineers who built CHUNKZA.
Install the CLI, ingest your first corpus, and export chunks to a vector store in under five minutes.
Understand and configure every splitting strategy CHUNKZA ships.
Inspect boundaries, embeddings, and retrieval replay.
Export to your stack and pipe chunks into your RAG runtime.
Programmatic access to the full CHUNKZA pipeline.
End-to-end playbooks for common scenarios.
Every pipeline is a versioned YAML policy — diffable, reviewable, and replayable. No more "why did retrieval change between releases?"
# chunkza.yml — a versioned chunking policy
version: 1.4
source:
type: s3
bucket: acme-kb
prefix: docs/
strategy:
layout_aware: true
semantic_boundary:
enabled: true
model: chunkza-boundary-v2
parent_child:
enabled: true
parent_max_tokens: 1024
child_max_tokens: 256
overlap_tokens: 48
metadata:
inject: [section_path, source_uri, page, acl_roles]
export:
target: pinecone
index: acme-kb-v3
embeddings: text-embedding-3-largeRequest access and we'll provision your workspace plus a walkthrough with a solutions engineer.