How RAG Works

| Step | Process |
|---|---|
| 1. Upload | Add files or connect external sources |
| 2. Chunk | Documents split into ~250-token segments |
| 3. Embed | Vector embeddings generated for each chunk |
| 4. Search | User query matched against embeddings |
| 5. Retrieve | Top chunks injected into LLM context |
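
To make the numbers concrete: a 25,000-token manual yields roughly 100 chunks of ~250 tokens each, and each chunk is embedded and matched independently at query time, so only the handful of most relevant segments reach the model.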

Create Dataset

curl -X POST https://api.cuadra.ai/v1/datasets \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Idempotency-Key: create-ds-001" \
  -d '{"name": "Product Docs", "description": "API guides"}'

Upload Documents

Adding documents is a two-step process: upload the file, then associate it with a dataset.

Step 1: Upload File

curl -X POST https://api.cuadra.ai/v1/files \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -H "Idempotency-Key: upload-001" \
  -F "file=@product-guide.pdf"

Step 2: Associate with Dataset

curl -X POST https://api.cuadra.ai/v1/files/file_abc123/associations \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"datasetId": "ds_xyz789"}'

Supported Formats

| Format | Extensions | Max Size |
|---|---|---|
| PDF | .pdf | 50MB |
| Word | .docx | 50MB |
| Text | .txt, .md | 50MB |
| Data | .csv, .json | 50MB |
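
Every format shares the 50MB cap, so a quick local check before uploading saves a failed request. A minimal pre-flight sketch (the filename is a placeholder):

# Reject files over the 50MB limit before calling the API
FILE="product-guide.pdf"
MAX_BYTES=$((50 * 1024 * 1024))
if [ "$(wc -c < "$FILE")" -gt "$MAX_BYTES" ]; then
  echo "Error: $FILE exceeds the 50MB upload limit" >&2
  exit 1
fi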

External Connectors

Sync content from external data sources.

| Connector | Status |
|---|---|
| Google Drive | ✅ Available |
| Notion | ✅ Available |

Connect Google Drive

curl -X POST https://api.cuadra.ai/v1/connections \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "connectorSlug": "google_drive",
    "redirectUrl": "https://your-app.com/oauth/callback"
  }'
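
This kicks off an OAuth flow: the request registers your callback URL, and the response should include an authorization URL to send the user to. Assuming that URL is exposed in a url field (the exact field name isn't documented here):

# Extract the authorization URL from the response (field name assumed)
AUTH_URL=$(curl -s -X POST https://api.cuadra.ai/v1/connections \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "connectorSlug": "google_drive",
    "redirectUrl": "https://your-app.com/oauth/callback"
  }' | jq -r '.url')
echo "Send the user to: $AUTH_URL"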

Attach a Dataset to a Model

Connect a dataset to a model to enable RAG:

curl -X POST https://api.cuadra.ai/v1/models/model_abc/datasets \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"datasetId": "ds_xyz", "usageType": "rag"}'

Best Practices

| Do | Avoid |
|---|---|
| Clean formatting before upload | Scanned images without OCR |
| Use descriptive filenames | Duplicate content across files |
| Split large docs into sections | Mixing unrelated topics |
| Group related content | PII or sensitive data |

Specifications

| Spec | Value |
|---|---|
| Max file size | 50MB |
| Chunk size | ~250 tokens |
| Search latency | 40-120ms |

FAQ

What file formats work best?

Markdown and plain text yield the best results. PDFs work well if they’re text-based (not scanned images). Use OCR preprocessing for scanned documents.

How often is content re-indexed?

Uploaded files are indexed once at upload. External connectors (Google Drive) sync based on your configuration—typically every 1-24 hours.

Can I preview what chunks were created?

Not via the API currently. To inspect chunks, open Dashboard → Datasets → View.

How do I improve retrieval quality?

  1. Use specific, descriptive filenames
  2. Add summaries at the start of documents
  3. Remove boilerplate/headers that repeat across pages
  4. Split very long documents into logical sections

What happens if I delete a document?

The document and its chunks are removed. This affects new chats only—existing chat histories retain their context.