# AI Models

Curiosity Workspace uses AI models in three common ways:

  • Understanding: NLP pipelines that extract structure from text (entities, signals, links).
  • Retrieval: embeddings used for semantic similarity (vector search) and re-ranking.
  • Generation: LLMs used for synthesis, assistance, and workflow automation.

The important architectural point: AI features are most reliable when they are grounded in your workspace data via graph + search retrieval.

# Embedding models (semantic similarity)

Embedding models map text (and sometimes other modalities) into vectors. In Curiosity Workspace, embeddings are used for:

  • vector search (find semantically similar items)
  • clustering and similarity (group related items)
  • candidate generation for AI-assisted workflows
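As a rough illustration of how vector similarity works in general (not of Curiosity Workspace's internal API), the sketch below embeds a few example texts with an off-the-shelf sentence-transformers model and ranks them against a query by cosine similarity. The model name and texts are arbitrary placeholders.

```python
# Illustrative only: a generic embedding + cosine-similarity ranking sketch.
# The model name and example texts are arbitrary placeholders, not a statement
# about which embedding model Curiosity Workspace ships with.
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # placeholder model

documents = [
    "Quarterly maintenance report for pump station 7",
    "Customer complaint about a delayed delivery",
    "Supplier contract renewal for bearing components",
]
query = "late shipment issue raised by a customer"

# Encode into unit-length vectors; cosine similarity is then a dot product.
doc_vectors = model.encode(documents, normalize_embeddings=True)
query_vector = model.encode([query], normalize_embeddings=True)[0]

scores = doc_vectors @ query_vector
for score, text in sorted(zip(scores, documents), reverse=True):
    print(f"{score:.3f}  {text}")
```

Conceptually, vector search applies this kind of ranking across the fields you configure for embeddings.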

Design considerations:

  • choose which fields get embeddings (usually long, descriptive text)
  • enable chunking when fields exceed model context limits (see the sketch after this list)
  • decide whether vector search is a primary retrieval method or a supplement to text search
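A minimal sketch of the chunking consideration, assuming a simple word-based splitter with overlap; real pipelines often chunk by tokens or sentences, and the sizes here are arbitrary:

```python
# Illustrative only: split long text into overlapping, word-based chunks so each
# piece fits within an embedding model's context limit. Sizes are example values.
def chunk_text(text: str, max_words: int = 200, overlap: int = 40) -> list[str]:
    words = text.split()
    chunks = []
    step = max_words - overlap
    for start in range(0, len(words), step):
        chunk = " ".join(words[start:start + max_words])
        if chunk:
            chunks.append(chunk)
        if start + max_words >= len(words):
            break
    return chunks

# Typically, each chunk is embedded and indexed separately, keeping a reference
# back to the field/record it came from so results can point at the original item.
```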

See NLP → Embeddings and Search → Vector Search.

# NLP pipelines (extraction + enrichment)

NLP pipelines transform raw text into structured outputs, such as:

  • extracted entities (people, products, IDs)
  • normalized tokens and language-specific parsing
  • optional entity linking into your graph
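For a concrete picture of the extraction step, here is a generic sketch using spaCy purely as a familiar stand-in; it is not a claim about the specific pipeline components Curiosity Workspace runs:

```python
# Illustrative only: generic named-entity extraction with spaCy, used as a
# stand-in for an extraction stage. Requires the small English model:
#   python -m spacy download en_core_web_sm
import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("Contoso Ltd. shipped order A-1042 to Maria Garcia in Lisbon on 3 March 2024.")

for ent in doc.ents:
    # ent.label_ is the predicted type (ORG, PERSON, GPE, DATE, ...); these
    # spans are the raw material for facets, graph links, and grounding.
    print(ent.text, ent.label_)
```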

This enables:

  • better filters (entities become facets)
  • better graph navigation (mentions → resolved entities)
  • better retrieval grounding for LLMs
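To make the extraction-to-retrieval link concrete, the sketch below shows one hypothetical shape for the result: mentions resolved to graph node keys and grouped into facet filters. Every identifier is invented for illustration and does not reflect a Curiosity Workspace schema.

```python
# Illustrative only: one possible shape for linking extracted mentions to graph
# entities and deriving facet filters. All identifiers below are made up.
mentions = [
    {"text": "Maria Garcia", "type": "Person"},
    {"text": "Contoso Ltd.", "type": "Company"},
]

# Entity linking: map surface forms to resolved node keys in the graph.
resolved = {
    "Maria Garcia": "person/maria-garcia",
    "Contoso Ltd.": "company/contoso",
}

# Facets: group resolved entities by type so they can drive filters.
facets: dict[str, list[str]] = {}
for mention in mentions:
    key = resolved.get(mention["text"])
    if key:
        facets.setdefault(mention["type"], []).append(key)

print(facets)  # {'Person': ['person/maria-garcia'], 'Company': ['company/contoso']}
```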

See NLP → Overview and NLP → Entity Extraction.

# LLMs (generation + orchestration)

LLMs are typically used to:

  • answer questions using retrieved context
  • summarize, classify, or extract structured outputs
  • drive multi-step workflows (tools, endpoints, actions)
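As one way to picture the "structured outputs" case, the sketch below asks an LLM to classify a ticket and reply in JSON. The OpenAI Python client is used only as a familiar example; the model name, labels, and prompt are placeholders, not a prescribed integration.

```python
# Illustrative only: LLM-based classification with a structured (JSON) reply.
# The OpenAI client and model name are placeholder choices.
import json
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

ticket = "The replacement bearing arrived two weeks late and the invoice was wrong."

response = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model name
    messages=[
        {"role": "system", "content": (
            "Classify the support ticket. Reply with JSON: "
            '{"category": "...", "urgency": "low|medium|high"}'
        )},
        {"role": "user", "content": ticket},
    ],
    response_format={"type": "json_object"},
)

result = json.loads(response.choices[0].message.content)
print(result["category"], result["urgency"])
```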

Recommended patterns:

  • retrieval first: fetch relevant nodes/documents before prompting (sketched after this list)
  • tooling: move business logic into endpoints/tasks rather than relying on prompts alone
  • auditability: store inputs/outputs where needed (policy dependent)
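A minimal sketch of the retrieval-first pattern, assuming a hypothetical search_workspace helper that stands in for graph + search retrieval (it is not a real Curiosity Workspace API) and reusing the placeholder OpenAI client from the previous sketch; the audit record at the end is likewise only a shape, not a prescribed log format:

```python
# Illustrative only: retrieval first, then prompt, then (optionally) audit.
# `search_workspace` is a hypothetical stand-in for your retrieval layer.
import json
import time
from openai import OpenAI

client = OpenAI()

def search_workspace(query: str, top_k: int = 5) -> list[str]:
    # Hypothetical: a real system would run graph + search retrieval here.
    # Hard-coded placeholder snippets keep the sketch self-contained.
    return [
        "Order A-1042 shipped on 3 March, two weeks after the promised date.",
        "Invoice INV-887 for order A-1042 was corrected on 10 March.",
    ][:top_k]

def answer_with_context(question: str) -> str:
    snippets = search_workspace(question)
    context = "\n\n".join(snippets)
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model name
        messages=[
            {"role": "system", "content": (
                "Answer using ONLY the provided context. "
                "If the context is insufficient, say so."
            )},
            {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {question}"},
        ],
    )
    answer = response.choices[0].message.content
    # Auditability (policy dependent): record inputs and outputs somewhere durable.
    audit_record = {"ts": time.time(), "question": question,
                    "context": snippets, "answer": answer}
    print(json.dumps(audit_record))  # replace with your logging sink
    return answer

print(answer_with_context("Why was the customer's order late?"))
```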

See AI & LLMs → Overview and AI & LLMs → Prompting Patterns.

# Safety and governance (conceptual)

Production AI typically needs:

  • permission-aware retrieval
  • logging/auditing for sensitive workflows
  • strict separation of admin-only capabilities
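A minimal sketch of permission-aware retrieval, assuming each candidate item carries an access-control list and the caller's groups are known; the data shapes are invented for illustration:

```python
# Illustrative only: filter retrieved candidates by the requesting user's
# permissions *before* they reach the LLM context. Data shapes are invented.
from dataclasses import dataclass

@dataclass
class Candidate:
    text: str
    allowed_groups: set[str]

def permission_filter(candidates: list[Candidate], user_groups: set[str]) -> list[Candidate]:
    # Keep only items that at least one of the user's groups may read.
    return [c for c in candidates if c.allowed_groups & user_groups]

candidates = [
    Candidate("Supplier contract terms", {"legal"}),
    Candidate("Public product FAQ", {"everyone", "legal"}),
]

visible = permission_filter(candidates, user_groups={"everyone"})
print([c.text for c in visible])  # only the FAQ survives
```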

See Administration → Security.

# Next steps