# Scheduled Tasks

Scheduled tasks run work periodically inside a Curiosity Workspace environment. They are commonly used for:

  • periodic ingestion (hourly/daily sync)
  • reindexing and maintenance operations
  • enrichment jobs (NLP reparse, entity linking, batch similarity)
  • reporting and aggregation

# What makes a good scheduled task

  • Idempotent: rerunning the task produces the same end state, so retries are safe.
  • Bounded: processes data in pages or batches rather than in a single unbounded pass.
  • Observable: emits logs and metrics, and records the reason for any failure.
  • Permission-aware: respects access control where needed.
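
The first three properties can be illustrated with a minimal Python sketch. It does not use the Curiosity Workspace API: `run_sync`, `make_source`, the `fetch_page` callback, and the in-memory `store` dictionary are all hypothetical names chosen for illustration. The task upserts records by `id` (idempotent), reads a capped number of pages per run (bounded), and logs progress and failures (observable).

```python
import logging
from typing import Callable, Dict, List, Optional, Tuple

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("scheduled_task")

Record = Dict[str, str]
# Hypothetical page fetcher: (cursor, page_size) -> (records, next_cursor or None)
FetchPage = Callable[[int, int], Tuple[List[Record], Optional[int]]]


def run_sync(fetch_page: FetchPage, store: Dict[str, Record],
             page_size: int = 100, max_pages: int = 50) -> int:
    """Upsert records page by page; return how many records were added or changed."""
    cursor = 0
    changed = 0
    for page_no in range(max_pages):  # Bounded: never more than max_pages per run
        try:
            records, next_cursor = fetch_page(cursor, page_size)
        except Exception:
            # Observable: record where and why the run failed, then re-raise
            log.exception("sync failed at cursor=%s", cursor)
            raise
        for rec in records:
            # Idempotent: keyed upsert, so a rerun writes nothing new
            if store.get(rec["id"]) != rec:
                store[rec["id"]] = rec
                changed += 1
        log.info("page=%d records=%d changed_total=%d", page_no, len(records), changed)
        if next_cursor is None:
            break
        cursor = next_cursor
    else:
        # The loop ended without reaching the last page
        log.warning("stopped after max_pages=%d; remaining data waits for the next run", max_pages)
    return changed


def make_source(rows: List[Record]) -> FetchPage:
    """Build an in-memory stand-in for a paginated upstream source."""
    def fetch_page(cursor: int, page_size: int) -> Tuple[List[Record], Optional[int]]:
        page = rows[cursor:cursor + page_size]
        nxt = cursor + page_size
        return page, (nxt if nxt < len(rows) else None)
    return fetch_page
```

Running `run_sync` twice against the same source and store changes records only on the first run; the second run reads the same pages and writes nothing. A real task would also persist the cursor between runs so that a run cut short by `max_pages` resumes where it stopped, which this sketch omits.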

# Task categories

  • Ingestion tasks
    • call connectors or integration logic on a schedule
  • Maintenance tasks
    • rebuild indexes, run backfills, and apply schema migrations (with care)
  • Enrichment tasks
    • reparse fields with updated NLP pipelines
    • compute derived relationships
  • Analytics tasks
    • precompute dashboard data and cached aggregates
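
As a rough illustration of how tasks in these categories might be described and selected for a run, the sketch below models each task as a name, a category, and an interval, then picks the ones that are due. It is an assumption-laden example, not the Curiosity Workspace scheduling API: `TaskSpec`, `due_tasks`, and the task names are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional


@dataclass
class TaskSpec:
    """Hypothetical description of one scheduled task."""
    name: str
    category: str                      # "ingestion", "maintenance", "enrichment", or "analytics"
    interval: timedelta                # minimum time between runs
    last_run: Optional[datetime] = None


def due_tasks(tasks: List[TaskSpec], now: datetime) -> List[TaskSpec]:
    """Return tasks that have never run or whose interval has elapsed."""
    return [
        t for t in tasks
        if t.last_run is None or now - t.last_run >= t.interval
    ]


now = datetime(2024, 1, 1, 12, 0)
tasks = [
    TaskSpec("hourly-sync", "ingestion", timedelta(hours=1), datetime(2024, 1, 1, 11, 0)),
    TaskSpec("nightly-reindex", "maintenance", timedelta(days=1), datetime(2024, 1, 1, 2, 0)),
    TaskSpec("nlp-reparse", "enrichment", timedelta(days=7)),  # never run yet
]
due = [t.name for t in due_tasks(tasks, now)]
```

Keeping the category on the task description makes it straightforward to apply different policies per category, for example running maintenance tasks only in a low-traffic window.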

# Next steps