CorpusPanel

Manages your dataset lifecycle: uploading PDFs, converting them to text, running GraphRAG indexing, and archiving graph states.

Capabilities

  • Drag-and-drop PDF upload (files are automatically converted to .txt)
  • Real-time indexing via SSE with a progress bar
  • Indexed overview (entities, relationships, communities, text units)
  • Archive current graph; list/restore/rename/delete archives
  • View live logs and retained log history

Status & Registry

  • GET /api/corpus/state merges on-disk files with the persisted output/uploads.json registry
  • After successful indexing, files are marked as indexed and their timestamps are recorded
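
The merge described above could look roughly like the following sketch. The registry schema (a JSON object keyed by file name, with `indexed` and `indexed_at` fields) and the helper names `load_registry`, `merge_state`, and `mark_indexed` are assumptions made for illustration; the actual layout of output/uploads.json may differ.

```python
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import Dict, Iterable, List, Optional


def load_registry(path: Path) -> Dict[str, dict]:
    """Read the registry JSON; return an empty mapping if it is missing or unreadable."""
    try:
        return json.loads(path.read_text(encoding="utf-8"))
    except (FileNotFoundError, json.JSONDecodeError):
        return {}


def merge_state(input_dir: Path, registry: Dict[str, dict]) -> List[dict]:
    """Combine files found on disk with registry entries (assumed keyed by file name)."""
    on_disk = set()
    if input_dir.is_dir():
        on_disk = {p.name for p in input_dir.iterdir() if p.is_file()}
    state = []
    for name in sorted(on_disk | set(registry)):
        entry = registry.get(name, {})
        state.append({
            "name": name,
            "on_disk": name in on_disk,
            "indexed": bool(entry.get("indexed", False)),
            "indexed_at": entry.get("indexed_at"),
        })
    return state


def mark_indexed(registry: Dict[str, dict], names: Iterable[str],
                 now: Optional[datetime] = None) -> Dict[str, dict]:
    """Return a copy of the registry with the given files flagged as indexed."""
    stamp = (now or datetime.now(timezone.utc)).isoformat()
    updated = dict(registry)
    for name in names:
        updated[name] = {**updated.get(name, {}), "indexed": True, "indexed_at": stamp}
    return updated
```

Taking the union of both sources means a file appears in the state whether it exists only on disk (not yet registered) or only in the registry (removed from disk since it was recorded).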

Index Stream

  • GET /api/corpus/index/stream launches graphrag index --config settings.yaml
  • The same request also converts any remaining PDFs to text and archives the originals under input/_pdfs
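
A client reading the index stream has to split the SSE text into events before it can update a progress bar. The sketch below shows one way to do that in Python. The payload shape (JSON with `completed` and `total` fields) and the `progress` event name are hypothetical, since the page does not document what the endpoint emits.

```python
import json
from typing import Iterable, Iterator, Optional


def iter_sse_events(lines: Iterable[str]) -> Iterator[dict]:
    """Yield {'event': ..., 'data': ...} for each event in a Server-Sent Events stream."""
    event, data = "message", []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if data:
                yield {"event": event, "data": "\n".join(data)}
            event, data = "message", []
            continue
        if line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        field, _, value = line.partition(":")
        if value.startswith(" "):
            value = value[1:]  # a single leading space after the colon is dropped
        if field == "event":
            event = value
        elif field == "data":
            data.append(value)


def progress_percent(event: dict) -> Optional[float]:
    """Hypothetical payload: JSON with 'completed' and 'total'. Returns None if absent."""
    try:
        payload = json.loads(event["data"])
        total = payload["total"]
        return 100.0 * payload["completed"] / total if total else None
    except (json.JSONDecodeError, KeyError, TypeError):
        return None
```

Any iterable of decoded text lines can feed `iter_sse_events`, for example the lines of a streaming HTTP response from GET /api/corpus/index/stream. Events whose data is not the assumed JSON shape, such as plain log lines, simply yield `None` from `progress_percent`.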