Prerequisites:
- Authentication configured (see Authentication guide)
Upload a file
Send a multipart form-data request with your file. The server starts extraction automatically and returns the source metadata.Endpoint: Response:
POST /api/sources/uploadCheck extraction status
Poll the source until extraction_status reaches a terminal state: extracted (success), attention_required (partial — some pages failed but the source is still indexable), failed, or cancelled. Typically takes a few seconds for small documents.Endpoint:
GET /api/sources/{id}View extracted content
Retrieve the extracted text.
GET /page-texts returns { "page_texts": [string, ...], "count": N } — page_texts is an array of strings, one per page, in order. To fetch a single page, pass ?page=N, which returns { "text": string, "page": N, "count": N }.Endpoint: GET /api/sources/{id}/page-textsWhat’s Next
Create a Knowledge Base
Index your extracted content for semantic search.
Sources & Extraction
Understand the extraction pipeline in depth.
Sources API Reference
Full endpoint documentation.