Extracta.ai is an AI-powered platform that automates data extraction from various document types, including PDFs, images, and text files, without requiring prior training.
Creates a new document classification configuration. Define a list of possible document types with their characteristics (name, description, unique words). Returns a classification ID that can be used to upload documents for automatic type prediction. This is the first step before uploading documents for classification.
Creates a new extraction configuration for processing documents. Define what fields to extract (e.g., names, dates, amounts) and processing options. Returns an extraction ID that can be used to upload and process files. This is the first step before uploading documents for extraction.
Permanently deletes an entire document classification process including all associated batches, results, and uploaded files. Use this when you want to remove a classification that is no longer needed. WARNING: This action cannot be undone.
Permanently deletes an extraction job and its configuration from the system. Use this when you want to remove an extraction job that is no longer needed. This action is idempotent - calling it multiple times with the same ID will not cause errors. Requires the extraction ID obtained from creating or viewing an extraction.
Retrieves extraction results for a specific batch of documents. Returns the extracted data for each file in the batch, along with processing status and file information. If the batch is still processing, results may be empty or incomplete. Maintain 2-second intervals between consecutive requests to avoid rate-limiting.
Retrieves the current credit balance available on the account. The system operates on a per-page consumption model where 1 credit = 1 page of document processing. Use this action to check remaining credits before processing documents.