Skip to content

Schema Builder

A schema defines the fields you want DocuStack to extract from your documents. Instead of manually reading through pages, you describe the data you need — and DocuStack’s AI handles the rest.

For example, a medical claims schema might include:

FieldTypeDescription
patient_nameTextFull name of the patient
date_of_serviceDateDate the service was provided
diagnosis_codeTextICD-10 diagnosis code
total_billedCurrencyTotal amount billed
provider_nameTextName of the healthcare provider

The schema builder will provide a visual interface for creating and managing extraction schemas:

  1. Create a schema — Give it a name and description, then choose the document type it applies to.
  2. Add fields — Define each field with a name, data type, and optional description to guide the AI.
  3. Set validation rules — Mark fields as required, add format constraints, or define allowed values.
  4. Test against a sample — Upload a sample document and preview extraction results before deploying.
  5. Deploy — Apply the schema to incoming documents for automated extraction.

DocuStack schemas will support the following field types:

TypeExampleUse case
Text"Jane Doe"Names, descriptions, free-form text
Number42Counts, identifiers, numeric values
Currency$1,250.00Financial amounts with currency symbols
Date2025-03-15Dates in various formats
BooleantrueYes/no fields, checkboxes
List["item1", "item2"]Repeated or multi-value fields
  • Be specific in field descriptions — The more context you provide, the more accurate the extraction. Instead of “amount,” use “total amount billed to the insurance company.”
  • Start simple — Begin with a few high-value fields and add more as you validate results.
  • Use sample documents — Always test your schema against representative documents before production use.
  • Version your schemas — As document formats evolve, create new schema versions rather than modifying existing ones.