Skip to content

MARS Scanner Export Formats

After the Helix MARS Scanner application completes its processing workflow (separation, OCR, classification, extraction, optional verification), it exports the captured data and associated document metadata in structured formats suitable for ingestion into downstream systems or databases.

Common Export Formats:

  • CSV (Comma-Separated Values): A widely compatible text-based format where data fields are separated by commas (or other delimiters). Each row typically represents a document or a transactional record, with columns corresponding to the extracted data fields and metadata. CSV files are easily imported into spreadsheets (like Microsoft Excel) and databases.
  • XML (Extensible Markup Language): A flexible, tag-based format that allows for hierarchical data structuring. Exported XML files typically adhere to a predefined schema, organizing the extracted data fields, metadata, and potentially document relationships in a machine-readable structure suitable for system-to-system integration.

The choice of export format often depends on the requirements of the target system or application that will consume the extracted data. Configuration options usually allow customization of the specific fields included in the export file.