Product Overview: MARS Data Mining Studio (MDMS)
The MARS Data Mining Studio (MDMS) is a component of the MARS platform designed for advanced data extraction, transformation, and structuring, particularly from complex or unstructured sources.
Key Capabilities:
- Data Extraction: Extracts data from a wide array of sources including various file types, print streams (AFP, Line Data, etc.), archives, and database outputs.
- Structuring Unstructured Data: Capable of identifying and extracting specific information from unstructured or semi-structured content (like PDFs, images, text files) and organizing it into a structured format (e.g., XML, PDF, text).
- Rules Engine: Allows users to define rules and transformations for data extraction using methods like regular expressions (RegEx), wildcards, OCR triggers, and built-in controls for various data types (text, date, currency, numbers).
- Automation: Supports saving extraction rules into templates (Helix Config Files) to automate large-scale extraction and the generation of new reports from diverse inputs like documents, scans, and data feeds.
- Viewing & Interaction: Includes an advanced viewer (MDMS Viewer) capable of handling large files (e.g., 200,000+ page PDFs/AFPs) quickly, allowing programmatic search and data lifting via text, OCR, barcodes, or QR codes.
- Accessibility: Available via web interface, Windows desktop application, Linux, and iOS. Supports a REST API (MDMS API) for automation.