MDMS Data Extraction Sources
The MARS Data Mining Studio (MDMS) offers extensive connectivity, enabling it to access and extract data from a wide spectrum of enterprise information sources. This broad compatibility is fundamental to its role in consolidating and structuring data from potentially siloed systems.
Major Categories of Supported Sources:
- Enterprise Content Management (ECM) & Document Management Systems (DMS): MDMS possesses capabilities to interface with numerous commercial and legacy platforms. Examples include various generations of IBM FileNet (Panagon, P8, IS, CM8), IBM Content Manager OnDemand (CMOD), Hyland OnBase, the OpenText portfolio (Content Suite, Documentum, Vista Plus, Hummingbird, Alchemy), Oracle's content platforms (Optika, Stellant, UCM), Microsoft SharePoint, ASG Mobius, CA View/Deliver, BMC Control-D, Laserfiche, Newgen Software products, OIT Docfinity, Systemware X/PTR, Alfresco, among others. Integration often utilizes native methods, potentially bypassing standard vendor APIs.
- Print Streams & Report Archives: MDMS directly ingests and parses common enterprise report and print output formats, such as IBM AFP, Xerox DJDE/Metacode/LCDS, HP PCL, Adobe PostScript, and various types of Line Data (including EBCDIC and ASCII formats).
- Databases: It can process structured data derived from database query outputs or potentially establish connections to extract information directly from relational databases like Oracle DB, IBM DB2, Microsoft SQL Server, MySQL, and Informix.
- File Systems & Shares: MDMS accesses and processes files residing on standard network file systems (using network mounts, UNC paths, etc.) or local directories. This covers a vast range of file types including standard office documents (Word, Excel, PowerPoint), PDF files, various image formats (TIFF, JPEG, PNG), plain text files, XML, HTML, and many more.
- Scanned Document Input: MDMS integrates with the Helix MARS Scanner component, allowing it to directly process batches of digitized physical documents that originate from scanners, fax devices, or multi-function printers (MFPs).
- Custom & Other Systems: The platform's architecture often allows for configuration or development of connectors to interface with custom-built applications or less common data repositories encountered in specific enterprise environments.
This extensive source compatibility enables MDMS to serve as a powerful, centralized engine for gathering and processing information regardless of its original location or format within an organization's IT infrastructure.