PDFmdx recognize documents, split and extract Metadata
PDFmdx is an application to recognize PDF documents based on defined conditions and contents, classifythem as well as spli document packages to single documents.
Out of the classified and splitted documents you can extract content, based on defined templates, and write them to an index file for further processing and use.
With this way existing PDF documents or created (eg. PDF Printer) PDF documents or scanned documents plus OCR (Image in foreground and Text in background, can be converted and processed.
Beside recognition, split and extraction of contents PDFmdx also offers a range of post processing features for autmatic processes.
