16.3.0

📅 2025-10-17

A new module, Module.Extraction, is now available on the Windows-x64 and Windows-x86 platforms.

This module includes algorithms and technologies designed to extract additional information from document content. It uses OCR results combined with user-defined rules and instructions.

The first feature in this module is a fuzzy search algorithm. It analyzes OCR results, including alternative interpretations of each character, to detect specific patterns such as dates, amounts, or any user-defined regular expression. You can access this feature through the new class CDataExtraction.
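
To illustrate the general idea of matching a pattern against OCR alternatives, here is a minimal, self-contained C++ sketch. It does not use the iDRS API and is not how CDataExtraction is implemented: the types OcrCell and OcrToken and the function fuzzyMatch are hypothetical names introduced only for this example. Each character position holds its candidate readings (best first), and the search tries combinations until one full reading matches the regular expression.

```cpp
#include <cassert>
#include <cstddef>
#include <regex>
#include <string>
#include <vector>

// Hypothetical model (not the iDRS data structures):
// one OCR character position with its candidate readings, best first.
using OcrCell = std::vector<char>;
// A token is a sequence of character positions.
using OcrToken = std::vector<OcrCell>;

// Depth-first search over candidate choices. Returns true as soon as one
// complete reading matches the whole pattern; 'reading' then holds it.
static bool searchReadings(const OcrToken& token, const std::regex& pattern,
                           std::size_t pos, std::string& reading) {
    if (pos == token.size()) {
        return std::regex_match(reading, pattern);
    }
    for (char candidate : token[pos]) {
        reading.push_back(candidate);
        if (searchReadings(token, pattern, pos + 1, reading)) {
            return true;
        }
        reading.pop_back();  // backtrack and try the next alternative
    }
    return false;
}

// Tries to find a reading of 'token' that matches 'pattern'.
// On success, stores the reading in 'out' and returns true.
// Note: the search is exponential in the worst case, so this sketch is
// only suitable for short tokens with few alternatives per position.
bool fuzzyMatch(const OcrToken& token, const std::regex& pattern,
                std::string& out) {
    for (const OcrCell& cell : token) {
        assert(!cell.empty());  // every position needs at least one candidate
    }
    std::string reading;
    reading.reserve(token.size());
    if (searchReadings(token, pattern, 0, reading)) {
        out = reading;
        return true;
    }
    return false;
}
```

For example, a token whose best reading is "2O25-lO-17", with '0' as an alternative for each 'O' and '1' as an alternative for 'l', matches the date pattern `\d{4}-\d{2}-\d{2}` through the reading "2025-10-17". For the actual API, refer to the CDataExtraction class and the sample projects shipped with the package.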

Code Samples

New sample projects have been added to the package to demonstrate how to use the fuzzy search feature:

  • C++ sample: located in samples/cpp/Extraction
  • C# sample: located in samples/cs/Extraction

These samples include source code and usage examples to help you integrate the feature quickly.

Internal ID  | Description                                                                             | Service desk IDs
-------------|-----------------------------------------------------------------------------------------|-----------------
IDRSRD-10052 | The iDRS doesn’t take the work image into account when calling CPageAnalysis.AnalyzePage() |
IDRSRD-10042 | The iDRS objects containing idrs_string are not thread-safe                             |
IDRSRD-10041 | The iDRS .NET callback IProgressPageProcessing.OnPercentageUpdate() is never called     | ISD-37374
IDRSRD-10031 | The iDRS throws an exception when recognizing a specific image                          | ISD-37314
IDRSRD-9964  | The iDRS creates DOCX Editable with text hidden by a table                              |
IDRSRD-9846  | The iDRS does not output checkmark characters correctly in PDF documents                |
IDRSRD-9716  | The iDRS doesn’t load text from specific PDFs                                           | ISD-36000