16.2.6+82
📅 2025-04-18
New features
Support for PDF/UA accessibility compliance
A new property has been added to `CPdfOutputParams`: `CPdfOutputParams.AccessibilityConformance`.
When this property is set, the generated PDF will conform to the PDF/UA standard, ensuring enhanced accessibility for assistive technologies.
Improvements
Improved DOCX Flowing output
Following the recent updates to the DOCX Exact and Editable layouts, the Flowing layout has now received significant enhancements.
All improvements made to the Editable layout since version 16.0 have also been incorporated into the Flowing layout, substantially increasing the quality of the generated DOCX outputs.
Zonal OCR output zone identifier
To facilitate matching between input and output zones during zonal OCR, an Identifier has been introduced to the following classes:
- `COcrZoneDescription`: upon creation, each instance is assigned a unique identifier. This identifier can be modified to provide a more meaningful name.
- `CPageZone`: when zonal OCR is performed, the identifier from each `COcrZoneDescription` is propagated to the corresponding `CPageZone` instance generated.
  - For full page OCR, the `CPageZone.Identifier` will remain empty.
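The propagation rule can be pictured with a small self-contained model. This is only a sketch: `ZoneDescription`, `PageZone`, `SimulateZonalOcr` and `FindZone` are hypothetical stand-ins written for this illustration and are not part of the iDRS API. The recognized text is a dummy value.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-ins for illustration only; these are NOT the iDRS classes.
struct ZoneDescription {
    std::string identifier;  // caller-chosen name, e.g. "IBAN"
};

struct PageZone {
    std::string identifier;  // copied from the input zone; empty for full page OCR
    std::string text;        // recognized text (dummy value in this sketch)
};

// Simulates zonal OCR: one output zone per input zone, identifier propagated as-is.
std::vector<PageZone> SimulateZonalOcr(const std::vector<ZoneDescription>& inputs) {
    std::vector<PageZone> outputs;
    for (const ZoneDescription& in : inputs) {
        outputs.push_back(PageZone{in.identifier, "<text of " + in.identifier + ">"});
    }
    return outputs;
}

// Returns the output zone carrying the given identifier, or nullptr if none does.
const PageZone* FindZone(const std::vector<PageZone>& zones, const std::string& id) {
    auto it = std::find_if(zones.begin(), zones.end(),
                           [&id](const PageZone& z) { return z.identifier == id; });
    return it != zones.end() ? &*it : nullptr;
}
```

Because the caller chooses the identifier, output zones can be looked up by name instead of by position or coordinates. With the SDK itself, the equivalent lookup compares `GetIdentifier()` on each `CPageZone`.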
Example of usage:

    // Define a zone where we expect an IBAN value
    COcrZoneDescription objZoneDescription1 = COcrZoneDescription::Create(ZoneType::Text, rc1);
    objZoneDescription1.SetIdentifier("IBAN");
    ....
    // Do zonal ocr recognition
    ....
    // Retrieve IBAN result
    CPageZoneArray::iterator itIban = std::find_if(xZones.begin(), xZones.end(), [](const CPageZone& obj) {
        return obj.GetZoneType() == ZoneType::Text && obj.GetIdentifier() == "IBAN";
    });

Added/removed resources
N/A
Fixed bugs
| Internal ID | Description | Service desk IDs |
|---|---|---|
| IDRSRD-9880 | The iDRS throws an exception when converting a specific document to DOCX | |
| IDRSRD-9870 | The iDRS throws an exception when creating a DOCX document from a specific Hebrew image | |
| IDRSRD-9835 | The iDRS creates an invalid DOCX output from a specific Arabic document | |
| IDRSRD-9831 | The iDRS generates characters with incorrect stretch factors when processing documents with English and Korean languages | |
| IDRSRD-9825 | The iDRS seems to be hanging indefinitely when recognizing specific images | ISD-36630 |
| IDRSRD-9822 | The iDRS crashes in CTextRecognition.RecognizeText on a specific image | |
| IDRSRD-9820 | The iDRS throws an exception when creating a flowing DOCX in a specific case | |
| IDRSRD-9813 | The iDRS can randomly crash when setting multiple threads for in-page processing | |
| IDRSRD-9800 | iDRS 16.2.5+119 consumes an unexpected amount of memory when running OCR | ISD-36544 |
| IDRSRD-9791 | Accuracy on Zonal OCR for Korean language drops by 7% on customer benchmark | |
| IDRSRD-9718 | The iDRS merges 2 clearly separate tables on a given document | |
| IDRSRD-9672 | Email and web links are not clickable in XLSX documents generated by iDRS | |
| IDRSRD-9628 | Language detection feature requires an unexpectedly large amount of resources | |
| IDRSRD-9588 | The supported character set of Japanese HQOCR lacks some expected characters | |
| IDRSRD-9479 | The iDRS generates DOCX documents with incorrect paragraph tabulations | |
| IDRSRD-9359 | The iDRS logic for selection of Word paper size should be improved | |
| IDRSRD-9277 | The iDRS inserts two spaces between words when exporting Arabic documents to DOCX in flowing mode | |
| IDRSRD-6469 | Skew detection fails on a grayscale image | |
| IDRSRD-7671 | Table elements are not properly positioned in DOCX output | |
| IDRSRD-5873 | Layout of iDRS DOCX output with flowing text should be improved | ISD-17602, ISD-33550 |
| IDRSRD-5545 | The iDRS mis-recognizes some text in a specific Japanese document | |
| IDRSRD-6962 | The iDRS should support PDF/UA standard | ISD-35915, ISD-34203, ISD-13201, ISD-21814 |
| IDRSRD-4742 | Numeric characters mismatched for Japanese OCR in some specific images | |