16.2.6+82

📅 2025-04-18

`New features`

Support for PDF/UA accessibility compliance

A new property has been added to CPdfOutputParams: CPdfOutputParams.AccessibilityConformance.

When this property is set, the generated PDF conforms to the PDF/UA standard, which improves accessibility for users of assistive technologies.

`Improvements`

Improved DOCX Flowing output

Following the recent updates to the DOCX Exact and Editable layouts, the Flowing layout has now received significant enhancements.

All improvements made to the Editable layout since version 16.0 have also been incorporated into the Flowing layout, substantially increasing the quality of the generated DOCX outputs.

Zonal OCR output zone identifier

To make it easier to match input and output zones during zonal OCR, an Identifier has been added to the following classes:

- COcrZoneDescription: upon creation, each instance is assigned a unique identifier. This identifier can be changed to a more meaningful name.
- CPageZone: when zonal OCR is performed, the identifier of each COcrZoneDescription is propagated to the corresponding generated CPageZone instance.
  - For full-page OCR, CPageZone.Identifier remains empty.

Example of usage:

  // Define a zone where we expect an IBAN value
  COcrZoneDescription objZoneDescription1 = COcrZoneDescription::Create(ZoneType::Text, rc1);
  objZoneDescription1.SetIdentifier("IBAN");
  ....

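  // Perform zonal OCR recognition
  ....

  // Retrieve the IBAN result: the first text zone whose identifier is "IBAN"
  // (std::find_if requires #include <algorithm>)
  CPageZoneArray::iterator itIban = std::find_if(xZones.begin(), xZones.end(),
    [](const CPageZone& obj) {
      return obj.GetZoneType() == ZoneType::Text && obj.GetIdentifier() == "IBAN";
    });
  // itIban == xZones.end() means that no matching zone was found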
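
The lookup pattern can also be wrapped in a small helper that reports a missing zone explicitly. The sketch below is self-contained and does not use the iDRS headers: `ZoneKind`, `ZoneResult` and `FindTextZone` are hypothetical stand-ins introduced only for illustration, and the real CPageZone API may differ.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical stand-ins for illustration only; these are NOT the iDRS types.
enum class ZoneKind { Text, Table, Graphic };

struct ZoneResult {
  ZoneKind kind;
  std::string identifier;  // assumed to be empty for full-page OCR results
  std::string text;
};

// Returns a pointer to the first text zone carrying the given identifier,
// or nullptr when no such zone exists.
const ZoneResult* FindTextZone(const std::vector<ZoneResult>& zones,
                               const std::string& identifier) {
  auto it = std::find_if(zones.begin(), zones.end(),
                         [&](const ZoneResult& z) {
                           return z.kind == ZoneKind::Text &&
                                  z.identifier == identifier;
                         });
  return it != zones.end() ? &*it : nullptr;
}
```

Returning a null pointer rather than an end iterator keeps the "zone not found" case visible at the call site, which is useful when an expected field such as the IBAN is missing from the recognized page.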

`Added/removed resources`

N/A

`Fixed bugs`

Internal ID	Description	Service desk IDs
IDRSRD-9880	The iDRS throws an exception when converting a specific document to DOCX
IDRSRD-9870	The iDRS throws an exception when creating a DOCX document from a specific Hebrew image
IDRSRD-9835	The iDRS creates an invalid DOCX output from a specific Arabic document
IDRSRD-9831	The iDRS generates characters with incorrect stretch factors when processing documents with English and Korean languages
IDRSRD-9825	The iDRS seems to be hanging indefinitely when recognizing specific images	ISD-36630
IDRSRD-9822	The iDRS crashes in CTextRecognition.RecognizeText on a specific image
IDRSRD-9820	The iDRS throws an exception when creating a flowing DOCX in a specific case
IDRSRD-9813	The iDRS can randomly crash when setting multiple threads for in-page processing
IDRSRD-9800	iDRS 16.2.5+119 consumes an unexpected amount of memory when running OCR	ISD-36544
IDRSRD-9791	Accuracy of zonal OCR for the Korean language drops by 7% on a customer benchmark
IDRSRD-9718	The iDRS merges 2 clearly separate tables on a given document
IDRSRD-9672	Email and web links are not clickable in XLSX documents generated by iDRS
IDRSRD-9628	Language detection feature requires really unexpected resources
IDRSRD-9588	The supported character set of Japanese HQOCR lacks some expected characters
IDRSRD-9479	The iDRS generates DOCX documents with incorrect paragraph tabulations
IDRSRD-9359	The iDRS logic for selection of Word paper size should be improved
IDRSRD-9277	The iDRS inserts two spaces between words when exporting Arabic documents to DOCX in flowing mode
IDRSRD-6469	Skew detection failure on grayscale image
IDRSRD-7671	Table elements not properly positioned in DOCX output
IDRSRD-5873	Layout of iDRS DOCX output with flowing text should be improved	ISD-17602, ISD-33550
IDRSRD-5545	The iDRS mis-recognizes some text in a specific Japanese document
IDRSRD-6962	The iDRS should support PDF/UA standard	ISD-35915, ISD-34203, ISD-13201, ISD-21814
IDRSRD-4742	Numeric characters mismatched for Japanese OCR in some specific images