16.2.2
📅 2024-07-31
New features
Section titled “New features”NuGet packages
Section titled “NuGet packages”Starting with this release, you can download the OCR SDK from the IRIS download center as NuGet packages. These packages are in addition to the platform-specific packages. You can download them and then reference the NuGet packages directly in your projects.
NuGet packages are organized as follows:
Convenience NuGets
Section titled “Convenience NuGets”Five NuGet packages allow you to import all elements for a platform by using a single dependency.
- iDRS.macOS includes all .NET components, native runtimes, and assets that you need to build an application for macOS.
- iDRS.Linux-x64 includes all .NET components, native runtimes, and assets that you need to build an application for Linux and run it on x64 hardware.
- iDRS.Windows-x64 includes all .NET components, .NET Framework, native runtimes, and assets that you need to build an application for Windows and run it on x64 hardware.
- iDRS.Windows-x86 includes all .NET components, .NET Framework, native runtimes, and assets that you need to build an application for Windows and run it on x86 hardware.
- iDRS.Windows includes all .NET components, .NET Framework, native runtimes, and assets that you need to build an application for Windows and run it on x86 or x64 hardware.
These NuGet packages contain only dependencies. They are provided for convenience to simplify integration.
Content NuGets
Section titled “Content NuGets”Six other NuGet packages include the elements required by the convenience NuGet packages:
- iDRS.NET contains iDRS .NET and .NET Framework runtimes and product documentation.
- iDRS.runtime.<Platform-Arch> contains the native runtimes for the specific platform or platform-architecture.
- iDRS.assets contains iDRS resources files.
If you import one or more convenience NuGet packages, you do not need to import content NuGet packages.
NuGet with sample code
Section titled “NuGet with sample code”The last NuGet package includes the source code for the sample iDRS applications (CsReader, CsBarcode, and CsImage2Html). This NuGet contains only the sample source code and does not depend on iDRS. A Readme file in the NuGet gives instructions to use the sample code.
Improvements
Section titled “Improvements”High Quality OCR (HQOCR) for Japanese
Section titled “High Quality OCR (HQOCR) for Japanese”With this new version, our high-quality OCR engine (HQOCR) now supports Japanese. This HQOCR engine completely replaces the previous engine for processing documents in this language.
This switch to our cutting-edge engine will bring significant improvements in performance and accuracy for challenging documents such as low-resolution images, magazines, or newspapers, as well as a slight improvement in accuracy for good- quality scans.
Added/removed resources
Section titled “Added/removed resources”Japanese OCR resources
Section titled “Japanese OCR resources”As a result of the replacement of the Japanese OCR engine, the following resource files have been added or removed:
| Resource file | Status |
|---|---|
| japanese_rnn_engine.bin | New/added |
| net_japanese_rnn_hori.bin | New/added |
| net_japanese_rnn_vert.bin | New/added |
| japanese.bin | Removed |
| japanese_bigrams.bin | Removed |
| japanese_duo.bin | Removed |
| japanese_features.bin | Removed |
| japanese_features_fast.bin | Removed |
| japanese_features_one_candidate.bin | Removed |
| japanese_font_features.bin | Removed |
| japanese_rosetta_same_proba.bin | Removed |
| japanese_same_proba.bin | Removed |
| japanese_similar_characters_differentiation.bin | Removed |
Extended fontmetrics resource
Section titled “Extended fontmetrics resource”The extended font set is the default in this release. It replaces the historical set of three standard Latin fonts. The following resource is removed:
| Resource file | Status |
|---|---|
| fontmetrics_ec.bin | Removed |
Fixed bugs
Section titled “Fixed bugs”| Internal ID | Description | Service desk IDs |
|---|---|---|
| IDRSRD-6391 | iDRS .NET leaks memory when parsing object arrays | |
| IDRSRD-6473 | The iDRS SDK does not output the same number of zones during zonal if no font information is requested | |
| IDRSRD-6509 | iDRS exceptions source should not contain an absolute path | |
| IDRSRD-6858 | The iDRS can print message on console during OCR, when missing resource file | |
| IDRSRD-7491 | Cell’s text alignment is not properly set in iDRS XLSX output for some specific images | |
| IDRSRD-7560 | The iDRS should propose iterators in its Array class | |
| IDRSRD-8293 | The iDRS does not detect the correct orientation for a specific image | |
| IDRSRD-8294 | TheiDRS sometimes recognizes letter ‘O’ instead of Zero next to currency (dollar) sign | ISD-33781 |
| IDRSRD-9163 | The iDRS does not output correct paragraph position on a specific image | |
| IDRSRD-9192 | The iDRS does not use the correct font metrics file for default settings | |
| IDRSRD-9206 | The iDRS crashes while deskewing a very small image | |
| IDRSRD-9233 | The iDRS can keep a lot of memory allocated when processing multiple images with the same CTextRecognition object | |
| IDRSRD-9240 | Language detection ‘allowed’ list should support regional variants | |
| IDRSRD-9243 | iDRS memory increases when setting SetMemoryThreshold in CPageCollectionStorageOptions | |
| IDRSRD-9250 | The iDRS crashes when exporting to Docx with FontInfoComputation set to None | |
| IDRSRD-9267 | The iDRS throws an exception when recognizing a specific image with new page segmentation | ISD-34288 |
| IDRSRD-9271 | OCR alternatives are not provided when new page segmentation is activated | |
| IDRSRD-9275 | The iDRS triggers an exception while processing XLSX output on a specific image | |
| IDRSRD-9281 | The iDRS creates Editable docx with incorrect text direction on a Trad. Chinese image, when using new segmentation | |
| IDRSRD-9295 | iOS SmartCapture sample App does not position properly vertical text on the OCR review screen | |
| IDRSRD-9314 | The iDRS developer guide should include some tips explaining how to output best quality XLSX documents | |
| IDRSRD-9318 | Conversion to Excel single table of a specific file could be improved | |
| IDRSRD-9347 | OCR accuracy is degraded on high resolution (>300 dpi) documents | |
| IDRSRD-9379 | Documentation for SetEmptyFormsDetections needs to be updated | ISD-34648 |
| IDRSRD-9396 | The iDRS leaks small chunks of memory when accessing page sections using .NET API | ISD-34638 |
| IDRSRD-9403 | The iDRS should group text in fewer textboxes when converting documents to docx Editable layout | |
| IDRSRD-9426 | The iDRS crashes when processing the attached PDF with new page segmentation activated | ISD-34948 |
| IDRSRD-9435 | The iDRS crashes when processing a specific image with new page segmentation activated | ISD-34967 |