Extract Text from Images & PDFs - .NET Parser API 24.6 (DLLs)


Are you looking for an evaluation version of a product?

If so, you can download any of the versions below for testing. The product functions normally apart from an evaluation limitation. At the time of purchase, we provide a license file by email that allows the product to work at full capacity. If you would also like an evaluation license to test without restrictions for 30 days, please follow the directions provided here.

Are you having trouble downloading?

If you experience errors when you try to download a file, make sure your network policies (enforced by your company or ISP) allow downloading ZIP and/or MSI files.

GroupDocs.Parser for .NET 24.6 (DLLs only)


File Details

Downloads: 1
File Size: 301.04MB
Date Added: 29/6/2024

Release Notes

https://releases.groupdocs.com/parser/net/release-notes/2024/groupdocs-parser-for-net-24-6-release-notes/

Description

This ZIP file contains only the GroupDocs.Parser for .NET 24.6 assemblies. The assemblies are the same as in the MSI installer of the product of the same version. Download this if you want to use GroupDocs.Parser for .NET without the MSI installer. This ZIP download does not contain the demo projects.


The GroupDocs.Parser for .NET 24.6 DLLs-only package adds image-based data extraction. With this release, applications running on Windows, Linux, and macOS can extract text from images and PDFs.

New Feature: Extract Text from Images and PDFs

This release of the .NET parser API can extract text from image files and from PDF documents that contain no plain text content. The feature uses OCR to convert image-based content into editable text. The following C# code extracts text from a scanned PDF document:


using System;
using System.IO;
using GroupDocs.Parser;
using GroupDocs.Parser.Options;

// Create an instance of the Parser class
using (Parser parser = new Parser("scanned.pdf"))
{
    // Create TextOptions with raw mode disabled (false) and OCR enabled (true)
    TextOptions options = new TextOptions(false, true);
    // Extract text using OCR
    using (TextReader reader = parser.GetText(options))
    {
        // Print the text or a 'not supported' message
        Console.WriteLine(reader == null ? "Text extraction isn't supported" : reader.ReadToEnd());
    }
}


This code sample illustrates extracting text from images:

using System;
using System.IO;
using GroupDocs.Parser;

// Create an instance of the Parser class
using (Parser parser = new Parser("scanned.jpg"))
{
    // Extract text using OCR
    using (TextReader reader = parser.GetText())
    {
        // Print the text or a 'not supported' message
        Console.WriteLine(reader == null ? "Text extraction isn't supported" : reader.ReadToEnd());
    }
}
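
The PDF and image samples differ only in whether TextOptions is passed to GetText. As a minimal sketch, they could be folded into one helper that picks the call by file extension. The class name OcrTextExample and the method ExtractText are hypothetical names introduced for illustration; the Parser, TextOptions, and GetText calls are the ones shown in the samples above.

```csharp
using System;
using System.IO;
using GroupDocs.Parser;
using GroupDocs.Parser.Options;

// Hypothetical wrapper class, not part of the GroupDocs.Parser API
static class OcrTextExample
{
    // Hypothetical helper: returns the extracted text,
    // or null when text extraction isn't supported for the file.
    public static string ExtractText(string filePath)
    {
        // Decide which sample to follow based on the file extension
        bool isPdf = string.Equals(
            Path.GetExtension(filePath), ".pdf", StringComparison.OrdinalIgnoreCase);

        using (Parser parser = new Parser(filePath))
        // PDFs follow the first sample (raw mode off, OCR on);
        // other files follow the image sample (GetText without options).
        using (TextReader reader = isPdf
            ? parser.GetText(new TextOptions(false, true))
            : parser.GetText())
        {
            // GetText returns null when extraction isn't supported
            return reader?.ReadToEnd();
        }
    }

    static void Main(string[] args)
    {
        // Process every path passed on the command line
        foreach (string path in args)
        {
            string text = ExtractText(path);
            Console.WriteLine(text ?? "Text extraction isn't supported");
        }
    }
}
```

Returning null instead of printing inside the helper leaves the caller free to decide how to report unsupported files.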


Enhanced Platform Compatibility

This functionality requires a development environment based on .NET Core 3.1 or later. OCR currently supports the English language only.
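
An application that may run on several runtimes could guard the OCR path with a version check. The following is only a sketch: the class RuntimeCheck and the method IsNetCore31OrLater are hypothetical names, and the check relies on the standard Environment.Version property, which reports 3.1.x on .NET Core 3.1, 5 and above on later .NET releases, and 4.0.30319.x on .NET Framework and on .NET Core versions before 3.0.

```csharp
using System;

// Hypothetical helper class, not part of the GroupDocs.Parser API
static class RuntimeCheck
{
    // True for runtime versions 3.1.x and for 5.0 or later;
    // false for 3.0.x and for the 4.0.30319.x value reported by older runtimes.
    public static bool IsNetCore31OrLater(Version version)
    {
        return version.Major >= 5 || (version.Major == 3 && version.Minor >= 1);
    }

    static void Main()
    {
        Version current = Environment.Version;
        Console.WriteLine(IsNetCore31OrLater(current)
            ? "Runtime " + current + " meets the .NET Core 3.1 requirement"
            : "Runtime " + current + " is older than .NET Core 3.1");
    }
}
```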

Public API Changes

In this release, the OcrConnectorBase class gains the IsTextAreasSupported, IsTextPageSupported, and IsTextSupported properties.

For the full list of new features, enhancements, and bug fixes introduced in this release, see the GroupDocs.Parser for .NET 24.6 Release Notes.
