Optical Character Recognition (OCR)

Optical Character Recognition (OCR) allows you to detect and recognize printed characters in images and convert the characters into editable text in JSON format.

OCR provides services through open application programming interfaces (APIs). You can use programming languages such as Python and Java to call OCR APIs to recognize images as text, helping you automatically collect key data and build an intelligent service system to improve service efficiency. For details about how to obtain APIs, visit the API Reference.

OCR provides APIs for you to convert characters in images or scanned copies into editable text and returns the recognition result in JSON format. You can encode the recognition result and save it to a service system, or save it in TXT or Excel format.

Person uses various applications on the laptop

Reasons for OCR in the Open Telekom Cloud

Icon composition: Binary code and documents

Digitalizing paper documents

Automatically detects and extracts text, signatures, and seals from document images and converts them into structured data for faster review.

Icon composition: identification papers and binoculars

Wide scope

Identify key information from pictures of identification papers like ID cards, driver's licenses, passports or similar documents.

Icon composition: Server and touchscreen icon

Easy to use

Start using OCR quickly and with minimal effort with our standard RESTful APIs and high compatibility.

Key Features of Optical Character Recognition

Person looks at various documents on the laptop

Enhanced capabilities

Make the most of advanced features to effortlessly extract characters from challenging documents such as those with distorted or tilted backgrounds, seals, and interlaced elements or forms.

High accuracy

Implemented deep learning techniques tailored to each service scenario to achieve high-accuracy character recognition.

Constraints and Limitations

There are various factors, such as technology and cost, that limit the performance of OCR services. The system-level constraints are the most significant limitations that affect all sub-services. In addition to these system-level constraints, each sub-service also has its own independent limitations.

Only images in PNG, JPG, JPEG, BMP, or TIFF format can be recognized.
No side of the image can be smaller than 15 or larger than 8,192 pixels.
The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that the entire table and its surrounding area are included in the image.
An image can be rotated to any angle.
Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted table lines cannot be recognized.
Supported languages: Chinese, English, Malay, Ukrainian, Hindi, Russian, Vietnamese, Indonesian, Thai, Arabic, German, Latin, French, Italian, Spanish, Portuguese, Romanian, Polish, Amharic, Japanese, Korean, Turkish, Norwegian, Danish, and Swedish.
Support for traditional Chinese characters is limited.

Related Services

Identity and Access Management

Identity and Access Management (IAM) lets you control user authentication and access to OCR.

Object Storage Service

Object Storage Service (OBS) is a stable, secure, efficient, and easy-to-use cloud storage service. OCR APIs involve processing user data, which can be efficiently handled in batches using OBS.

OCR APIs allow for data retrieval and processing from OBS through temporary or anonymous public authorization.

New Features

04/02/2024New Optical Character Recognition (OCR) Service is now available in EU-DE region.View Details

07/01/2025New „Smart Document Recognizer” feature in OCRView Details

Don't want to miss any updates?Visit our portfolio roadmap and discover new services and updates.
Learn more

view all release notes

Find out more

Pricing overview

Price calculator
Pricing Models: Database & Analysis
Service description incl. price list (PDF)

Documentation

Opticle Character Recognition
Ask & exchange
Best practices & Blueprint

Book now and claim starting credit of EUR 250* (code: 4UOTC250)

Book now

Take advantage of our consulting services!
Our experts will be happy to help you.

We will answer any questions you have regarding testing, booking and usage – free and tailored to your needs. Try it out today!

Hotline: 24 hours a day, seven days a week

0800 3304477from Germany

+800 33044770from abroad

Write an E-mail

* Voucher can be redeemed until December 31, 2025. Please contact us when using the voucher for booking. The discount is only valid for customers with a billing address in Germany and expires two months after conclusion of the contract. The credit is deducted according to the valid list prices as per the service description. Payment of the credit in cash is excluded.