Open Telekom Cloud for Business Customers

Optical Character Recognition (OCR)

Optical Character Recognition (OCR) allows you to detect and recognize printed characters in images and convert the characters into editable text in JSON format.

OCR provides services through open application programming interfaces (APIs). You can use programming languages such as Python and Java to call OCR APIs to recognize images as text, helping you automatically collect key data and build an intelligent service system to improve service efficiency. For details about how to obtain APIs, visit the API Reference.

OCR provides APIs for you to convert characters in images or scanned copies into editable text and returns the recognition result in JSON format. You can encode the recognition result and save it to a service system, or save it in TXT or Excel format.

Person uses various applications on the laptop

Reasons for OCR in the Open Telekom Cloud

Icon composition: Binary code and documents

Digitalizing paper documents

Automatically detects and extracts text, signatures, and seals from document images and converts them into structured data for faster review.

Icon composition: identification papers and binoculars

Wide scope

Identify key information from pictures of identification papers like ID cards, driver's licenses, passports or similar documents.

Icon composition: Server and touchscreen icon

Easy to use

Start using OCR quickly and with minimal effort with our standard RESTful APIs and high compatibility.


Key Features of Optical Character Recognition

Person looks at various documents on the laptop

Enhanced capabilities

Make the most of advanced features to effortlessly extract characters from challenging documents such as those with distorted or tilted backgrounds, seals, and interlaced elements or forms.

 
Icon checklist


High accuracy

Implemented deep learning techniques tailored to each service scenario to achieve high-accuracy character recognition.

 

Constraints and Limitations

There are various factors, such as technology and cost, that limit the performance of OCR services. The system-level constraints are the most significant limitations that affect all sub-services. In addition to these system-level constraints, each sub-service also has its own independent limitations.

  • Only images in PNG, JPG, JPEG, BMP, or TIFF format can be recognized.
  • No side of the image can be smaller than 15 or larger than 8,192 pixels.
  • The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that the entire table and its surrounding area are included in the image.
  • An image can be rotated to any angle.
  • Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted table lines cannot be recognized.
  • Supported languages: Chinese, English, Malay, Ukrainian, Hindi, Russian, Vietnamese, Indonesian, Thai, Arabic, German, Latin, French, Italian, Spanish, Portuguese, Romanian, Polish, Amharic, Japanese, Korean, Turkish, Norwegian, Danish, and Swedish. 
    Support for traditional Chinese characters is limited.
 

Related Services

Identity and Access Management

Object Storage Service

 

Find out more

 
 

Book now and claim starting credit of EUR 250* (code: 4UOTC250)

 
Take advantage of our consulting services!
Our experts will be happy to help you.
We will answer any questions you have regarding testing, booking and usage – free and tailored to your needs. Try it out today!

Hotline: 24 hours a day, seven days a week 
0800 3304477from Germany
+800 33044770from abroad

* Voucher can be redeemed until December 31, 2024. Please contact us when using the voucher for booking. The discount is only valid for customers with a billing address in Germany and expires two months after conclusion of the contract. The credit is deducted according to the valid list prices as per the service description. Payment of the credit in cash is excluded.

 
  • Communities

    The Open Telekom Cloud Community

    This is where users, developers and product owners meet to help each other, share knowledge and discuss.

    Discover now

  • Telefon

    Free expert hotline

    Our certified cloud experts provide you with personal service free of charge.

     0800 3304477 (from Germany)

     
    +800 33044770 (from abroad)

     
    24 hours a day, seven days a week

  • E-Mail

    Our customer service is available free of charge via E-Mail

    Write an E-Mail