In OCR, how should I deal with the warped text on the sides of oval objects?

Question

Consider an image that contains one can (or bottle, or any similar oval object), which has texts all over it. In the image below, I have many bottles, but you can assume that each image only contains one such object.

As we can see, in each can, the text can flow from left to right, and any OCR system may miss the text on the left and right sides of the can, as they are not aligned with the camera angle.

So, is there any solution/s for this, like preprocessing in a certain way, so that we can read the text or make this round object into a straight one? (If there is any Python program that can solve this problem, could you please share it with me?)

Comments are not for extended discussion; this conversation has been [moved to chat](https://chat.stackexchange.com/rooms/118228/discussion-on-question-by-red-in-ocr-how-should-i-deal-with-the-warped-text-on). — nbro, Jan 09 '21 at 00:05

Brian O'Donnell · Accepted Answer · 2021-01-07T21:19:39.077

There are many papers on this but the following is a good start:

How to unwrap wine labels programmatically. The author includes source code in Python.

You mentioned you do not want to do a panoramic view but that has more than one meaning. If I assume you mean you do not want to rotate the can while taking multiple photos, or you don't want to take multiple photos from different angles, you could try a pericentric lens. This would require some image processing to do the unwrapping. More resolution is needed as the wrapping is much more severe. The advantage though is that you will have a single image of the full cylindrical surface and won't miss any features or text.

In OCR, how should I deal with the warped text on the sides of oval objects?

1 Answers1