3

Consider an image that contains one can (or bottle, or any similar oval object), which has texts all over it. In the image below, I have many bottles, but you can assume that each image only contains one such object.

enter image description here

As we can see, in each can, the text can flow from left to right, and any OCR system may miss the text on the left and right sides of the can, as they are not aligned with the camera angle.

So, is there any solution/s for this, like preprocessing in a certain way, so that we can read the text or make this round object into a straight one? (If there is any Python program that can solve this problem, could you please share it with me?)

nbro
  • 39,006
  • 12
  • 98
  • 176
Red
  • 155
  • 5
  • Comments are not for extended discussion; this conversation has been [moved to chat](https://chat.stackexchange.com/rooms/118228/discussion-on-question-by-red-in-ocr-how-should-i-deal-with-the-warped-text-on). – nbro Jan 09 '21 at 00:05

1 Answers1

2

There are many papers on this but the following is a good start:

enter image description here

You mentioned you do not want to do a panoramic view but that has more than one meaning. If I assume you mean you do not want to rotate the can while taking multiple photos, or you don't want to take multiple photos from different angles, you could try a pericentric lens. This would require some image processing to do the unwrapping. More resolution is needed as the wrapping is much more severe. The advantage though is that you will have a single image of the full cylindrical surface and won't miss any features or text.

enter image description here

Brian O'Donnell
  • 1,853
  • 6
  • 20