How can I use OCR on a partial screen capture to get text?

Question

When I was still using Windows I loved using the capture2text OCR program to grab Japanese kanji from manga and dump them into jisho.org, and was wondering how I could get the same functionality on Ubuntu. Namely:

Take a partial screenshot upon hitting a designated hotkey (click+drag style).
Process the image through an OCR engine.
Output the result into the clipboard.

score 4 · Answer 1 · edited Apr 13 '17 at 12:23

Based off of this script (the 2nd one) I whittled the script down to this:

#!/bin/bash 
# Dependencies: tesseract-ocr imagemagick gnome-screenshot xclip

SCR="/home/takingitcasual/Documents/Translate/temp"

gnome-screenshot -a -f $SCR.png

mogrify -modulate 100,0 -resize 400% $SCR.png 
#should increase detection rate

tesseract $SCR.png $SCR -psm 10 -l jpn
cat $SCR.txt | xclip -selection clipboard
#you can only scan one character at a time

exit

Some goals I had with the modified file:

Removing the need for sudo (to allow for easy hotkey binding)
Replacing scrot (the way gnome-screenshot works looks much nicer IMO)
Simplify the script to something I can more easily understand (removal of temp files)
Limit recognition to one character at a time. (recognition was abysmal without that "-psm 10", and Tesseract kept throwing "empty page" errors as well)

Another two things I did were:

chmod a+x /home/takingitcasual/Documents/Translate/

and setting the temp files permissions to read/write for all. Not sure if it was redundant to do both.

Last thing was giving the bash command

bash /home/takingitcasual/Documents/Translate/screen_ts.sh

a shortcut via this.

If anybody has suggestions on modifying the script or anything else I'd love to hear it. (I have no previous experience with scripting, so I'm sure it can be improved.)

I have a similar question related to this question. Can you please check it out https://askubuntu.com/questions/1038099/capture2text-alternative-capture-text-from-screen-directly-in-ubuntu-mate — Ahmad Ismail, May 19 '18 at 14:12

How can I use OCR on a partial screen capture to get text?

1 Answers1

Linked