tesseract – Page 2 – Tarik Billa

What is the ideal font for OCR?

September 11, 2023 by Tarik

After trying a lot of different fonts and OCR engines, I tend to get the best results using Consolas. It is a monospaced typeface like OCR-A, but easier for humans to read. Consolas is included in several Microsoft products. There is also an open-source font, Inconsolata, which is influenced by Consolas. Inconsolata is a … Read more

Preprocessing image for Tesseract OCR with OpenCV

September 5, 2023 by Tarik

I described some tips for preparing images for Tesseract here: Using tesseract to recognize license plates. In your example, there are several things going on… You need to get the text to be black and the rest of the image white (not the reverse). That’s what character recognition is tuned for. Grayscale is OK, as … Read more
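The polarity rule above (black text, white background) can be sketched in a few lines. This is a minimal illustration in plain Python rather than OpenCV, so it runs without extra packages. The function name `ensure_dark_text_on_light` and the "mostly dark means inverted" heuristic are assumptions made for this example, not something the post specifies.

```python
def ensure_dark_text_on_light(gray):
    """Return a copy of a grayscale image with dark text on a light background.

    `gray` is a list of rows, each a list of 0-255 intensities.
    Assumption (heuristic): the background covers most of the image, so a
    mostly dark image is taken to be light text on a dark background.
    """
    pixels = [p for row in gray for p in row]
    mean = sum(pixels) / len(pixels)
    if mean < 128:
        # Mostly dark: invert so the background becomes white.
        return [[255 - p for p in row] for row in gray]
    # Already light background: return an unchanged copy.
    return [row[:] for row in gray]


# Made-up 3x4 image: two white "text" pixels on a black background.
sample = [
    [0, 0, 0, 0],
    [0, 255, 255, 0],
    [0, 0, 0, 0],
]
fixed = ensure_dark_text_on_light(sample)
```

With a real image loaded through OpenCV, the same decision would typically be applied to the array's mean value before handing the image to Tesseract.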

How do I segment a document using Tesseract then output the resulting bounding boxes and labels

August 24, 2023 by Tarik

Success. Many thanks to the people at the Pattern Recognition and Image Analysis Research Lab (PRImA) for producing tools to handle this. You can obtain them freely on their website or GitHub. Below I give the full solution for a Mac running OS X 10.10 and using the Homebrew package manager. I use Wine to run Windows … Read more

Using tesseract to recognize license plates

August 13, 2023 by Tarik

Two things will fix this completely: Remove everything that is not text from the image. You need to use some CV to find the plate area (for example, by color) and then mask out all of the background. You want the input to Tesseract to be black and white, where text is black and … Read more
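As a rough sketch of the two steps, the function below blanks everything outside a plate rectangle and then thresholds what is left to pure black and white. It assumes the plate's bounding box has already been found by some other means. The name `isolate_plate`, the `(x, y, w, h)` box format, and the fixed threshold of 128 are all assumptions for illustration.

```python
def isolate_plate(gray, box, threshold=128):
    """Mask out the background and binarize the plate region.

    `gray` is a list of rows of 0-255 intensities.
    `box` is a hypothetical (x, y, w, h) plate rectangle found elsewhere.
    Pixels outside the box become white; pixels inside become black if
    darker than `threshold`, otherwise white.
    """
    x, y, w, h = box
    out = []
    for r, row in enumerate(gray):
        new_row = []
        for c, p in enumerate(row):
            inside = y <= r < y + h and x <= c < x + w
            new_row.append(0 if inside and p < threshold else 255)
        out.append(new_row)
    return out


# Made-up 3x4 image with a 2x2 "plate" whose top-left corner is at (1, 1).
image = [
    [30, 40, 50, 60],
    [20, 200, 10, 70],
    [90, 220, 240, 80],
]
clean = isolate_plate(image, box=(1, 1, 2, 2))
```

A fixed threshold is the simplest choice. Unevenly lit photos usually need an adaptive one.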

Where are the Tesseract API docs?

August 9, 2023 by Tarik

The latest documentation is now available here and here.

Tesseract and tiff format – spp not in set {1,3}

July 31, 2023 by Tarik

It probably means your TIFF image has an alpha channel, which the underlying Leptonica library used by Tesseract doesn’t support. If you’re using ImageMagick, be aware that operations such as -draw can cause alpha channels to be added. If you’re using convert in your workflow and want to remove the channel again … Read more
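The excerpt is cut off before it names its own fix, so the following is only one possible approach, not necessarily the one the post goes on to recommend. ImageMagick documents an `-alpha off` setting that disables the alpha channel, and the sketch below builds and runs such a `convert` call from Python. The helper names and file names are hypothetical.

```python
import subprocess


def strip_alpha_command(src, dst):
    """Build an ImageMagick command that writes `src` to `dst` without alpha.

    Assumption: the `convert` binary is on PATH (ImageMagick 7 installs may
    expose it as `magick` instead).
    """
    return ["convert", src, "-alpha", "off", dst]


def strip_alpha(src, dst):
    """Run the command and raise CalledProcessError if ImageMagick fails."""
    subprocess.run(strip_alpha_command(src, dst), check=True)


# Hypothetical file names, shown only to illustrate the resulting command.
cmd = strip_alpha_command("scan.tif", "scan_noalpha.tif")
```

Building the argument list separately from running it makes the command easy to log or inspect before any file is touched.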

Extracting code from photograph of T-shirt via OCR

July 30, 2023 by Tarik

You can probably type faster than you can clean up images and install OCR engines: #!/usr/bin/perl (my$d=q[AA GTCAGTTCCT CGCTATGTA ACACACACCA TTTGTGAGT ATGTAACATA CTCGCTGGC TATGTCAGAC AGATTGATC GATCGATAGA ATGATAGATC GAACGAGTGA TAGATAGAGT GATAGATAGA GAGAGA GATAGAACGA TC GATAGAGAGA TAGATAGACA G ATCGAGAGAC AGATA GAACGACAGA TAGATAGAT TGAGTGATAG ACTGAGAGAT AGATAGATTG ATAGATAGAT AGATAGATAG ACTGATAGAT AGAGTGATAG ATAGAATGAG AGATAGACAG ACAGACAGAT AGATAGACAG AGAGACAGAT TGATAGATAG ATAGATAGAT TGATAGATAG … Read more

Set Tesseract font for OCR

June 4, 2023 by Tarik

So far, this option is not available. The current version is Tesseract 5.

Using Tesseract for handwriting recognition

June 3, 2023 by Tarik

In short, you would have to train the Tesseract engine to recognize the handwriting. Take a look at this link: Tesseract handwriting with dictionary training. This is what the linked post says: It’s possible to train tesseract to recognize handwriting. Here are the instructions: https://tesseract-ocr.github.io/tessdoc/Training-Tesseract But don’t expect very good results. Academics have typically gotten … Read more

Getting the bounding box of the recognized words using python-tesseract

March 31, 2023 by Tarik

Use pytesseract.image_to_data():

    import pytesseract
    from pytesseract import Output
    import cv2

    img = cv2.imread('image.jpg')
    d = pytesseract.image_to_data(img, output_type=Output.DICT)
    n_boxes = len(d['level'])
    for i in range(n_boxes):
        (x, y, w, h) = (d['left'][i], d['top'][i], d['width'][i], d['height'][i])
        cv2.rectangle(img, (x, y), (x + w, y + h), (0, 255, 0), 2)

    cv2.imshow('img', img)
    cv2.waitKey(0)

Among the data returned by pytesseract.image_to_data(): … Read more
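The dictionary returned with `Output.DICT` also carries parallel `text` and `conf` lists, so the boxes can be narrowed to confidently recognized words. The sketch below works on a plain dictionary of that shape, which lets it run without Tesseract installed. The helper name `word_boxes`, the cutoff of 60, and the sample values are assumptions made up for this example.

```python
def word_boxes(data, min_conf=60.0):
    """Return (text, (x, y, w, h)) for entries at or above `min_conf`.

    `data` is a dict shaped like the result of
    pytesseract.image_to_data(..., output_type=Output.DICT).
    Non-word entries (pages, blocks, lines) have a confidence of -1 and
    empty text, so they are skipped.
    """
    boxes = []
    for i, text in enumerate(data["text"]):
        # Confidence may arrive as str, int, or float depending on version.
        conf = float(data["conf"][i])
        if conf < min_conf or not text.strip():
            continue
        box = (data["left"][i], data["top"][i], data["width"][i], data["height"][i])
        boxes.append((text, box))
    return boxes


# Made-up sample: one block-level entry and two words.
sample = {
    "text": ["", "Hello", "wrld"],
    "conf": ["-1", "96", "41"],
    "left": [0, 10, 80],
    "top": [0, 12, 12],
    "width": [200, 60, 50],
    "height": [50, 20, 20],
}
confident = word_boxes(sample)
```

Each returned box can be passed to `cv2.rectangle` in the same way as in the loop above.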