Tesseract OCR

Download & Install Tesseract

Visit the Tesseract at UB Mannheim
Select the tesseract-ocr-w64-setup-v5.3.x.exe (64 bit) file to download the Tesseract executable installer
Once downloaded, open the executable file and follow the installation prompts

Make sure you have installed the tesseract-64bit in C:\Program Files\Tesseract-OCR

Trained Data Files (Languages)

You can download the .traineddata file for the language you need and place it in Tesseract OCR installation directory C:\Program Files\Tesseract-OCR\tessdata\[here] (this should be the same as where the tessdata directory is installed)

tessdata https://github.com/tesseract-ocr/tessdata Speed : Faster than tessdata-best Accuracy : Slightly less accurate than tessdata-best

tessdata-best (Recommended for video games) https://github.com/tesseract-ocr/tessdata_best Speed : Slowest Accuracy : Most accurate

tessdata-fast https://github.com/tesseract-ocr/tessdata_fast Speed : Fastest Accuracy : Least accurate

Page Segmentation Modes

The PSM allows you to select a segmentation method dependent on your particular image and the environment in which it was captured

Page segmentation modes

Orientation and script detection (OSD) only.

Automatic page segmentation with OSD.

Automatic page segmentation, but no OSD, or OCR. (not implemented)

Fully automatic page segmentation, but no OSD. (Default)

Assume a single column of text of variable sizes.

Assume a single uniform block of vertically aligned text.

Assume a single uniform block of text.

Treat the image as a single text line.

Treat the image as a single word.

Treat the image as a single word in a circle.

Treat the image as a single character.

Sparse text. Find as much text as possible in no particular order.

Sparse text with OSD.

Raw line. Treat the image as a single text line, bypassing hacks that are Tesseract-specific.

PreviousGemma 3 4b Vision NextWindows OCR

hashtagDownload & Install Tesseract

hashtagTrained Data Files (Languages)

hashtagPage Segmentation Modes

Download & Install Tesseract

Trained Data Files (Languages)

Page Segmentation Modes