VNTranslator
Become a PatronJoin Our Discord
  • Overview
  • GETTING STARTED
    • Getting Started
    • System Requirements
    • Download & Installation
    • Interface Basics
  • Features
    • Launcher
    • Modules
    • Translation
      • Translation Settings
      • Translation Glossary
      • Extra Options
        • Pre-translation
        • Post-translation
        • Variables
      • Transcheck
      • Advanced Settings
      • Translation Editor
      • Export Translation
      • Import Translation
    • MT Engines
      • MT Settings
      • DeepL API
      • OpenAI Translate (Legacy)
    • Extra Window
    • Hyper Overlay
    • Extensions
      • JParser
      • Jisho
      • Fast OCR
    • Hotkeys
  • User Guide
    • Clipboard
    • OCR
      • Features
        • OCR Screen
        • OCR Master
        • Pre-processing
        • OCR Engines
          • Tesseract OCR
          • Windows OCR
          • Google Cloud Vision
          • Azure Cloud Vision
          • Google Lens
          • Custom - Command Line
          • Custom - HTTP POST
        • Post-processing
      • OCR Engine Installer
      • Understanding OCR and Improving Accuracy
      • OCR Engines Comparison
    • OCR GX 🎮
    • AutoTrans
      • Translation Modes
      • Font Replacement
      • Feature Settings
        • Translation
        • Mods
      • Extract & Translate
      • Steam Connect
      • RTL
      • FAQ
        • How Can I Improve Game Speed in AutoTrans?
        • How to Change Font Type in RenPy?
        • How to Change Font Size in Unity?
    • RenPy Games
    • Tyrano Builder
    • TextractorCLI
  • ADVANCED
    • LLMs
      • System Prompt
      • OpenAI API
      • GeminiAI API
    • Custom MT
      • Schema
      • Form Builder
      • Request & Response
      • Components
      • webLLM
      • MT Kit
      • V1 & V2 (Archive)
    • OCR Server Kit
      • EasyOCR
      • SuryaOCR
    • API Gateway
      • Translate
      • Translation Memory 🚧
    • RegExp
      • Matching
      • Replacement
  • Help
    • FAQ
    • Troubleshooting
      • Launcher
        • VNTranslator appears as a black box
      • Network Connection
      • Machine Translation (MT)
        • Web Scraping Timeout
        • API Error Codes
        • Clear Cookies & Site Data
      • OCR
      • AutoTrans
      • TextractorCLI
    • Glossary
    • Archives
      • Comparison of OCR Version 1.0 and 2.0
      • Workflow Explanation for OCR
      • OCR 1.0 (Archive)
        • Tesseract OCR
        • Screen Capture
        • Post-Capture Actions
    • Credits
Powered by GitBook
On this page
  • How OCR Works in VNTranslator OCR
  • 1. Screen Capture
  • 2. Pre-processing (Image Processing)
  • 3. Selecting the OCR Engine
  • 4. Post-processing
  • Tips for Improving OCR Accuracy in VNTranslator
  1. User Guide
  2. OCR

Understanding OCR and Improving Accuracy

This guide explains how OCR works in VNTranslator and provides practical tips to improve text recognition accuracy

PreviousOCR Engine InstallerNextOCR Engines Comparison

Last updated 5 months ago

How OCR Works in VNTranslator OCR

1. Screen Capture

The first step in the OCR process is capturing an image from the screen. The quality of the captured image significantly impacts the OCR engine to recognize text accurately.

2. Pre-processing (Image Processing)

This step is crucial for improving image quality. During pre-processing, the image is adjusted to display black text on a white background. This contrast makes it easier for the OCR engine to recognize the text.

3. Selecting the OCR Engine

Text recognition accuracy also depends on the chosen OCR engine. For example:

  • Traditional OCR Engines like Tesseract OCR and Windows OCR are optimized for recognizing black text on a white background. These engines work well in simple situations but may have trouble with colored text or complicated backgrounds. However, Windows OCR is slightly better than Tesseract OCR for recognizing colored text.

  • Modern OCR Engines like Google Cloud Vision or Azure Cloud Vision are more advanced and can recognize text with colored text or complex backgrounds.

4. Post-processing

After the OCR engine processes the text recognition, the result will be displayed. If recognition is inaccurate, corrections can be made during post-processing using Regular Expressions (RegExp) to refine the results.


Tips for Improving OCR Accuracy in VNTranslator

  • Ensure High-Quality Image Captures: The better the quality of the screen capture, the higher the accuracy of OCR. Avoid blurry or low-resolution images.

  • Use Effective Pre-processing (For Tesseract OCR and WIndows OCR): Adjust the image to have high contrast (black text on a white background) to make text recognition easier for the OCR engine.

  • Select the Appropriate OCR Engine: Choose an OCR engine that suits the characteristics of your text and background. Use modern OCR engines for colored text and complex backgrounds.

  • Utilize Post-processing: If text recognition is incorrect or you want to remove and match specific characters, you can use RegExp during post-processing to refine the output.