A desktop application that extracts text from images and screenshots using Tesseract OCR.
- Screenshot Capture: Capture screenshots directly from the application
- Image Loading: Load images from your computer
- OCR Processing: Extract text from images using Tesseract OCR
- Text Editing: Edit extracted text before saving
- Image Adjustments: Adjust contrast, brightness, and sharpness for better OCR results
- Multiple Tabs: Work with multiple images simultaneously
The application features a clean, modern interface with multiple tabs for working with different images simultaneously.
- Download the OCR Text Extractor Setup from the releases page
- Run the installer and follow the on-screen instructions
- The installer will automatically install Tesseract OCR for you
To install manually from source instead:
- Clone this repository:

      git clone https://github.com/kbtale/ocr-to-txt.git
      cd ocr-to-txt

- Install the required Python packages:

      pip install -r requirements.txt

  Or use the provided setup script on Windows:

      setup.bat

- Install Tesseract OCR:
  - Windows: Download and install from https://github.com/UB-Mannheim/tesseract/wiki
  - macOS: brew install tesseract
  - Linux (Debian/Ubuntu): sudo apt install tesseract-ocr
- Make sure the Tesseract executable is in your system PATH or specify its location in the application settings.
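The PATH requirement can be checked with a few lines of Python. This is a minimal sketch, not the application's own code: the function name `find_tesseract` and its `explicit_path` parameter are hypothetical, and only the standard library is used.

```python
import os
import shutil
import sys


def find_tesseract(explicit_path=None):
    """Return a usable Tesseract executable path, or None if none is found.

    explicit_path is a hypothetical stand-in for a location configured in
    the application settings; it is tried before the system PATH.
    """
    if (
        explicit_path
        and os.path.isfile(explicit_path)
        and os.access(explicit_path, os.X_OK)
    ):
        return explicit_path
    # shutil.which searches the directories listed in PATH (and honours
    # PATHEXT on Windows, so "tesseract" also matches "tesseract.exe").
    return shutil.which("tesseract")


if __name__ == "__main__":
    path = find_tesseract()
    if path is None:
        print("Tesseract not found; add it to PATH or set its location.",
              file=sys.stderr)
    else:
        print(f"Using Tesseract at {path}")
```

If the script reports that Tesseract was not found, either add the install directory to PATH or point the application at the executable directly.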
- Run the application:
  - If you used the installer, launch from the Start menu or desktop shortcut
  - If you installed manually:

        python src/main.py

    Or use the provided batch file on Windows:

        run.bat

- Use the "Capture Screenshot" or "Load Image" buttons to get an image into the application
- Adjust image processing settings if needed (contrast, brightness, etc.)
- The OCR text will appear in the right panel where you can edit it
- Use the "Save Text" option from the File menu to save your data
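To give a feel for what the contrast and brightness settings in the adjustment step do, here is a small pure-Python sketch on 8-bit grayscale values. It is an illustration under stated assumptions, not the application's implementation (which may rely on an imaging library and a different formula); `adjust_pixel` and `adjust_image` are hypothetical names.

```python
def adjust_pixel(value, brightness=1.0, contrast=1.0):
    """Adjust one 8-bit grayscale value (0-255).

    Contrast stretches the value away from mid-gray (128); brightness then
    scales the result. A factor of 1.0 leaves the value unchanged.
    """
    stretched = (value - 128) * contrast + 128
    scaled = stretched * brightness
    # Clamp to the valid 8-bit range.
    return max(0, min(255, round(scaled)))


def adjust_image(pixels, brightness=1.0, contrast=1.0):
    """Apply adjust_pixel to every value in a list of pixel rows."""
    return [[adjust_pixel(v, brightness, contrast) for v in row]
            for row in pixels]


if __name__ == "__main__":
    faded = [[100, 120], [140, 160]]  # low-contrast sample values
    print(adjust_image(faded, contrast=2.0))  # [[72, 112], [152, 192]]
```

Raising contrast pushes dark text and light background further apart, which is usually what helps OCR on faded or washed-out images.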
- Page Segmentation Mode (PSM): Controls how Tesseract analyzes the layout of the image
- OCR Engine Mode (OEM): Selects which OCR engine Tesseract uses
- Font Type: Optimizes processing for different types of fonts
- Auto-deskew: Automatically straightens tilted images for better OCR results
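The PSM and OEM settings correspond to the `--psm` and `--oem` options of the Tesseract command line. The sketch below shows how such a command could be assembled and run from Python; the helper names `build_tesseract_command` and `run_ocr` are hypothetical, and the application itself may call Tesseract through a wrapper library instead.

```python
import subprocess


def build_tesseract_command(image_path, psm=3, oem=3, lang="eng"):
    """Build a Tesseract CLI invocation that prints recognized text to stdout.

    psm: page segmentation mode, 0-13 (3 = fully automatic, the default;
         6 = assume a single uniform block of text).
    oem: OCR engine mode, 0-3 (3 = default, based on what is available).
    """
    if not 0 <= psm <= 13:
        raise ValueError("psm must be between 0 and 13")
    if not 0 <= oem <= 3:
        raise ValueError("oem must be between 0 and 3")
    return [
        "tesseract", str(image_path), "stdout",
        "--psm", str(psm),
        "--oem", str(oem),
        "-l", lang,
    ]


def run_ocr(image_path, **options):
    """Run Tesseract and return the recognized text (requires Tesseract on PATH)."""
    command = build_tesseract_command(image_path, **options)
    completed = subprocess.run(command, capture_output=True, text=True, check=True)
    return completed.stdout


if __name__ == "__main__":
    # Only prints the command; call run_ocr() with a real image to execute it.
    print(" ".join(build_tesseract_command("scan.png", psm=6)))
```

For a screenshot containing one paragraph of text, `psm=6` is often a reasonable first alternative to the default.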
If you encounter issues with OCR quality:
- Ensure your image has sufficient resolution (300 DPI recommended)
- Try adjusting contrast and brightness
- Experiment with different PSM modes
- Make sure Tesseract is properly installed and accessible
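For the resolution tip, the following sketch estimates how much a low-resolution image would need to be enlarged to reach the recommended 300 DPI. The function names are hypothetical, and the 96 DPI figure used in the example is an assumption about a typical screenshot, not a value taken from the application.

```python
def upscale_factor(current_dpi, target_dpi=300):
    """Return the scale factor needed to reach target_dpi (never below 1.0)."""
    if current_dpi <= 0:
        raise ValueError("current_dpi must be positive")
    return max(1.0, target_dpi / current_dpi)


def upscaled_size(width, height, current_dpi, target_dpi=300):
    """Return the (width, height) in pixels after scaling to target_dpi."""
    factor = upscale_factor(current_dpi, target_dpi)
    return round(width * factor), round(height * factor)


if __name__ == "__main__":
    # Assumed example: a 1920x1080 screenshot at 96 DPI.
    print(upscale_factor(96))             # 3.125
    print(upscaled_size(1920, 1080, 96))  # (6000, 3375)
```

Enlarging an image does not add real detail, so capturing or scanning at a higher resolution in the first place is preferable when possible.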
This project is licensed under the MIT License - see the LICENSE file for details.

