OCR Text Extractor

A desktop application that extracts text from images and screenshots using Tesseract OCR.

Features

Screenshot Capture: Capture screenshots directly from the application
Image Loading: Load images from your computer
OCR Processing: Extract text from images using Tesseract OCR
Text Editing: Edit extracted text before saving
Image Adjustments: Adjust contrast, brightness, and sharpness for better OCR results
Multiple Tabs: Work with multiple images simultaneously

Application Interface

The application features a clean, modern interface with multiple tabs for working with different images simultaneously:

Installation

Option 1: Windows Installer (Recommended)

Download the OCR Text Extractor Setup from the releases page
Run the installer and follow the on-screen instructions
The installer will automatically install Tesseract OCR for you

Option 2: Manual Installation

Clone this repository:

git clone https://github.com/kbtale/ocr-to-txt.git
cd ocr-to-txt

Install the required Python packages:
```
pip install -r requirements.txt
```
Or use the provided setup script on Windows:
```
setup.bat
```
Install Tesseract OCR:
- Windows: Download and install from https://github.com/UB-Mannheim/tesseract/wiki
- macOS: brew install tesseract
- Linux: sudo apt install tesseract-ocr
Make sure the Tesseract executable is in your system PATH or specify its location in the application settings.

Quick Start

Run the application:
- If you used the installer, launch from the Start menu or desktop shortcut
- If you installed manually:
```
python src/main.py
```
  Or use the provided batch file on Windows:
```
run.bat
```
Use the "Capture Screenshot" or "Load Image" buttons to get an image into the application
Adjust image processing settings if needed (contrast, brightness, etc.)
The OCR text will appear in the right panel where you can edit it
Use the "Save Text" option from the File menu to save your data

Advanced Options

Page Segmentation Mode (PSM): Controls how Tesseract analyzes the layout of the image
OCR Engine Mode (OEM): Selects which OCR engine Tesseract uses
Font Type: Optimizes processing for different types of fonts
Auto-deskew: Automatically straightens tilted images for better OCR results

Troubleshooting

If you encounter issues with OCR quality:

Ensure your image has sufficient resolution (300 DPI recommended)
Try adjusting contrast and brightness
Experiment with different PSM modes
Make sure Tesseract is properly installed and accessible

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
run.bat		run.bat
run.py		run.py
setup.bat		setup.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

OCR Text Extractor

Features

Application Interface

Installation

Option 1: Windows Installer (Recommended)

Option 2: Manual Installation

Quick Start

Advanced Options

Troubleshooting

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Languages

License

kbtale/ocr-to-txt

Folders and files

Latest commit

History

Repository files navigation

OCR Text Extractor

Features

Application Interface

Installation

Option 1: Windows Installer (Recommended)

Option 2: Manual Installation

Quick Start

Advanced Options

Troubleshooting

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Languages

Packages