Skip to content

A Flask-based web app that uses OCR technology to extract text from uploaded images. Users can upload images, preview them, and download the extracted text as a file.

Notifications You must be signed in to change notification settings

HariPasapuleti/Text-Extractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🖼️ Image Text Extractor

A Flask-powered web application that extracts text from images using Tesseract OCR. The app supports image preprocessing for improved accuracy, an intuitive interface, and the ability to download extracted text.

🌟 Features

  • OCR-Based Text Extraction: Leverages Tesseract OCR for reliable text recognition.
  • Preprocessing for Accuracy: Automatically preprocesses images (grayscale and thresholding).
  • Image Preview: See your uploaded image before processing.
  • Text Export: Download extracted text as a .txt file with a single click.
  • Responsive UI: A mobile-friendly, modern interface.

🚀 Live Demo

You can view a live demo of the app here


🛠️ Tech Stack

  • Backend: Flask, Python
  • Frontend: HTML, CSS, JavaScript
  • Image Processing: OpenCV, Pillow (PIL)
  • OCR: Tesseract OCR

🧑‍💻 Installation Guide

Prerequisites

  1. Python 3.7+
  2. Tesseract OCR:

Steps to Run Locally

  1. Clone the repository:

    git clone https://github.com/HariPasapuleti/Text-Extractor.git cd Text-Extractor
  2. Install dependencies:

    pip install -r requirements.txt
  3. Configure Tesseract path:

    • Windows:
      pytesseract.pytesseract.tesseract_cmd = r"C:\Program Files\Tesseract-OCR\tesseract.exe"
    • Linux/Mac: No changes needed (default path).
  4. Start the Flask server:

    python text_extractor.py

🖥️ Usage

  1. Upload an Image: Accepts .jpg or .png formats.
  2. Preview: Check the image preview before processing.
  3. Extract Text: View extracted text directly on the page.
  4. Download: Save the extracted text as a .txt file.

About

A Flask-based web app that uses OCR technology to extract text from uploaded images. Users can upload images, preview them, and download the extracted text as a file.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •