AbhishekMudaraddi / Google-Vision-OCR

Extract text from PDFs using the Google Vision API. This script converts PDF pages to images, preprocesses them to improve OCR accuracy, and sends them to the Google Vision API for text extraction. It processes pages in parallel for efficiency and saves the extracted text in a structured format for each PDF.

ocr vision google-vision-api google-cloud-vision googlevisionapi ocr-python text-extraction-from-image

Updated Sep 19, 2024
Python
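
The pipeline described in this entry (OCR pages in parallel, then save one structured text file per PDF) can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: `ocr_pages`, `save_pdf_text`, and `fake_recognize` are hypothetical names, the per-page header format is an assumption, and the real Google Vision call is replaced by a stand-in so the sketch runs with the standard library alone.

```python
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from typing import Callable, Iterable, List


def ocr_pages(
    pages: Iterable[bytes],
    recognize: Callable[[bytes], str],
    max_workers: int = 4,
) -> List[str]:
    """Run `recognize` over page images in parallel, preserving page order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Executor.map yields results in input order, so page N stays page N.
        return list(pool.map(recognize, pages))


def save_pdf_text(pdf_name: str, page_texts: List[str], out_dir: Path) -> Path:
    """Write one text file per PDF, with a header line before each page."""
    out_dir.mkdir(parents=True, exist_ok=True)
    out_path = out_dir / f"{Path(pdf_name).stem}.txt"
    body = "\n\n".join(
        f"--- Page {number} ---\n{text}"
        for number, text in enumerate(page_texts, start=1)
    )
    out_path.write_text(body, encoding="utf-8")
    return out_path


def fake_recognize(image: bytes) -> str:
    # Hypothetical stand-in for a real OCR request; it only decodes the bytes.
    return image.decode("utf-8").upper()
```

A thread pool is a reasonable fit here because each OCR request to a remote service is I/O-bound; in a real setup, `fake_recognize` would be swapped for a function that sends the page image to the OCR service and returns its text.
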

deepeshdm / Textract-Web-App

A simple web interface that extracts text from images and PDF files using the AWS Textract service

aws natural-language-processing aws-s3 text-recognition aws-textract text-extraction-from-image text-recognition-from-image

Updated May 5, 2024
Python

kanchan2803 / ImgToText

This repository contains code for a simple application that detects text in images using Python and optical character recognition (OCR), with Streamlit providing a user-friendly web interface. The application allows users to upload images or capture them via camera input and extracts text present

computer-vision text-extraction tesseract-ocr streamlit-webapp text-extraction-from-image

Updated Jun 3, 2024
Python

ricochetservice / Gemma3_OCR_Text_Extractor_LLM

Gemma-3 OCR combines computer vision and NLP, using the Gemma-3 Vision model for OCR and for cleaning up the extracted text. Built on Streamlit and Ollama, this self-contained system converts images into readable, markdown-rendered output, with an emphasis on accuracy and confidentiality.

ocr base64 deep-learning image-processing transformers pillow text-extraction ocr-recognition streamlit text-extraction-from-image llm vision-language-model ollama gemma3

Updated Oct 23, 2025
Python

udit-asopa / vision-text-extractor

Extract text from images using multiple AI providers - local SmolVLM, Ollama LLaVA, or OpenAI GPT-4o

python ocr computer-vision text-extraction openai pixi opencv-python cli-tool ocr-python huggingface huggingface-models gpt-4 text-extraction-from-image vision-ai langchain llava langchain-python ollama

Updated Oct 16, 2025
Python

Gauff / TextProcessing

Text extraction, transcription, punctuation restoration, translation, summarization and text to speech from almost any file type

python cli text-to-speech translator ocr transcoding text-extraction text-processing transcription summarizer file-downloader punctuation-restoration text-extraction-from-image llm

Updated Nov 5, 2024
Python

SD7Campeon / Gemma3_OCR_Text_Extractor_LLM

Gemma-3 OCR combines computer vision and NLP, using the Gemma-3 Vision model for OCR and for cleaning up the extracted text. Built on Streamlit and Ollama, this self-contained system converts images into readable, markdown-rendered output, with an emphasis on accuracy and confidentiality.