Build software better, together

Official Python client library for Nutrient Document Web Services API - PDF processing, OCR, watermarking, and document manipulation with automatic Office format conversion

python pdf-converter pdf-generation pdf-document-processor ocr-python pdf-processing

Updated Oct 13, 2025
Python

Govind-S-B / pdf-to-text-chroma-search

Star

Python scripts that converts PDF files to text, splits them into chunks, and stores their vector representations using GPT4All embeddings in a Chroma DB. It also provides a script to query the Chroma DB for similarity search based on user input.

text-extraction similarity-search pdf-processing vector-embeddings chromadb

Updated Oct 23, 2023
Python

tetratensor / ML-powered_resume_analyser

Star

Local, privacy-friendly resume analysis: convert, classify, and get advice using TF‑IDF, Logistic Regression, and sentence-transformer embeddings.

python nlp data-science machine-learning text-classification sklearn kaggle-dataset resume-analysis pdf-processing resume-screening sentence-transformers

Updated Sep 24, 2025
Python

ranguy9304 / LangGraphRAG

Star

LangGraphRAG: A terminal-based Retrieval-Augmented Generation system using LangGraph. Features include message history caching, query transformation, and vector database retrieval. Ideal for NLP researchers and developers working on advanced conversational AI and information retrieval systems.

python natural-language-processing information-retrieval chatbot web-scraping nlp-machine-learning rag terminal-application pdf-processing vector-database openai-api langgraph

Updated Jul 13, 2024
Python

Remy2404 / Polymind

Star

A powerful, multi-modal Telegram bot leveraging cutting-edge AI technologies including Gemini, DeepSeek, OpenRouter, and 50+ AI models for comprehensive conversational assistance, media processing, and collaborative features with MCP (Model Context Protocol) integration.

telegram-bot voice image-processing voice-recognition gemini multi-model pdf-processing ai-assistant openrouter mermiad deepseek-r1

Updated Oct 16, 2025
Python

DioCrafts / ai-book-summarizer

Star

📚 AI-Powered Book EPUB Knowledge Extractor & Summarizer Transform your PDF books into structured knowledge effortlessly! This tool leverages AI to analyze books page by page, extracting key insights, definitions, and concepts, and organizes them into Markdown summaries for easier study

python markdown pdf machine-learning natural-language-processing automation ai text-analysis openai text-summarization document-analysis study-materials pymupdf knowledge-extraction pdf-processing book-summary educational-tools pdf-summarization ai-powered-tools

Updated Sep 28, 2025
Python

Inc44 / MaTools

Sponsor

Star

An all-in-one GUI management toolkit built with PyQt6, offering a suite of tools for file synchronization, media organization, PDF merging, code formatting, and more.

python rust productivity application gui qt ocr image-processing video-processing speech-recognition youtube-downloader file-management audio-processing pdf-processing code-formatting

Updated Oct 19, 2025
Python

noorjotk / local-rag-engine

Star

Local RAG app with zero-config Docker setup. FastAPI + Streamlit + Qdrant + Ollama. Just run `docker-compose up --build`! 🚀

python docker semantic-search rag fastapi pdf-processing privacy-focused streamlit vector-database qdrant llm qdrant-vector-database local-llm local-ai ollama local-ollama

Updated Jul 26, 2025
Python

AkshayG999 / MistralOCR---AI-Powered-Document-Extraction

Star

MistralOCR is an open-source application that transforms documents into structured data using Mistral AI's OCR capabilities. Built with FastAPI and Streamlit, it provides an intuitive interface for extracting and processing text from PDFs and images, making document digitization effortless and accurate.

Updated Mar 11, 2025
Python

Alijanloo / Pdf2Table

Star

A Python library for extracting tables from PDF documents using computer vision and image processing techniques. It converts PDF pages to images, detects tables, recognizes their structure, and outputs clean data in JSON format.

python image-processing clean-architecture data-extraction document-analysis table-extraction pdf-parser document-processing pdf-mining table-recognition table-detection pdf-processing grid-detection

Updated Oct 18, 2025
Python

Aleptonic / PdfSnipper

Star

PdfSnipper is a lightweight and efficient Python package designed to simplify the management of PDF files, pages, and their conversions during various NLP, Computer Vision (CV), or other data processing tasks. The package eliminates the need for repetitive code by providing intuitive, ready-to-use functions for common PDF-related operations.

utilities pdf-processing nlp-tools

Updated Feb 3, 2025
Python

arsath-eng / RAG1-NVIDIA-GENAI

Star

A powerful Retrieval Augmented Generation (RAG) application built with NVIDIA AI endpoints and Streamlit. This solution enables intelligent document analysis and question-answering using state-of-the-art language models, featuring multi-PDF processing, FAISS vector store integration, and advanced prompt engineering.

embeddings question-answering document-analysis faiss rag pdf-processing streamlit llm langchain vector-store nvidia-ai-faundry llama-models

Updated Oct 31, 2024
Python

Yardenrsk / PsychometryReceiverCV

Star

A side project to easily get and annotate questions and answers to the PsychometryBot project DB using computer vision and pdf parsing

pandas opencv-python pdf-processing

Updated Sep 18, 2022
Python

thinhuos0913 / python_useful_mini_projects

Star

This is some useful mini projects that I had worked for self-learning Python programming.

python opencv ocr image-processing pdf-processing

Updated May 20, 2024
Python

UjjwalSaini07 / OllamaMulti-RAG

Sponsor

Star

OllamaMulti-RAG 🚀 is a multimodal AI chat app combining Whisper AI for audio, LLaVA for images, and Chroma DB for PDFs, enhanced with Ollama and OpenAI API. 📄 Built for AI enthusiasts, it welcomes contributions—features, bug fixes, or optimizations—to advance practical multimodal AI research and development collaboratively.

openai chat-application trend whisper trending-topics image-understanding rag pdf-processing audio-transcription vector-database ai-chatbot llm langchain ollama knowledge-retrieval llava-llama3 multimodal-ai

Updated Sep 5, 2025
Python

VisionExpo / QA-System-using-Gemini-Pro-API

Star

A powerful Q&A system using Google's Gemini Pro API with vector storage (AstraDB) and LLM monitoring. Supports text, images, PDFs, DOCXs, URLs, and YouTube videos.

nlp flask machine-learning chatbot question-answering text-processing image-analysis document-processing multimodal pdf-processing vector-database sentence-transformers ai-assistant astradb llm generative-ai langsmith gemini-pro-api youtube-processing

Updated Apr 24, 2025
Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pdf-processing

Here are 117 public repositories matching this topic...

allenai / papermage

ahmedkhemiri95 / PDFs-TextExtract

postralai / masquerade

aws-samples / document-processing-pipeline-for-regulated-industries

PSPDFKit / nutrient-dws-client-python

Govind-S-B / pdf-to-text-chroma-search

tetratensor / ML-powered_resume_analyser

ranguy9304 / LangGraphRAG

Remy2404 / Polymind

DioCrafts / ai-book-summarizer

Inc44 / MaTools

noorjotk / local-rag-engine

AkshayG999 / MistralOCR---AI-Powered-Document-Extraction

Alijanloo / Pdf2Table

Aleptonic / PdfSnipper

arsath-eng / RAG1-NVIDIA-GENAI

Yardenrsk / PsychometryReceiverCV

thinhuos0913 / python_useful_mini_projects

UjjwalSaini07 / OllamaMulti-RAG

VisionExpo / QA-System-using-Gemini-Pro-API

Improve this page

Add this topic to your repo