Multilingual PDF Study Copilot

A web app that summarizes and answers questions about PDF documents in multiple languages. It can condense complex texts such as lecture notes, textbooks, or research papers into concise summaries, providing clear insights in the original language of the document.

When you run the app locally, your documents are processed on your own machine and are not sent to an external service.

Try the app here: Study Copilot

Features

Upload PDFs in any language and get summaries or answers in the same language.
Ask questions about the PDF content and receive accurate responses.
Summarizes complex topics into clear, digestible text.
Powered by LangChain, Ollama LLM, and HuggingFace embeddings.
Multilingual support for global usage.

How It Works

PDF Processing: Splits the uploaded PDF into manageable text chunks.
Embedding Generation: Converts each chunk into vector embeddings using HuggingFace.
Retrieval: Chroma vector store retrieves the most relevant chunks for your query.
LLM Response: Ollama LLM generates concise answers or summaries based solely on the PDF content.
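
The first three steps above can be illustrated with a dependency-free toy. This is only a sketch, not the code in app.py: the character-window splitter, its chunk_size and overlap values, and the word-count "embedding" are all assumptions standing in for the real text splitter, the HuggingFace embedding model, and the Chroma store. It shows the shape of the idea: split, vectorize, then rank chunks by similarity to the query.

```python
import math
import re
from collections import Counter


def split_into_chunks(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into character windows; neighbours share `overlap` characters."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this window already reaches the end of the text
        start += step
    return chunks


def embed(text: str) -> Counter:
    """Toy embedding (assumption): lowercase word counts, not a learned model."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the query, best match first."""
    query_vec = embed(query)
    ranked = sorted(
        chunks,
        key=lambda chunk: cosine_similarity(query_vec, embed(chunk)),
        reverse=True,
    )
    return ranked[:k]
```

In the real app, a learned multilingual embedding would replace embed, which is what lets a query match chunks that share meaning rather than exact words.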

Requirements

Python 3.10+
Virtual environment recommended

Install dependencies:

pip install -r requirements.txt

Usage

Clone or download the project.
Create and activate a virtual environment:

python -m venv .venv
source .venv/bin/activate   # macOS/Linux
.venv\Scripts\activate      # Windows

Install dependencies:

pip install -r requirements.txt

Run the app:

python app.py

Open the Gradio interface in your browser:
- Upload a PDF.
- Enter a question about the content.
- Receive a summary or answer in the PDF's language.

Example

Upload a PDF in French about statistical methods.
Ask: "Quels sont les principaux coefficients de régression?"
Receive a concise answer or summary in French.

Notes

Answers and summaries are based on the uploaded content: the LLM is given only the retrieved PDF chunks and is expected to answer from them rather than from outside knowledge.
Supports any language recognized by the underlying LLM.
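
One common way to keep answers grounded in the document is to state both constraints in the prompt itself. The sketch below is hypothetical: build_prompt and its wording are illustrative assumptions, not the prompt used in app.py. It numbers the retrieved chunks, tells the model to answer only from them, and asks for a reply in the document's language.

```python
def build_prompt(question: str, context_chunks: list[str]) -> str:
    """Assemble a grounded prompt from retrieved chunks (hypothetical wording)."""
    # Number each chunk so the model (and a reader) can tell them apart.
    context = "\n\n".join(
        f"[{i}] {chunk.strip()}" for i, chunk in enumerate(context_chunks, start=1)
    )
    return (
        "Answer using only the context below. "
        "If the context does not contain the answer, say that you cannot "
        "find it in the document. "
        "Reply in the same language as the context.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question.strip()}\n"
        "Answer:"
    )
```

A prompt like this narrows what the model is asked to do, but it cannot guarantee that the model never draws on outside knowledge, so it is worth spot-checking answers against the PDF.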

Dependencies

Gradio – Web interface
LangChain – LLM orchestration
LangChain Ollama – LLM backend
HuggingFace Embeddings – Embedding generation
Chroma – Vector store for retrieval
pypdf – PDF parsing
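
Before running the app, it can help to confirm that these packages are importable in the active environment. The following is a minimal sketch using only the standard library. The import names in ASSUMED_MODULES are assumptions about how each dependency is imported (for example, Chroma as chromadb and the HuggingFace embeddings through langchain_huggingface); check them against requirements.txt and app.py.

```python
from importlib.util import find_spec

# Display name -> assumed top-level import name (verify against requirements.txt).
ASSUMED_MODULES = {
    "Gradio": "gradio",
    "LangChain": "langchain",
    "LangChain Ollama": "langchain_ollama",
    "HuggingFace Embeddings": "langchain_huggingface",
    "Chroma": "chromadb",
    "pypdf": "pypdf",
}


def missing_modules(modules: dict[str, str]) -> list[str]:
    """Return the display names whose import name cannot be found."""
    return [label for label, name in modules.items() if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(ASSUMED_MODULES)
    print("Missing:", ", ".join(missing) if missing else "none")
```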

JavieraAlmendrasVilla/Files-summarizer

Folders and files

Latest commit

History

Repository files navigation

Multilingual PDF Study Copilot

Features

How It Works

Requirements

Usage

Example

Notes

Dependencies

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages