Repo Serializer

A Python utility for serializing local Git repositories into a structured text file, capturing the directory structure (in ASCII format), file names, and contents of source files. Ideal for providing a comprehensive snapshot of a repository for code review, documentation, or interaction with large language models (LLMs).

Installation

# Install from PyPI pip install repo-serializer # For development pip install "repo-serializer[dev]"

Usage

Command Line

# Basic usage repo-serializer /path/to/repository # Specify output file repo-serializer /path/to/repository -o output.txt # Copy to clipboard in addition to saving to file repo-serializer /path/to/repository -c # Use structure-only mode to output only the directory structure and filenames repo-serializer /path/to/repository -s # Skip specific directories (can be used multiple times or as a comma-separated list) repo-serializer /path/to/repository --skip-dir build,dist repo-serializer /path/to/repository --skip-dir build --skip-dir dist # Only include Python files (.py, .ipynb) repo-serializer /path/to/repository --python # Only include JavaScript/TypeScript files repo-serializer /path/to/repository --javascript # Combine with other options repo-serializer /path/to/repository --python -s -c # Python files, structure only, copy to clipboard # Extract AI/LLM prompts from the repository repo-serializer /path/to/repository -p repo-serializer /path/to/repository --prompt -o prompts.txt

Python API

from repo_serializer import serialize # Serialize a repository, skipping specific directories serialize("/path/to/repository", "output.txt", skip_dirs=["build", "dist"])

Features

Directory Structure: Clearly visualize repository structure in ASCII format.
Structure-Only Mode: Option to output only the directory structure and filenames without file contents.
File Filtering: Excludes common binary files, cache directories, hidden files, and irrelevant artifacts to keep outputs concise and focused.
Smart Content Handling:
- Parses Jupyter notebooks to extract markdown and code cells with sample outputs
- Limits CSV files to first 5 lines
- Truncates large text files after 1000 lines
- Handles non-UTF-8 and binary files gracefully
Extensive Filtering: Skips common configuration files, build artifacts, test directories, and more.
Clipboard Integration: Option to copy output directly to clipboard.
Prompt Extraction: Extract AI/LLM prompts from various sources:
- Inline strings in Python/JavaScript code near LLM API calls (OpenAI, Anthropic, etc.)
- Standalone prompt files (.prompt.txt, .prompt.md, etc.)
- YAML/JSON configuration files with prompt definitions
- Jupyter notebooks containing prompts
- Markdown files in prompt-related directories

Example

# Create a serialized snapshot of your project repo-serializer /Users/example_user/projects/my_repo -o repo_snapshot.txt

Prompt Extraction

The prompt extraction feature (-p or --prompt) helps you analyze and audit AI/LLM prompts used throughout your codebase. This is particularly useful for:

Reviewing prompt consistency and quality
Refactoring duplicate prompts
Creating prompt libraries
Documenting AI behavior
Security auditing of prompts

What Gets Extracted

Inline Prompts in Code
- Strings near OpenAI, Anthropic, and other LLM API calls
- Multi-line strings assigned to prompt-related variables
- Template literals in JavaScript/TypeScript
Standalone Prompt Files
- Files with extensions like .prompt.txt, .prompt.md
- Files in prompts/ or similar directories
- YAML/JSON files with prompt configurations
Configuration Files
- YAML/JSON files with keys like prompt, system_prompt, instructions
- Nested prompt definitions in configuration objects

Example Output

Found 5 prompts in the repository: ================================================================================ File: src/ai_agent.py -------------------------------------------------------------------------------- Line 15 (inline_string) - Near LLM API call:

You are an expert Python developer. Your task is to help users write clean, efficient, and well-documented Python code. Follow these guidelines:

Always use proper error handling
Write comprehensive docstrings
Follow PEP 8 style guidelines

 File: config/prompts.yaml -------------------------------------------------------------------------------- Line 3 (config_file) - Key: system:

You are a code reviewer. Analyze the provided code for:

Security vulnerabilities
Performance issues
Code quality and maintainability

Contributing

Pull requests and improvements are welcome! Please ensure your contributions are clearly documented and tested.

Development

Quick Testing

For quick testing during development:

# Install in development mode pip install -e . # Now any changes to the source code take effect immediately repo-serializer /path/to/test/repo -o test_output.txt

Full Test Suite

Run the test script:

./dev/test_dev.py

This will:

Install the package in development mode
Run multiple test scenarios
Generate test outputs for review

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.github/workflows		.github/workflows
dev		dev
repo_serializer		repo_serializer
tests		tests
.gitignore		.gitignore
README.md		README.md
dev-setup.sh		dev-setup.sh
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
setup.py		setup.py
test.sh		test.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Repo Serializer

Installation

Usage

Command Line

Python API

Features

Example

Prompt Extraction

What Gets Extracted

Example Output

Contributing

Development

Quick Testing

Full Test Suite

About

Uh oh!

Releases 12

Packages

Uh oh!

Languages

marutilai/repo-serializer

Folders and files

Latest commit

History

Repository files navigation

Repo Serializer

Installation

Usage

Command Line

Python API

Features

Example

Prompt Extraction

What Gets Extracted

Example Output

Contributing

Development

Quick Testing

Full Test Suite

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 12

Packages 0

Uh oh!

Languages

Packages