An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
Updated Oct 6, 2025 - Python
Filter sensitive information from free text before sending it to external services or APIs, such as chatbots and LLMs.
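As a rough illustration of this kind of pre-send filtering, the sketch below replaces matches with placeholders using two regular expressions. The patterns, the `PATTERNS` table, and the `redact` function are hypothetical and written for this example; they are not taken from any repository listed here, and a real detector would need much broader coverage.

```python
import re

# Hypothetical patterns for illustration only; they cover a narrow set of formats.
PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "PHONE": re.compile(r"\+?\d[\d\s().-]{7,}\d"),
}


def redact(text: str) -> str:
    """Replace every match of each pattern with a placeholder such as <EMAIL>."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"<{label}>", text)
    return text


# Filter the text before it leaves the process (e.g. before an API call).
prompt = redact("Contact jane.doe@example.com or call +1 (555) 010-2030.")
print(prompt)
```

Only the filtered string would then be passed to the external service; the original text stays local.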
Python CLI tool to redact and un-redact sensitive data from text files. 🔐📝
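One way reversible redaction can work is to swap each sensitive value for a unique token and keep a token-to-value mapping for later restoration. The sketch below assumes that approach; `redact_reversible`, `unredact`, and the token format are hypothetical names for this example and do not describe how the listed tool is implemented.

```python
import re

# Illustrative pattern: only email addresses are treated as sensitive here.
EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def redact_reversible(text):
    """Return (redacted_text, mapping); mapping is needed to restore the originals."""
    mapping = {}

    def _swap(match):
        # Each match gets its own numbered token so it can be restored exactly.
        token = f"[REDACTED_{len(mapping)}]"
        mapping[token] = match.group(0)
        return token

    return EMAIL.sub(_swap, text), mapping


def unredact(text, mapping):
    """Put the original values back in place of their tokens."""
    for token, original in mapping.items():
        text = text.replace(token, original)
    return text
```

In this design the mapping is as sensitive as the original text, so it would need to be stored securely and separately from the redacted file.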
Maskwise detects, redacts, masks, and anonymizes sensitive data across text, images, and structured data in training datasets for LLM systems. Powered by Microsoft Presidio.
Rust derive macro for redacting sensitive data in std::fmt::Debug output
Resources for programmatically redacting personally identifiable information
Redactify is an efficient data redaction tool that secures sensitive text using advanced NLP and rule-based methods. It combines transformer-based NER, regex, and Presidio analysis to detect personal information and mask it through full redaction or partial masking, supporting compliance while preserving data utility.
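The difference between full redaction and partial masking can be shown with a minimal sketch. The functions `full_redact` and `partial_mask` and their parameters are hypothetical names chosen for this example, not Redactify's API, and detection of the sensitive value is assumed to have happened already.

```python
def full_redact(value: str, label: str = "REDACTED") -> str:
    """Full redaction: discard the value entirely and leave only a label."""
    return f"[{label}]"


def partial_mask(value: str, visible: int = 4, mask_char: str = "*") -> str:
    """Partial masking: hide everything except the last `visible` characters."""
    if visible <= 0 or len(value) <= visible:
        # Too short to reveal anything safely, so mask the whole value.
        return mask_char * len(value)
    return mask_char * (len(value) - visible) + value[-visible:]


print(full_redact("4111111111111111"))
print(partial_mask("4111111111111111"))
```

Partial masking keeps some utility (for example, letting a user recognize which card is meant), while full redaction removes the value altogether.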
TrueSight is an innovative NLP and Vision-based redaction tool that allows user-defined gradational redaction, masking, and anonymization across multiple input formats, including text, images, PDFs, and videos. TrueSight is designed for high security, ensuring that no input data is stored or retrievable, making it an ideal solution for sensitive da