A Python library for extracting tables from PDF documents using computer vision and image processing techniques. It converts PDF pages to images, detects tables, recognizes their structure, and outputs clean data in JSON format.
Topics: python, image-processing, clean-architecture, data-extraction, document-analysis, table-extraction, pdf-parser, document-processing, pdf-mining, table-recognition, table-detection, pdf-processing, grid-detection
Updated Oct 18, 2025 · Python
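
The description names a pipeline that ends with structure recognition and JSON output. As a rough illustration of that last stage only, here is a minimal sketch of grid-based structure recognition. It is not this library's API: the function names (`cluster`, `build_table`, `table_to_json`) and the input format are hypothetical, and it assumes an earlier detection step has already produced the positions of the table's ruling lines and the center points of recognized words.

```python
import json
from bisect import bisect_right


def cluster(values, tol=3):
    """Merge nearly equal coordinates (e.g. a thick line detected twice)."""
    groups = []
    for v in sorted(values):
        if groups and v - groups[-1][-1] <= tol:
            groups[-1].append(v)
        else:
            groups.append([v])
    # Represent each group by its mean position.
    return [sum(g) / len(g) for g in groups]


def build_table(h_lines, v_lines, words, tol=3):
    """Place words into the grid cells bounded by the ruling lines.

    h_lines: y coordinates of horizontal lines (hypothetical input)
    v_lines: x coordinates of vertical lines (hypothetical input)
    words:   (text, x_center, y_center) tuples (hypothetical input)
    """
    ys = cluster(h_lines, tol)
    xs = cluster(v_lines, tol)
    n_rows, n_cols = len(ys) - 1, len(xs) - 1
    if n_rows < 1 or n_cols < 1:
        return []  # fewer than two lines on an axis: no closed cells
    cells = [[[] for _ in range(n_cols)] for _ in range(n_rows)]
    # Visit words top-to-bottom, then left-to-right, to keep reading order.
    for text, x, y in sorted(words, key=lambda w: (w[2], w[1])):
        r = bisect_right(ys, y) - 1
        c = bisect_right(xs, x) - 1
        if 0 <= r < n_rows and 0 <= c < n_cols:
            cells[r][c].append(text)
    return [[" ".join(cell) for cell in row] for row in cells]


def table_to_json(rows):
    """Serialize the recognized rows as a JSON document."""
    return json.dumps({"rows": rows}, ensure_ascii=False)


if __name__ == "__main__":
    h = [10, 11, 50, 90]       # the top line was detected twice
    v = [5, 100, 101, 200]     # so was the middle vertical line
    w = [("Name", 20, 30), ("Qty", 120, 30),
         ("Widget", 20, 70), ("4", 120, 70)]
    print(table_to_json(build_table(h, v, w)))
```

Clustering the line coordinates first keeps a single thick ruling line from producing a spurious zero-width row or column. Tables without ruling lines would need a different strategy, such as grouping words by whitespace gaps.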