edspdf-mupdf provides a MuPDF-based PDF parser component for EDS-PDF.
Beware: MuPDF is AGPL-licensed, so edspdf-mupdf is also AGPL-licensed, and any model that depends on this component must be AGPL-licensed as well.
Install the library with pip:
    $ pip install edspdf-mupdf

Then add the extractor to an EDS-PDF pipeline and call it on the raw bytes of a PDF:

    from edspdf import Pipeline

    model = Pipeline()
    model.add_pipe('mupdf-extractor')
    model(pdf_bytes)

Visit the documentation for more information!
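The usage snippet assumes a `pdf_bytes` variable already holds the content of a PDF. One way to obtain it is sketched below. The helper names `load_pdf_bytes` and `extract` are hypothetical and not part of edspdf or edspdf-mupdf, and the header check is an optional sanity test rather than something the library requires.

```python
from pathlib import Path


def load_pdf_bytes(path):
    """Read a file and return its raw bytes, checking that it looks like a PDF."""
    data = Path(path).read_bytes()
    # PDF files normally begin with the "%PDF-" marker; fail early otherwise.
    if not data.startswith(b"%PDF-"):
        raise ValueError(f"{path} does not look like a PDF file")
    return data


def extract(path):
    """Run the pipeline from the usage snippet on one file (needs edspdf-mupdf)."""
    # Imported lazily so that load_pdf_bytes works without edspdf installed.
    from edspdf import Pipeline

    model = Pipeline()
    model.add_pipe("mupdf-extractor")
    return model(load_pdf_bytes(path))
```

Calling `extract("report.pdf")` (the file name is only an illustration) would then be equivalent to the snippet above with `pdf_bytes` read from disk.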
We would like to thank Assistance Publique – Hôpitaux de Paris and the AP-HP Foundation for funding this project.