Python library to interact with the PDFTables.com API.
PIP:
pip install git+https://github.com/pdftables/python-pdftables-api.git Locally:
python setup.py install import pdftables_api c = pdftables_api.Client('my-api-key') c.xlsx('input.pdf', 'output.xlsx')python -m unittest test.test_pdftables_api If you are converting a large document (hundreds or thousands of pages), you may want to increase the timeout.
Here is an example of the sort of error that might be encountered:
ReadTimeout: HTTPSConnectionPool(host='pdftables.com', port=443): Read timed out. (read timeout=300) The below example allows 60 seconds to connect to our server, and 1 hour to convert the document:
import pdftables_api c = pdftables_api.Client('my-api-key', timeout=(60, 3600)) c.xlsx('input.pdf', 'output.xlsx')