Python Forum
Convert From PDF into JPEG Problem
#1
I recently ran into an issue when converting PDF files to JPG: the script stops at file number 52 and does not continue.

As far as I can tell, PdfReader is not what causes the problem. I hope someone can help. Thanks.

#!/usr/bin/python
import os
import base64
import warnings
import pandas as pd
import pypdfium2 as pdfium
from pypdf import PdfReader
from PIL import Image

def into_jpg(file, file_loc):
    try:
        write_loc = '/home/pi/code/images/train'
        new_file_name = str(base64.b64encode(str.encode(file.split('.')[0]))[:15], 'utf-8')
        new_file_loc = os.path.join(write_loc, new_file_name + '.jpg')
        new_count = 1
        pdfs = pdfium.PdfDocument(file_loc)
        n_pages = len(pdfs)
        for page_num in range(len(pdfs)):
            page = pdfs.get_page(page_num)
            pil_image = page.render(
                scale = 1,
                rotation = 0,
                crop = (0, 0, 0, 0)
                #optimise_mode = pdfium.OptimiseMode.NONE
            ).to_pil()
            pil_image.save(os.path.join(write_loc, f'{new_file_name}_{page_num+1}.jpg'))
            new_file_loc = os.path.join(write_loc, new_file_name + '_' + str(page_num) + '.jpg')
    except Exception as exp:
        print(exp)
    return

file_dir = '/home/pi/code/ma_1'
file_loc = None
count = 1
for file in os.listdir(file_dir):
    if os.path.isdir(os.path.join(file_dir, file)):
        for file_ in os.listdir(os.path.join(file_dir, file)):
            file_loc = os.path.join(os.path.join(file_dir, file), file_)
            IsProcess = True
            while IsProcess:
                if os.path.isfile(file_loc):
                    try:
                        PdfReader(file_loc)
                        print('File Location_{0}:'.format(count), file_loc)
                        into_jpg(file_, file_loc)
                        IsProcess = False
                    except Exception as exp:
                        print(exp)
            count = count + 1

#2
Problem solved: the while loop was causing the hang.
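For anyone hitting the same hang: in the script above, IsProcess is only set to False after a successful conversion, so a path that is not a regular file, or a PDF that PdfReader rejects, keeps the while loop spinning forever. Below is a minimal sketch of one way to avoid that by trying each file exactly once. The names process_tree and convert are hypothetical and are not taken from the original fix, which was not posted.

```python
import os

def process_tree(root, convert):
    """Call convert(name, path) once for every file one level below root.

    Returns (converted, failed) lists of paths. A file that raises is
    recorded and skipped instead of being retried forever.
    """
    converted, failed = [], []
    for sub in sorted(os.listdir(root)):
        sub_path = os.path.join(root, sub)
        if not os.path.isdir(sub_path):
            continue
        for name in sorted(os.listdir(sub_path)):
            path = os.path.join(sub_path, name)
            if not os.path.isfile(path):
                continue  # nested directories etc. are skipped, not retried
            try:
                convert(name, path)
                converted.append(path)
            except Exception as exp:
                print(f'Skipping {path}: {exp}')
                failed.append(path)
    return converted, failed
```

With the script above this could be called as process_tree('/home/pi/code/ma_1', into_jpg). Note that into_jpg catches its own exceptions, so a failure inside it would only show up in the failed list if it re-raised.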
#3
I often split PDFs to jpgs for OMR. This works well for me.

import pdf2image

# can use glob to get a list of pdfs and loop through the list
pdf = "2_cv外贸助理.pdf"
# for saving various pdfs
path2pdf = '/home/pedro/babystuff/'
# for saving split pdfs as jpg
path2jpg = '/home/pedro/babystuff/pdf2jpg/'

def splitPDF(aPDF, source, destination):
    print(f'Splitting {source + aPDF} to individual jpgs ... ')
    outputName = aPDF.split('.')
    savename = outputName[0]
    # images is a list
    images = pdf2image.convert_from_path(source + aPDF)
    i=1
    for image in images:
        image.save(destination + savename + str(i) + '.jpg', 'JPEG')
        i+=1
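The comment about using glob to loop over a list of PDFs could look roughly like the sketch below. The helper names find_pdfs, jpg_name and split_all are hypothetical, and the sketch assumes pdf2image (with the Poppler tools it depends on) is installed.

```python
import glob
import os

def find_pdfs(folder):
    """Return the sorted paths of all .pdf files directly inside folder."""
    return sorted(glob.glob(os.path.join(folder, '*.pdf')))

def jpg_name(pdf_path, page_no, destination):
    """Build the output path for one page: <destination>/<stem>_<page_no>.jpg."""
    stem = os.path.splitext(os.path.basename(pdf_path))[0]
    return os.path.join(destination, f'{stem}_{page_no}.jpg')

def split_all(source, destination):
    """Convert every PDF in source to one JPEG per page in destination."""
    import pdf2image  # imported here so the helpers above work without it
    os.makedirs(destination, exist_ok=True)
    for pdf_path in find_pdfs(source):
        images = pdf2image.convert_from_path(pdf_path)  # list of PIL images
        for page_no, image in enumerate(images, start=1):
            image.save(jpg_name(pdf_path, page_no, destination), 'JPEG')
```

This version uses os.path.splitext rather than split('.')[0], so a name such as report.v2.pdf keeps its full stem, and os.path.join means the folder paths do not need a trailing slash.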
#4
(Sep-04-2023, 08:42 AM)Pedroski55 Wrote: I often split PDFs to jpgs for OMR. This works well for me.


hm. Tq.
#5
(Sep-05-2023, 06:19 AM)koklimabc Wrote: hm. Tq.
This may be a cultural difference, but please bear in mind that some forum members, like me, don't understand your abbreviations. Searching for HMTQ online, I find only "Her Majesty The Queen", which is probably not what you meant.