Python Forum
Concatenate multiple PDFs using python
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Concatenate multiple PDFs using python
#1
I wrote a code to merge pdf files together
Following is my code:
def merge_pdf(pdf_path,output_pdf):
    ## Create Logger
    logging.basicConfig(filename='LogFile.log',format='%(asctime)s %(message)s',filemode='a')

    logger=logging.getLogger()
    logger.setLevel(logging.DEBUG)
    try:
        os.chdir(pdf_path)
    except:
        logger.error(f"{pdf_path} does not exist")
        return f"{pdf_path} does not exist"

    ###  Get PDF list from the folder
    pdfmerge=[]
    for pdf_file in os.listdir('.'):
        if pdf_file.endswith('.pdf'):
            pdfmerge.append(pdf_file)

    pdfWriter = PyPDF2.PdfFileWriter()

    ###  loop through all PDFs
    for pdf_file in pdfmerge:
        pdfFileObj=open(pdf_file,'rb')
        pdfReader = PyPDF2.PdfFileReader(pdfFileObj)
    ## Open pages for pdf and add them
        for pageNum in range(pdfReader.numPages):
            pageObj=pdfReader.getPage(pageNum)
            pdfWriter.addPage(pageObj)
    try:
        ###  Save PDF
        pdfOutput=open(output_pdf+'.pdf','wb')

        pdfWriter.write(pdfOutput)
        pdfOutput.close()
        logger.debug(f"Successfully concatenated PDFs")
        return "Success"
    except:
        logger.error(f"File Not Created \n Check '{output_pdf}'")
        return f"File Not Created \n Check '{output_pdf}'"

## Enter input folder containing PDFs and Output path to Save COncatenated PDF
merge_pdf(pdf_path=r'input path\PDFs',output_pdf=r'output path\PDFs\Output')
The code is a user input code and is working file but if i put an output pdf which already exists it shows the following error:
Error:
Traceback (most recent call last): File "path\Final.py", line 125, in <module> merge_pdf(pdf_path=r'path',output_pdf=r'C:\Users\MUNISH\Desktop\UI Codes\pathPDFs\Output') File "path\Final.py", line 107, in merge_pdf pdfReader = PyPDF2.PdfFileReader(pdfFileObj) File "C:\Python37\lib\site-packages\PyPDF2\pdf.py", line 1084, in __init__ self.read(stream) File "C:\Python37\lib\site-packages\PyPDF2\pdf.py", line 1689, in read stream.seek(-1, 2) OSError: [Errno 22] Invalid argument
I need to change the code so that if i put an output path which already exists, it does not show this error instead goes in the except condition. please help
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Extracting data from bank statement PDFs (Accountant) a4avinash 4 4,916 Feb-27-2025, 01:53 PM
Last Post: griffinhenry
  Concatenate array for 3D plotting armanditod 1 1,282 Mar-21-2024, 08:08 PM
Last Post: deanhystad
  python convert multiple files to multiple lists MCL169 6 3,170 Nov-25-2023, 05:31 AM
Last Post: Iqratech
  Comparing PDFs CaseCRS 5 3,182 Apr-01-2023, 05:46 AM
Last Post: DPaul
  How to concatenate filepath with variable? Mark17 2 9,199 Jan-31-2022, 09:13 PM
Last Post: Mark17
  Concatenate str JohnnyCoffee 2 3,673 May-01-2021, 03:58 PM
Last Post: JohnnyCoffee
  download pubmed PDFs using pubmed2pdf in python Wooki 8 8,290 Oct-19-2020, 03:06 PM
Last Post: jefsummers
  Concatenate two dataframes moralear27 2 2,535 Sep-15-2020, 08:04 AM
Last Post: moralear27
  concatenate a request to the endpoint of OSM-API?! - how to? apollo 0 1,641 Aug-18-2020, 03:21 PM
Last Post: apollo
  How to compare two PDFs for differences Normanie 2 3,168 Jul-30-2020, 07:31 AM
Last Post: millpond

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020