Python Forum
Search Results
Post Author Forum Replies Views Posted
    Thread: Docx Convert Word Header to Body
Post: RE: Docx Convert Word Header to Body

I am fairly used to pdfminer and have also worked with PyPDF2 and pdfrw. I have to extract each paragraph in the PDF, with as much data as possible: font, size, color, bold, italic etc., as much as ...
CaptainCsaba General Coding Help 3 2,768 Jun-02-2021, 11:43 AM
    Thread: Docx Convert Word Header to Body
Post: Docx Convert Word Header to Body

I am trying to scrape PDF information. I need to collect a lot of information about each paragraph. Right now it seems that the way I can extract all the information I want is by converting the PDF ...
CaptainCsaba General Coding Help 3 2,768 Jun-02-2021, 09:06 AM
    Thread: Running Ajax multiple times on button click?
Post: RE: Running Ajax multiple times on button click?

Nobody? :(
CaptainCsaba General Coding Help 1 2,117 Mar-11-2020, 02:17 PM
    Thread: Running Ajax multiple times on button click?
Post: Running Ajax multiple times on button click?

Hi! I have just started learning Flask. Turns out I need Ajax for my code to work and I can't figure it out. I have an index.html page. I have two buttons on it. The buttons call two different functi...
CaptainCsaba General Coding Help 1 2,117 Mar-10-2020, 12:48 PM
    Thread: IndexError: list index out of range bug?
Post: RE: IndexError: list index out of range bug?

You were absolutely right, it's kind of embarrassing to admit, but I wrote the rownumber = rownumber + 1 outside the loop and that is what caused the error. Thank you for the help!
CaptainCsaba General Coding Help 4 6,364 Oct-18-2019, 12:38 PM
    Thread: IndexError: list index out of range bug?
Post: RE: IndexError: list index out of range bug?

It's a fair question and I have no excuse for it. I wrote the code when I was tired and working late and did not realise that adding the +1s was a waste of time and I should only have changed rownumb...
CaptainCsaba General Coding Help 4 6,364 Oct-18-2019, 10:45 AM
    Thread: IndexError: list index out of range bug?
Post: IndexError: list index out of range bug?

Hello! I have code that gets a string cell which is a date and converts it to a certain date format. The thing is, I get an index out of range error and have no idea why. This is the code: import pandas df = rea...
CaptainCsaba General Coding Help 4 6,364 Oct-18-2019, 06:26 AM
    Thread: Inner Join merging bug?
Post: Inner Join merging bug?

Hi! I have the following very simple code: dfTycho = read_excel('TychoList.xlsx') dfCodes = read_excel('CompaniesCodes.xlsx') dfcomphousecodes = pd.merge(dfTycho, dfCodes, on='CompanyName', how='inn...
CaptainCsaba Data Science 0 1,252 Oct-10-2019, 12:59 PM
    Thread: Reading a copy-protected PDF
Post: RE: Reading a copy-protected PDF

Hey. Unfortunately we don't have permission to use QPDF, although it seemed useful. I tried pikepdf as it should work (since it uses the same code). I used the following code. If I did not include the...
CaptainCsaba General Coding Help 6 6,190 Oct-04-2019, 12:42 PM
    Thread: Reading a copy-protected PDF
Post: RE: Reading a copy-protected PDF

(Oct-03-2019, 12:43 PM)ichabod801 Wrote: This sounds like a management problem to me. Tell your boss you need them to get on corporation X to provide machine readable pdfs, or you will have no choic...
CaptainCsaba General Coding Help 6 6,190 Oct-03-2019, 02:11 PM
    Thread: Reading a copy-protected PDF
Post: Reading a copy-protected PDF

Hi! In our job we use PDF reports that we get from multiple corporate clients. We have one which has very corporate views about handing out information. The PDFs come with almost all of the restricti...
CaptainCsaba General Coding Help 6 6,190 Oct-03-2019, 07:06 AM
    Thread: Too big CSV file management
Post: RE: Too big CSV file management

That was so simple I started to wonder how it did not come to my mind lol. Thank you, it worked like a charm. Had to add UTF-8 encoding to it; in the end it looked like this: with open('AllCompanies.csv...
CaptainCsaba General Coding Help 3 2,203 Sep-10-2019, 05:52 AM
    Thread: Too big CSV file management
Post: Too big CSV file management

Hey! I am pretty new to "pandas" in Python and I would like to ask for some help. I don't think it's complicated, I just can't figure it out. I have a huge CSV file (around 2 gigabytes, 4.4 million l...
CaptainCsaba General Coding Help 3 2,203 Sep-09-2019, 02:30 PM
    Thread: Paste Special with multiple arguments with Dispatch
Post: RE: Paste Special with multiple arguments with Dis...

It's inside the Excel file and I need to make the conversion in Excel with the tools of Excel via Python. Just converting via "int()" did not work. Or do you mean it in another way?
CaptainCsaba General Coding Help 3 5,703 Jun-20-2019, 02:18 PM
    Thread: Paste Special with multiple arguments with Dispatch
Post: Paste Special with multiple arguments with Dispatc...

Hi! I am trying to change some cells in Excel with Python. There is an error that sometimes occurs in column K. Basically the "The number in this cell is formatted as text or preceded by an apostrop...
CaptainCsaba General Coding Help 3 5,703 Jun-20-2019, 12:40 PM
    Thread: How "continue" in another indentation?
Post: RE: How "continue" in another indentation?

I don't know why but it worked this way. Thank you!
CaptainCsaba General Coding Help 6 3,046 May-07-2019, 08:17 AM
    Thread: How "continue" in another indentation?
Post: RE: How "continue" in another indentation?

Thank you for the reply Nuzvee. I can see the logic and it is a good idea but it does not work for some reason.
CaptainCsaba General Coding Help 6 3,046 May-03-2019, 01:13 PM
    Thread: How "continue" in another indentation?
Post: How "continue" in another indentation?

Hey! I have the following part of my code. It basically does lots of things with PDFs in folders. For reasons I needed to add a criterion where if a folder has more than 100 PDFs it should be left out. ...
CaptainCsaba General Coding Help 6 3,046 May-03-2019, 12:17 PM
    Thread: Re.search misses string end
Post: Re.search misses string end

Hi! I have a strange problem. I have a PDF that I convert into a string using PDFminer. I change every "\n" to nothing (because sometimes they appear at code-breaking places). I then search for a substr...
CaptainCsaba General Coding Help 3 3,293 Apr-01-2019, 02:12 PM
    Thread: Where is the wrong indentation?
Post: Where is the wrong indentation?

Hi! I have the following code. I don't know how PDFminer works so the first part is somebody else's code which I modified a bit. It seems to work but does everything 8 or more times instead of just one...
CaptainCsaba General Coding Help 2 2,285 Mar-05-2019, 03:13 PM
