Search Results
|
Post |
Author |
Forum |
Replies |
Views |
Posted
[asc]
|
|
|
Thread: Docx Convert Word Header to Body
Post: RE: Docx Convert Word Header to Body
I am fairly used the pdfminer and have also worked with PyPDF2 and pdfrw.
I have to extract each paragraph in the pdf, with as many data as possible: font, size, color, bold, italic etc, as much as ... |
|
CaptainCsaba |
General Coding Help |
3 |
2,768 |
Jun-02-2021, 11:43 AM |
|
|
Thread: Docx Convert Word Header to Body
Post: Docx Convert Word Header to Body
I am trying to scrape Pdf information. I need a to collect a lot of information about each paragraph. Right now it seems that the way I can extract all the information I want is by converting the pdf ... |
|
CaptainCsaba |
General Coding Help |
3 |
2,768 |
Jun-02-2021, 09:06 AM |
|
|
Thread: Running Ajax multiple times on button click?
Post: RE: Running Ajax multiple times on button click?
|
CaptainCsaba |
General Coding Help |
1 |
2,117 |
Mar-11-2020, 02:17 PM |
|
|
Thread: Running Ajax multiple times on button click?
Post: Running Ajax multiple times on button click?
Hi!
I have just started learning Flask. Turns out I need Ajax for my code to work and I canT figure it out. I have and index.html page. I have two buttons on it. The buttons call two different functi... |
|
CaptainCsaba |
General Coding Help |
1 |
2,117 |
Mar-10-2020, 12:48 PM |
|
|
Thread: IndexError: list index out of range bug?
Post: RE: IndexError: list index out of range bug?
You were absolutely right, it's kind of embarrassing to admit, but I wrote the rownumber = rownumber + 1 outside the loop and that is what caused the error. Thank you for the help! |
|
CaptainCsaba |
General Coding Help |
4 |
6,364 |
Oct-18-2019, 12:38 PM |
|
|
Thread: IndexError: list index out of range bug?
Post: RE: IndexError: list index out of range bug?
It's a fair question and I have no excuse for it, I wrote the code when I was tired and working late and did not realise that adding the +1-s was a waste of time and i only should have changed rownumb... |
|
CaptainCsaba |
General Coding Help |
4 |
6,364 |
Oct-18-2019, 10:45 AM |
|
|
Thread: IndexError: list index out of range bug?
Post: IndexError: list index out of range bug?
Hello!
I have a code that gets a string cell which is a date and converts it to a certain date format. The thing is, I am out of index and have no idea why. This is the code:
import pandas
df = rea... |
|
CaptainCsaba |
General Coding Help |
4 |
6,364 |
Oct-18-2019, 06:26 AM |
|
|
Thread: Inner Join merging bug?
Post: Inner Join merging bug?
Hi!
I have the following very simple code:
dfTycho = read_excel('TychoList.xlsx')
dfCodes = read_excel('CompaniesCodes.xlsx')
dfcomphousecodes = pd.merge(dfTycho, dfCodes, on='CompanyName', how='inn... |
|
CaptainCsaba |
Data Science |
0 |
1,252 |
Oct-10-2019, 12:59 PM |
|
|
Thread: Reading a copy-protected PDF
Post: RE: Reading a copy-protected PDF
hey. Unfortunately we don't have permission to use QPDF, although it seemed useful. I tried pikepdf as it should wrok (since it uses the same code). I used the following code. If I did not include the... |
|
CaptainCsaba |
General Coding Help |
6 |
6,190 |
Oct-04-2019, 12:42 PM |
|
|
Thread: Reading a copy-protected PDF
Post: RE: Reading a copy-protected PDF
(Oct-03-2019, 12:43 PM)ichabod801 Wrote: This sounds like a management problem to me. Tell your boss you need them to get on corporation X to provide machine readable pdfs, or you will have no choic... |
|
CaptainCsaba |
General Coding Help |
6 |
6,190 |
Oct-03-2019, 02:11 PM |
|
|
Thread: Reading a copy-protected PDF
Post: Reading a copy-protected PDF
Hi!
In our job we use PDF reports that we get from multiple corporate clients. We have one which has very corporate views about handing out information. The PDFs come with almost all of the restricti... |
|
CaptainCsaba |
General Coding Help |
6 |
6,190 |
Oct-03-2019, 07:06 AM |
|
|
Thread: Too big CSV file management
Post: RE: Too big CSV file management
That was so simple I started to wonder how that not came to my mind lol. Thank you, it worked like a charm. Had to add UTF8 encoding to it ,in the end it looked like this:
with open('AllCompanies.csv... |
|
CaptainCsaba |
General Coding Help |
3 |
2,203 |
Sep-10-2019, 05:52 AM |
|
|
Thread: Too big CSV file management
Post: Too big CSV file management
Hey!
I am pretty new to "pandas" in python and I would like to ask for some help. I don't think it's complicated, I just can't figure it out. I have a huge CSV file (around 2 gigabytes, 4,4 million l... |
|
CaptainCsaba |
General Coding Help |
3 |
2,203 |
Sep-09-2019, 02:30 PM |
|
|
Thread: Paste Special with multiple arguments with Dispatch
Post: RE: Paste Special with multiple arguments with Dis...
It's inside the excel file and I need to make the conversion in excel with the tools of excel via Python. Just converting the via "int()" did not work. Or do you mean it in another way? |
|
CaptainCsaba |
General Coding Help |
3 |
5,703 |
Jun-20-2019, 02:18 PM |
|
|
Thread: Paste Special with multiple arguments with Dispatch
Post: Paste Special with multiple arguments with Dispatc...
Hi!
I am trying to change some cells in excel with python. There is an error that sometimes occures in column K. Basically the "The number in this cell is formatted as text or preceded by an apostrop... |
|
CaptainCsaba |
General Coding Help |
3 |
5,703 |
Jun-20-2019, 12:40 PM |
|
|
Thread: How "continue" in another indentation?
Post: RE: How "continue" in another indentation?
I don't know why but it worked this way. Thank you! |
|
CaptainCsaba |
General Coding Help |
6 |
3,046 |
May-07-2019, 08:17 AM |
|
|
Thread: How "continue" in another indentation?
Post: RE: How "continue" in another indentation?
Thank you for the reply Nuzvee. I can see the logic and it is a good idea but it does not work for some reason. |
|
CaptainCsaba |
General Coding Help |
6 |
3,046 |
May-03-2019, 01:13 PM |
|
|
Thread: How "continue" in another indentation?
Post: How "continue" in another indentation?
hey!
I have the following part of a code. It basically does lots of things with PDFs in folders. For reasons I needed to add a criteria wheer if a folder has more than 100 PDF it should be left out. ... |
|
CaptainCsaba |
General Coding Help |
6 |
3,046 |
May-03-2019, 12:17 PM |
|
|
Thread: Re.search misses string end
Post: Re.search misses string end
Hi!
I have a strange problem. I have a PDF that I convert into a string using PDFminer. I change every "\n" to nothing (beause sometimes they appear at codebreaking places) I then search for a substr... |
|
CaptainCsaba |
General Coding Help |
3 |
3,293 |
Apr-01-2019, 02:12 PM |
|
|
Thread: Where is the wrong indentation?
Post: Where is the wrong indentation?
Hi!
I have the following code. I don't know how PDFminer works so the first part is somebody else code which I modified a bit. It seems to work but does everything 8 or more times instead of just one... |
|
CaptainCsaba |
General Coding Help |
2 |
2,285 |
Mar-05-2019, 03:13 PM |