Python Forum
Search Results
Post Author Forum Replies Views Posted
    Thread: how to extract financial data from photocopy of document
Post: RE: how to extract financial data from photocopy o...

(Feb-13-2020, 12:18 AM)DeaD_EyE Wrote: These are embedded images. You need OCR to solve this problem. pytesseract is a wrapper around Tesseract. But the results are very poor (maybe my own mistake?)...
angela1 Data Science 6 3,618 Feb-14-2020, 11:22 AM
    Thread: how to extract financial data from photocopy of document
Post: RE: how to extract financial data from photocopy o...

(Feb-12-2020, 11:31 PM)jim2007 Wrote: Is it an actual PDF document or just an image embedded in a PDF? If so there is not much you can do. Is there a reason why you can’t use the XBRL format inst...
angela1 Data Science 6 3,618 Feb-14-2020, 07:52 AM
    Thread: how to extract financial data from photocopy of document
Post: RE: how to extract financial data from photocopy o...

(Feb-12-2020, 11:31 PM)jim2007 Wrote: Is it an actual PDF document or just an image embedded in a PDF? If so there is not much you can do. Is there a reason why you can’t use the XBRL format inst...
angela1 Data Science 6 3,618 Feb-14-2020, 04:05 AM
    Thread: how to extract financial data from photocopy of document
Post: how to extract financial data from photocopy of do...

I have a lot of company annual reports in PDF format, and they are scanned copies (an example is in link 1 below). I need to extract data from the financial statements from the PDF, such as 'revenue' ...
angela1 Data Science 6 3,618 Feb-12-2020, 11:21 AM
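
For readers landing on this thread: a minimal sketch of the OCR route suggested above, assuming each scanned page has already been saved as an image file. The helper names (ocr_image, parse_amount, extract_line_items) and the sample text are made up for illustration and are not from the thread. The only pytesseract call used is image_to_string, and it needs Tesseract, pytesseract and Pillow installed. The parsing step is plain regex matching and will likely need tuning, since OCR output from photocopies is often noisy.

```python
import re

# Matches amounts such as 1,234,567 or (1,234.5). Parentheses are treated as
# negatives, which is an assumption about how the statements are laid out.
NUMBER = re.compile(r"\(?\d[\d,]*(?:\.\d+)?\)?")


def ocr_image(path):
    """OCR one page image. Requires Tesseract, pytesseract and Pillow."""
    from PIL import Image
    import pytesseract
    return pytesseract.image_to_string(Image.open(path))


def parse_amount(token):
    """Convert a matched token such as '(800,000)' to a float."""
    negative = token.startswith("(") and token.endswith(")")
    value = float(token.strip("()").replace(",", ""))
    return -value if negative else value


def extract_line_items(text, labels):
    """Return {label: [amounts]} for the first line starting with each label."""
    found = {}
    for line in text.splitlines():
        stripped = line.strip()
        for label in labels:
            if label in found:
                continue
            if stripped.lower().startswith(label.lower()):
                rest = stripped[len(label):]
                amounts = [parse_amount(t) for t in NUMBER.findall(rest)]
                if amounts:
                    found[label] = amounts
    return found


# Hypothetical OCR output, standing in for ocr_image("page.png").
sample = """Consolidated income statement
Revenue 1,234,567 1,100,000
Cost of sales (800,000) (750,000)
"""
result = extract_line_items(sample, ["Revenue", "Cost of sales"])
```

With real scans, text = ocr_image("page.png") would replace the sample string. Matching on the start of a line keeps the sketch simple, but labels that OCR misreads (for example "Revenve") will be missed, so some fuzzy matching may be worth adding.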
