Python Forum
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Address Extraction
#4
I think you need to consider other ocr options.
It is possible to get the x-y coordinates of the word PROVIDER.
But then i would transform the pdf into an image and use tesseract for the OCR.
I remember seeing a post here a few weeks ago , where coordinates of any "word"
can also be found with another module.
I'll see if I can find it.
Paul

Edit: I think it is pdfplumber.
It is more important to do the right thing, than to do the thing right.(P.Drucker)
Better is the enemy of good. (Montesquieu) = French version for 'kiss'.
Reply


Messages In This Thread
Address Extraction - by standenman - Apr-06-2024, 03:47 PM
RE: Address Extraction - by DPaul - Apr-07-2024, 09:36 AM
RE: Address Extraction - by standenman - Apr-07-2024, 12:43 PM
RE: Address Extraction - by DPaul - Apr-07-2024, 05:20 PM
RE: Address Extraction - by Pedroski55 - Apr-08-2024, 04:45 PM
RE: Address Extraction - by DPaul - Apr-08-2024, 05:32 PM
RE: Address Extraction - by standenman - Apr-10-2024, 04:00 PM
RE: Address Extraction - by DPaul - Apr-10-2024, 05:22 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Strategy for data extraction standenman 1 694 Mar-11-2024, 01:44 PM
Last Post: carecavoador
  Python Machine Learning: For Data Extraction JaneTan 0 1,935 Nov-24-2020, 06:45 AM
Last Post: JaneTan
  Feature extraction algorithm lukaznt 1 2,676 Mar-02-2018, 05:16 AM
Last Post: Larz60+

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020