Python Forum
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
OCR again
#3
(Oct-29-2022, 08:16 AM)Gribouillis Wrote: Can't you convert all the tifs into jpg outside of the program by invoking e.g. Imagemagick's convert command?
Another trick would be to use a ramdisk as the target device.
@Gribouillis:
1) Yes I can convert all the tiffs outside the program: tried that. (using batch conversion in XNview, or even python itself ...)
But: say the extreme compressed tiffs take 1 terabyte, anything converted takes 4 TB. That is only temporary, and
it also takes a very long time to do. Not practical.
2) Ramdrive: not sure how to do that, will look it up. Thanks.

i've been looking at this for some time, and as you read about OCR and pytesseract,
most of the examples use cv2 to open images.
a) It is faster than PIL (written in C ?)
b) It has more features
c) the Image feature of PIL conflicts with tKinter if you're not careful.

BUT: as i discovered in the past hour, PIL seems to have one thing over cv2: it can (apparently) read more types of tif compression.

What I have to do now is test if the slower PIL, but able to read "exotic" compressions, would be faster/slower than eg. a ramdrive
approach with cv2.
Paul
It is more important to do the right thing, than to do the thing right.(P.Drucker)
Better is the enemy of good. (Montesquieu) = French version for 'kiss'.
Reply


Messages In This Thread
OCR again - by DPaul - Oct-29-2022, 06:49 AM
RE: OCR again - by Gribouillis - Oct-29-2022, 08:16 AM
RE: OCR again - by DPaul - Oct-29-2022, 08:39 AM
RE: OCR again - by Gribouillis - Oct-29-2022, 09:08 AM
RE: OCR again - by DPaul - Oct-29-2022, 09:33 AM
RE: OCR again - by Gribouillis - Oct-29-2022, 10:08 AM
RE: OCR again - by DPaul - Oct-29-2022, 10:17 AM
RE: OCR again - by DPaul - Oct-30-2022, 06:26 AM
RE: OCR again - by DPaul - Oct-30-2022, 07:36 AM
RE: OCR again - by wavic - Oct-31-2022, 08:41 AM
RE: OCR again - by DPaul - Oct-31-2022, 11:10 AM
RE: OCR again - by wavic - Oct-31-2022, 01:44 PM
RE: OCR again - by DPaul - Oct-31-2022, 04:31 PM
RE: OCR again - by wavic - Oct-31-2022, 05:51 PM
RE: OCR again - by DPaul - Oct-31-2022, 06:29 PM
RE: OCR again - by wavic - Oct-31-2022, 07:14 PM
RE: OCR again - by DPaul - Nov-01-2022, 06:44 AM
RE: OCR again - by DPaul - Nov-01-2022, 08:31 AM
RE: OCR again - by wavic - Nov-01-2022, 09:22 AM
RE: OCR again - by DPaul - Nov-01-2022, 10:14 AM
RE: OCR again - by DPaul - Nov-04-2022, 07:15 AM
RE: OCR again - by DPaul - Nov-05-2022, 08:05 AM
RE: OCR again - by DPaul - Nov-05-2022, 09:49 AM

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020