Aug-11-2024, 08:48 AM
Thanks for the reply!
After installing unstructured, I had to spend about an hour installing the other modules it needed, like unstructured-inference, pillow-heif and quite a few others. I think, in my situation, with a .pdf, converting to jpg and using tesseract was much, much easier and produced a good result!
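For anyone who wants to try the same route, here is a rough sketch of the pdf to jpg to tesseract steps. It assumes the pdf2image and pytesseract packages are installed, along with the Poppler and Tesseract programs they rely on. The function name ocr_pdf, the "pages" output folder and the 300 dpi setting are just my choices for illustration, not anything required.

```python
from pathlib import Path


def ocr_pdf(pdf_path, out_dir="pages", dpi=300, lang="eng"):
    """Render each PDF page to a JPEG, OCR it, and return all the text."""
    # Third-party imports live inside the function so this file can be
    # loaded even on a machine where they are not installed yet.
    from pdf2image import convert_from_path  # needs Poppler installed
    import pytesseract  # needs the Tesseract program installed

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    page_texts = []
    pages = convert_from_path(str(pdf_path), dpi=dpi)  # list of PIL images
    for number, image in enumerate(pages, start=1):
        # Keep a JPEG copy of the page (hypothetical naming scheme).
        image.save(out / f"page_{number:03d}.jpg", "JPEG")
        # OCR the in-memory image directly.
        page_texts.append(pytesseract.image_to_string(image, lang=lang))

    return "\n\n".join(page_texts)
```

Calling it would look something like text = ocr_pdf("scan.pdf"), where scan.pdf stands in for whatever file you have. A higher dpi tends to help with small print but makes the conversion slower.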
I don't know if nltk is a builtin. I did not install it, so maybe unstructured did? I only installed Python on this new laptop last week, so whatever I have should be fairly recent!