Jan-17-2019, 11:07 AM
see: https://pypi.org/search/?q=pdf+image+extract
minecart: https://pypi.org/project/minecart/ looks promising, sample code:
minecart: https://pypi.org/project/minecart/ looks promising, sample code:
>>> pdffile = open('example.pdf', 'rb') >>> doc = minecart.Document(pdffile) >>> page = doc.get_page(3) >>> for shape in page.shapes.iter_in_bbox((0, 0, 100, 200)): ... print shape.path, shape.fill.color.as_rgb() >>> im = page.images[0].as_pil() # requires pillow >>> im.show()