This time I have been given a truckload of pdfs.
They are scans of historical articles. Quality is good, OCR is OK.
I can search the texts programatically.
Here is where I could use some advice/ideas:
A researcher comes in and e.g. wants to find out which articles (pdfs)
contain the word "Napoleon" at least once. He wants to read the whole article for context.
I can do that, say, i find 10 'hits'.
Now I have to present the researcher with some type of media
with 10 hyperlinks to the 10 pdfs.
My initial thought is to make a html page on-the-fly,
start the default browser, somehow show the page if all that is possible?
When he/she wants to ask another question, get rid of the temporary html page, and do the same again.
We are NOT talking about an internet environment, just a desktop happening.
Any suggestions?
thx,
Paul
They are scans of historical articles. Quality is good, OCR is OK.
I can search the texts programatically.
Here is where I could use some advice/ideas:
A researcher comes in and e.g. wants to find out which articles (pdfs)
contain the word "Napoleon" at least once. He wants to read the whole article for context.
I can do that, say, i find 10 'hits'.
Now I have to present the researcher with some type of media
with 10 hyperlinks to the 10 pdfs.
My initial thought is to make a html page on-the-fly,
start the default browser, somehow show the page if all that is possible?
When he/she wants to ask another question, get rid of the temporary html page, and do the same again.
We are NOT talking about an internet environment, just a desktop happening.
Any suggestions?
thx,
Paul
It is more important to do the right thing, than to do the thing right.(P.Drucker)
Better is the enemy of good. (Montesquieu) = French version for 'kiss'.
Better is the enemy of good. (Montesquieu) = French version for 'kiss'.