HELP:
I have several TIFF images I need to make it available on web as
searchable content.
Option 1:
I am planning of OCRing the images to text files. The file name
will be a unique ID. A DB table will contain meta data inforamtion
about the file. A search engine (??) can index these files. And the
results contain summary from the OCR text. And the resulting head
line will contain link to a JSP passing the UNIQUE ID as arguement
and JSP will read the tiff image for display. Thus I can track who
viewing the image and how many times.
Now any one know if there is a free search engine that can do this ?
I am looking into Lucene but I think it does not return summaries and
I am not sure if I can manipulate headlines..
Option 2: I can store the images and text in a DB (Oracle) is there
a search eingine that can index these ocred text files in db and
the result should be heading (from db meta data) linked to jsp with
doc id.
Example:
This is test <- from meta data
<p>This document contain result of your search... <- from ocr text