[KinoSearch] How do you index MS Office (.doc, .xls, .ppt) files with KinoSearch?

Henry henka at cityweb.co.za
Mon Aug 25 06:42:32 PDT 2008



On Mon, August 25, 2008 1:12 pm, Ben Aurel wrote:
> My question is, what would you suggest for indexing Office formats?
> How do you extract text without OLE and an Office installation on
> the server?

You can use file conversion utilities such as wvHtml (Word), xlhtml (Excel)
and pdftotext (PDF) to extract the text first, and then index that output.
Most of these are far from perfect and sometimes crash.
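
Because the converters can crash or hang, it helps to run each one in its own
process with a timeout and simply skip the file if anything goes wrong. Below
is a minimal sketch in Python (not KinoSearch code). The helper name
run_converter and the 30-second default are assumptions for illustration; the
command line for each converter is passed in by the caller and has to match
whatever is installed on your server.

```python
import subprocess


def run_converter(argv, timeout=30):
    """Run an external converter and return what it wrote to stdout.

    argv is the full command line as a list. Returns None if the program
    is missing, exits with an error, is killed by a signal, or runs
    longer than `timeout` seconds.
    """
    try:
        result = subprocess.run(
            argv,
            capture_output=True,  # collect stdout and stderr
            timeout=timeout,      # give up on converters that hang
            check=True,           # treat a non-zero exit or crash as failure
        )
    except (OSError, subprocess.SubprocessError):
        # OSError: binary not found or not executable.
        # SubprocessError: covers CalledProcessError and TimeoutExpired.
        return None
    # Converter output is not guaranteed to be valid UTF-8.
    return result.stdout.decode("utf-8", errors="replace")
```

For example, pdftotext writes to stdout when the output file is given as "-",
so run_converter(["pdftotext", "report.pdf", "-"]) returns the extracted text
or None (report.pdf is a placeholder name). A None result means the file
should be skipped or logged rather than indexed.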

Regards
Henry


