[KinoSearch] Highlighter Bug
Marvin Humphrey
marvin at rectangular.com
Tue Nov 16 15:52:39 PST 2010
On Tue, Nov 16, 2010 at 05:15:42PM +0100, Nick Wellnhofer wrote:
>
> I found a bug in the highlighter in KinoSearch 0.30_121 and 0.31. For
> some searches the excerpt doesn't contain any of the search terms, but
> only a sentence following the one that looks like it should have been
> picked.
>
> I had a quick look at the code in
> core/KinoSearch/Highlight/Highlighter.c and the test "candidate >= top"
> in Highlighter_raw_excerpt looks fishy to me. AFAICS this might cause
> sentences to be skipped.
You're probably right that that's where we're skipping ahead, but it's not
clear what aspect of the input is causing Raw_Excerpt() to malfunction. The
check inside that block is supposed to verify that the chosen excerpt contains
at least some relevant material -- that's what the S_has_heat() call is there
for.
This will be hard to debug without sample material. Once we have a document
which triggers the bad behavior, we can start throwing in debugging printf's.
If you can supply me with sample code which has the problem, I'll have a good
hard look.
Marvin Humphrey
More information about the kinosearch
mailing list