[KinoSearch] get doc/query similarity
Marvin Humphrey
marvin at rectangular.com
Wed Apr 16 11:04:20 PDT 2008
On Apr 16, 2008, at 7:43 AM, jack_tanner at yahoo.com wrote:
> Will do, and thanks for your time and advice. I still think it'd be
> nice if KS exposed such a similarity computation via the API; it'd
> be much more efficient that way.
I agree, and I would have liked to have discussed that. Had you not
been constrained by having to use the maint branch, I might have
steered things in that direction.
A lot of best work on KS, both high-level design and low-level code,
has arisen from collaborations between myself and someone who has an
itch to scratch. I'm always on the lookout for such potential partners.
In your case, though, my impression was that you were quite
knowledgeable, but that your project did not need the devel branch
badly enough to guarantee sustained momentum over the course of what
would likely be a drawn-out design discussion.
Exposing similarity measures would be superficially easy -- all the
relevant material is in KinoSearch::Search::Similarity. However, the
actual APIs to interface with the math in Similarity are internal and
not set up for use the way you described your needs. The bigger
problems were how to get at "an indexed document", how to list its
terms, and so on, outside of the context of the existing search API.
Marvin Humphrey
Rectangular Research
http://www.rectangular.com/
_______________________________________________
KinoSearch mailing list
KinoSearch at rectangular.com
http://www.rectangular.com/mailman/listinfo/kinosearch
More information about the kinosearch
mailing list