[KinoSearch] Serialized Schema

Marvin Humphrey marvin at rectangular.com
Thu Oct 4 18:46:11 PDT 2007




On Oct 4, 2007, at 5:26 PM, Peter Karman wrote:

> Sounds like what you want isn't an official subset of the language,  
> but rather something like a SGML document type definition (called  
> (overload overload) a schema in XML parlance). Just an official  
> declaration of what constitutes a legal KS header.

I started messing around with defining what aspects of YAML are key.  
(http://www.rectangular.com/kinosearch/wiki/FileFormat).

Is there an XSD schema for the Swish format?  I haven't written one  
before, but being able to follow a schema for writing schemas  
(overload overload overload) appeals to me.

A switch to XML for KS metadata file serialization might be in  
order.  It was kind of a toss up between the two contenders.   But  
when I brought this up on the Lucene list a while ago, people were  
like "YAML, what's that?".  And Swish uses XML.  Might be time to go  
with the flow.  (Switching wouldn't even be disruptive, since we'd  
just look for the segments_XXX.whatever file and parse it according  
to the extension.)

I'd kind of like to stick with using a minimal custom parser rather  
than adding a full-on XML parser as a dependency.  That means placing  
restrictions on the XML akin to those I laid out for YAML.  You know  
whether spec'ing those sort of restrictions is something XSD is set  
up to handle?

Marvin Humphrey
Rectangular Research
http://www.rectangular.com/




_______________________________________________
KinoSearch mailing list
KinoSearch at rectangular.com
http://www.rectangular.com/mailman/listinfo/kinosearch




More information about the kinosearch mailing list