Version 1 (modified by grimnes, 18 years ago)

--

LuceneSail

The Lucene sail is a clever little hack that lets (parts of) RDF resource descriptions be indexed as full text.

There are two versions. Both are designed to be stackable sails: they must sit on top of another store, such as a memory or native store.

Each is configured with two lists of RDF predicates:

  • The Lucene inclusion list - any triple with one of these predicates has its value indexed.
  • The RDF exclusion list - any triple with one of these predicates is not added to the underlying Sail.

By default both lists are empty, so all triples are stored in RDF and none are indexed.
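
How the two lists interact can be sketched in a few lines of Java. This is a minimal model, not the LuceneSail API: the class PredicateRouter, its Triple type and its fields are hypothetical names made up for illustration, and plain lists stand in for the Lucene index and the underlying Sail. It also assumes the two lists are checked independently, so a predicate on both lists is indexed but not stored.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical model of the two predicate lists; not the real LuceneSail API.
class PredicateRouter {
    // Minimal triple holder, assumed for this sketch only.
    static final class Triple {
        final String subject;
        final String predicate;
        final String object;

        Triple(String subject, String predicate, String object) {
            this.subject = subject;
            this.predicate = predicate;
            this.object = object;
        }
    }

    final Set<String> luceneInclusion;                // predicates whose values are indexed
    final Set<String> rdfExclusion;                   // predicates kept out of the underlying Sail
    final List<Triple> indexed = new ArrayList<>();   // stands in for the Lucene index
    final List<Triple> stored = new ArrayList<>();    // stands in for the underlying Sail

    PredicateRouter(Set<String> luceneInclusion, Set<String> rdfExclusion) {
        this.luceneInclusion = luceneInclusion;
        this.rdfExclusion = rdfExclusion;
    }

    // Each list is checked on its own, so a triple may be indexed, stored, both, or neither.
    void add(Triple t) {
        if (luceneInclusion.contains(t.predicate)) {
            indexed.add(t);
        }
        if (!rdfExclusion.contains(t.predicate)) {
            stored.add(t);
        }
    }
}
```

With both sets empty, every triple lands in stored and nothing in indexed, which matches the default described above. Putting a predicate such as a long description property on both lists would keep its text searchable without storing it as RDF.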

LuceneSailAperture

This has a method addResource(SesameRDFContainer), which adds a single resource crawled by Aperture to the underlying sail. Each Aperture resource becomes a separate Lucene document.

LuceneSail

This will only index triples in a transaction. When a transaction is committed, each resource that was described becomes a separate Lucene document.
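
The per-transaction grouping can be illustrated with a small sketch. It is only a model of the behaviour described here, under the assumption that triples are buffered by subject until commit. TransactionBuffer and its methods are hypothetical names, not part of LuceneSail, and a list of "predicate=value" strings stands in for a Lucene document.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical model of grouping a transaction's triples into per-resource documents.
class TransactionBuffer {
    // subject -> "predicate=value" fields, in the order the triples were added
    private final Map<String, List<String>> pending = new LinkedHashMap<>();

    // Buffer one triple under its subject until the transaction is committed.
    void add(String subject, String predicate, String value) {
        pending.computeIfAbsent(subject, s -> new ArrayList<>())
               .add(predicate + "=" + value);
    }

    // Return one document per resource described in the transaction, then reset the buffer.
    Map<String, List<String>> commit() {
        Map<String, List<String>> documents = new LinkedHashMap<>(pending);
        pending.clear();
        return documents;
    }
}
```

Adding two triples about one resource and one triple about another, then calling commit(), yields two documents in this model, one per resource.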

Searching

There is a textQuery(String) method which returns Lucene Hits objects.