All About Programming: [Update] Accessing words around a positional match in Lucene 4

The key differences from the previous examples are the use of an AtomicReader instance (see Uwe Schindler’s excellent talk on the subject for more details), some new parameters passed to the SpanTermQuery.getSpans() method I used, and, finally, the way term vectors are accessed.

In the getSpans() method, the first parameter essentially provides access to the reader, the second can be used to filter out documents, and the third, termContexts, can be used to improve performance when looking up terms.

The other big change is how Term Vectors are accessed in the span lookup loop. You no longer need TermVectorMapper instances; instead, you simply use instances of Terms, TermsEnum and DocsAndPositionsEnum.

When run, the full example prints some information about the actual query and then shows some context around each match. Note that you could also extend this to access things like payloads and more.
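To make the span lookup loop described above more concrete, here is a minimal sketch of the windowing logic in plain Java, with no Lucene dependency. It is not the original post's code: the class ContextWindow, its methods, and the sample sentence are hypothetical, and a sorted map from term to positions stands in for what Terms, TermsEnum and DocsAndPositionsEnum would expose for one document. The span end is assumed to be exclusive (one past the last matched position).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical stand-in for the term-vector loop; not Lucene code.
final class ContextWindow {

    // Builds a term -> positions map for one document, iterated in term
    // order (not document order), much like walking a term vector.
    static Map<String, List<Integer>> termVector(String[] tokens) {
        Map<String, List<Integer>> vector = new TreeMap<>();
        for (int pos = 0; pos < tokens.length; pos++) {
            vector.computeIfAbsent(tokens[pos], t -> new ArrayList<>()).add(pos);
        }
        return vector;
    }

    // Returns the terms located within `window` positions of the match,
    // in document order. spanEnd is treated as exclusive (an assumption).
    static List<String> window(Map<String, List<Integer>> vector,
                               int spanStart, int spanEnd, int window) {
        int lo = spanStart - window;
        int hi = spanEnd - 1 + window;
        // Terms arrive sorted by term text, so re-sort them by position.
        Map<Integer, String> byPosition = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> entry : vector.entrySet()) {
            for (int pos : entry.getValue()) {
                if (pos >= lo && pos <= hi) {
                    byPosition.put(pos, entry.getKey());
                }
            }
        }
        return new ArrayList<>(byPosition.values());
    }

    public static void main(String[] args) {
        String text = "mary had a little lamb whose fleece was white as snow";
        Map<String, List<Integer>> vector = termVector(text.split(" "));
        // "fleece" sits at position 6, so the span is [6, 7).
        System.out.println(window(vector, 6, 7, 2));
        // prints: [lamb, whose, fleece, was, white]
    }
}
```

The position-keyed TreeMap is the important design choice: because the terms are visited in term order, the surrounding words can only be put back into reading order by sorting on position.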