Google-like Fragmenter for Solr?

I am implementing a Solr application that originally used KinoSearch.

I have everything that has now moved to Solr and the results page, but I notice a difference in the results. In particular, the backlight is not exactly the same.

With KinoSearch, there is a KinoSearch :: Highlight :: Highlighter object that appears to create fragments similar to Google (tries to break sentences and add elipsis (...) separated by a space if it breaks the middle sentence).

Does anyone have any suggestions for implementing something similar with Solr. I tried a regex fragmenter to break into sentences, but it seems to actually apply the regular expression in reverse order and start fragments with the period of the previous sentence.

I can add elipsis logic to the view code. I am just wondering if anyone met something like this and how it was handled.

Thank!

+3
source share
1 answer

My question consisted of two parts. The first question regarding the search does not seem to follow the regular expression and sets a period before everything is examined here: http://lucene.472066.n3.nabble.com/Basic-sentence-parsing-with-the-regex- highlighter-fragmenter-td505749.html

The second elipsis problem that I am going to implement in front-end code.

I will leave this question open as I am still wondering if a better solution exists.

+1
source

All Articles