Stemming:
Decide on a stemming strategy based on your use case. For more info on strategies, please see this article.
Porter stemming is highly recommended because of its simplicity and flexibility. It is usually sufficient to produce most stemming-based search results.
Porter is a transformation algorithm for the English language that reduces the different forms of a word, such as "walks", "walking", "walked" or "teach", "teaching", to their roots, "walk" and "teach".
Porter is based on rules and does not need a dictionary.
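To make the "rules, no dictionary" idea concrete, here is a minimal Python sketch of rule-based suffix stripping. It is not the full Porter algorithm and not the code Lucene runs: it only loosely follows Porter's first plural and past-tense/gerund rules, and the name `simple_stem` is made up for this illustration.

```python
import re

# Assumption for this sketch: a "vowel" is simply one of a, e, i, o, u.
VOWEL = re.compile(r"[aeiou]")


def simple_stem(word: str) -> str:
    """Strip a few common English suffixes using fixed rules (no dictionary)."""
    w = word.lower()

    # Plural rules, loosely following Porter step 1a.
    if w.endswith("sses") or w.endswith("ies"):
        w = w[:-2]
    elif w.endswith("s") and not w.endswith("ss"):
        w = w[:-1]

    # Past tense / gerund rules, loosely following Porter step 1b:
    # strip the suffix only if the remaining stem still contains a vowel.
    for suffix in ("ing", "ed"):
        if w.endswith(suffix):
            stem = w[: -len(suffix)]
            if VOWEL.search(stem):
                w = stem
            break

    return w


if __name__ == "__main__":
    for w in ["walks", "walking", "walked", "teach", "teaching"]:
        print(w, "->", simple_stem(w))
```

With these few rules, the example words from above all reduce to "walk" and "teach", while a word like "sing" is left alone because stripping "ing" would leave no vowel in the stem. The real Porter algorithm applies several more rule steps on top of this.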
Go to the /oak:index/ntHierarchyNode/analyzers/default/filters path in the repository.
Create a node with these properties:
- Name: PorterStem
- Type: nt:unstructured
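If you would rather script this step than click through the repository browser, the node could be created with an HTTP POST, assuming your instance exposes the default Sling POST servlet and your user may write under /oak:index. The sketch below only builds the request and does not send it. The host `http://localhost:4502` is taken from the test URL in this post, the function name `build_create_filter_request` is made up for this example, and authentication is left out because it depends on your environment.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Path from the step above; the new node is named PorterStem.
FILTERS_PATH = "/oak:index/ntHierarchyNode/analyzers/default/filters"


def build_create_filter_request(host: str = "http://localhost:4502") -> Request:
    """Build (but do not send) a POST that asks Sling to create the PorterStem node."""
    # The Sling POST servlet reads the node type from the jcr:primaryType parameter.
    body = urlencode({"jcr:primaryType": "nt:unstructured"}).encode("ascii")
    url = f"{host}{FILTERS_PATH}/PorterStem"
    return Request(url, data=body, method="POST")


if __name__ == "__main__":
    req = build_create_filter_request()
    print(req.get_method(), req.full_url)
    print(req.data.decode("ascii"))
```

To actually send it you would pass the request to `urllib.request.urlopen` with suitable credentials. Depending on your setup, the index may also need to be reindexed before the new filter takes effect.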
Finally, test the stemming functionality by:
- Going to the search page of the Geometrixx Demo site at http://localhost:4502/editor.html/content/geometrixx/en/toolbar/search.html
- Typing the keyword "teach". You should get results for the term "teaching" as well.
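The same check can be scripted against the QueryBuilder JSON servlet instead of the search page. This is only a sketch under assumptions: the search root `/content/geometrixx/en` and the function name `build_fulltext_query_url` are chosen for this example, and whether the query is actually served by the index configured above depends on your index definitions.

```python
from urllib.parse import urlencode


def build_fulltext_query_url(
    term: str,
    host: str = "http://localhost:4502",
    root: str = "/content/geometrixx/en",  # assumed search root for the demo site
) -> str:
    """Build a QueryBuilder URL that runs a full-text search for `term` under `root`."""
    params = {"path": root, "fulltext": term}
    return f"{host}/bin/querybuilder.json?{urlencode(params)}"


if __name__ == "__main__":
    # Open the printed URL in a logged-in browser session, or fetch it with credentials.
    print(build_fulltext_query_url("teach"))
```

If stemming is working, the hits returned for "teach" should include pages that only contain "teaching".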